Papers
The Multilingual Corpus of World’s Constitutions (MCWC)
Mo El-Haj, Saad Ezzini
The Need for Grounding in LLM-based Dialogue Systems
Kristiina Jokinen
The Onomastic Repertoire of the Roman d’Alexandre (ORNARE). Designing an Integrated Digital Onomastic Tool for Medieval French Romance
Marta Milazzo, Giorgio Maria Di Nunzio
The Open-World Lottery Ticket Hypothesis for OOD Intent Classification
Yunhua Zhou, Pengyu Wang, Peiju Liu et al.
Theoretical and Empirical Advantages of Dense-Vector to One-Hot Encoding of Intent Classes in Open-World Scenarios
Paulo Cavalin, Claudio Santos Pinhanez
The ParCoLab Parallel Corpus and Its Extension to Four Regional Languages of France
Dejan Stosic, Saša Marjanović, Delphine Bernhard et al.
The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings
Michal Mochtak, Peter Rupnik, Nikola Ljubešić
The Relative Clauses AMR Parsers Hate Most
Xiulin Yang, Nathan Schneider
There’s Something New about the Italian Parliament: The IPSA Corpus
Valentino Frasnelli, Alessio Palmero Aprosio
The RIP Corpus of Collaborative Hypothesis-Making
Ella Schad, Jacky Visser, Chris Reed
The Rise and Fall of Dependency Parsing in Dante Alighieri’s Divine Comedy
Claudia Corbetta, Marco Passarotti, Giovanni Moretti
The Role of Creaky Voice in Turn Taking and the Perception of Speaker Stance: Experiments Using Controllable TTS
Harm Lameris, Eva Szekely, Joakim Gustafson
The Role of Syntactic Span Preferences in Post-Hoc Explanation Disagreement
Jonathan Kamp, Lisa Beinborn, Antske Fokkens
The SAMER Arabic Text Simplification Corpus
Bashar Alhafni, Reem Hazim, Juan David Pineros Liberato et al.
The Semantic Relations in LLMs: An Information-theoretic Compression Approach
Yu-Hsiang Tseng, Pin-Er Chen, Da-Chen Lian et al.
The Services of the LiLa Knowledge Base of Interoperable Linguistic Resources for Latin
Marco Passarotti, Francesco Mambrini, Giovanni Moretti
The Simplification of the Language of Public Administration: The Case of Ombudsman Institutions
Gabriel Gonzalez-Delgado, Borja Navarro-Colorado
The Slovak Autistic and Non-Autistic Child Speech Corpus: Task-Oriented Child-Adult Interactions
Joanna Kruyt, Róbert Sabo, Katarína Polónyiová et al.
The Swedish Parliament Corpus 1867 – 2022
Väinö Aleksi Yrjänäinen, Fredrik Mohammadi Norén, Robert Borges et al.
The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments
Nailia Mirzakhmedova, Johannes Kiesel, Milad Alshomary et al.
The UNLP 2024 Shared Task on Fine-Tuning Large Language Models for Ukrainian
Mariana Romanyshyn, Oleksiy Syvokon, Roman Kyslyi
The Vedic Compound Dataset
Sven Sellmer, Oliver Hellwig
This Word Mean What: Constructing a Singlish Dictionary with ChatGPT
Siew Yeng Chow, Chang-Uk Shin, Francis Bond