Papers
On Instruction-Finetuning Neural Machine Translation Models
Vikas Raunak, Roman Grundkiewicz, Marcin Junczys-Dowmunt
On Leakage of Code Generation Evaluation Datasets
Alexandre Matton, Tom Sherborne, Dennis Aumiller et al.
On Mitigating Performance Disparities in Multilingual Speech Recognition
Monorama Swain, Anna Katrine Van Zee, Anders Søgaard
On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices
Branislav Pecher, Ivan Srba, Maria Bielikova
On the alignment of LM language generation and human language comprehension
Lena Sophia Bolliger, Patrick Haller, Lena Ann Jäger
On the Empirical Complexity of Reasoning and Planning in LLMs
Liwei Kang, Zirui Zhao, David Hsu et al.
On the Fragility of Active Learners for Text Classification
Abhishek Ghose, Emma Thuong Nguyen
On the Generalization of Training-based ChatGPT Detection Methods
Han Xu, Jie Ren, Pengfei He et al.
On the In-context Generation of Language Models
Zhongtao Jiang, Yuanzhe Zhang, Kun Luo et al.
On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
Abhilasha Sancheti, Haozhe An, Rachel Rudinger
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization
Yong Lin, Skyler Seto, Maartje Ter Hoeve et al.
On the Proper Treatment of Tokenization in Psycholinguistics
Mario Giulianelli, Luca Malagutti, Juan Luis Gastaldi et al.
On the Relationship between Truth and Political Bias in Language Models
Suyash Fulay, William Brannon, Shrestha Mohanty et al.
On the Reliability of Psychological Scales on Large Language Models
Jen-tse Huang, Wenxiang Jiao, Man Ho Lam et al.
On the Rigour of Scientific Writing: Criteria, Analysis, and Insights
Joseph James, Chenghao Xiao, Yucheng Li et al.
On the Robustness of Editing Large Language Models
Xinbei Ma, Tianjie Ju, Jiyang Qiu et al.
On the Role of Context in Reading Time Prediction
Andreas Opedal, Eleanor Chodroff, Ryan Cotterell et al.
On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task
Javier Ferrando, Marta R. Costa-jussà
On the token distance modeling ability of higher RoPE attention dimension
Xiangyu Hong, Che Jiang, Biqing Qi et al.
On the Universal Truthfulness Hyperplane Inside LLMs
Junteng Liu, Shiqi Chen, Yu Cheng et al.
Ontologically Faithful Generation of Non-Player Character Dialogues
Nathaniel Weir, Ryan Thomas, Randolph d’Amore et al.
On Training Data Influence of GPT Models
Yekun Chai, Qingyi Liu, Shuohuan Wang et al.
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
Hasan Iqbal, Yuxia Wang, Minghan Wang et al.
OpenGraph: Towards Open Graph Foundation Models
Lianghao Xia, Ben Kao, Chao Huang
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak
Mukhammadsaid Mamasaidov, Abror Shopulatov