Papers
Large Temporal Models: Unlocking Temporal Understanding in LLMs for Temporal Relation Classification
Omri Homburger, Kfir Bar
Interpreting the Effects of Quantization on LLMs
Manpreet Singh, Hassan Sajjad
Can AI Validate Science? Benchmarking LLMs on Claim → Evidence Reasoning in AI Papers
Shashidhar Reddy Javaji, Yupeng Cao, Haohang Li et al.
Pragmatic Theories Enhance Understanding of Implied Meanings in LLMs
Takuma Sato, Seiya Kawano, Koichiro Yoshino
From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning
Chalamalasetti Kranti, Sherzod Hakimov, David Schlangen
Small Changes, Large Consequences: Analyzing the Allocational Fairness of LLMs in Hiring Contexts
Preethi Seshadri, Hongyu Chen, Sameer Singh et al.
Do Persona-Infused LLMs Affect Performance in a Strategic Reasoning Game?
John Licato, Stephen Steinle
PII-Scope: A Comprehensive Study on Training Data Privacy Leakage in Pretrained LLMs
Krishna Kanth Nakka, Ahmed Frikha, Ricardo Mendes et al.
To Labor is Not to Suffer: Exploration of Polarity Association Bias in LLMs for Sentiment Analysis
Jiyu Chen, Sarvnaz Karimi, Diego Molla et al.
Gatsby without the ‘E’: Creating Lipograms with LLMs
Nitish Gokulakrishnan, Rohan Balasubramanian, Syeda Jannatus Saba et al.
Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?
Kunal Kingkar Das, Manoj Balaji Jagadeeshan, Nallani Chakravartula Sahith et al.
Broken Words, Broken Performance: Effect of Tokenization on Performance of LLMs
Sachin Pawar, Manoj Apte, Kshitij Jadhav et al.
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
Chloe Li, Noah Y. Siegel
Testing Simulation Theory in LLMs’ Theory of Mind
Koshiro Aoki, Daisuke Kawahara
Two Step Automatic Post Editing of Patent Machine Translation based on Pre-trained Encoder Models and LLMs
Kosei Buma, Takehito Utsuro, Masaaki Nagata
Are LLMs Good for Semantic Role Labeling via Question Answering?: A Preliminary Analysis
Ritwik Raghav, Abhik Jana
VariantBench: A Framework for Evaluating LLMs on Justifications for Genetic Variant Interpretation
Humair Basharat, Simon Plotkin, Charlotte Le et al.
Tutorial on Trustworthy Legal Text Processing with LLMs: Retrieval, Rhetorical Roles, Summarization, and Trustworthy Generation
Anand Kumar M, Sangeetha S, Manikandan R et al.
Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs
Peng Yifeng, Zhizheng Wu, Chen Chen
LLMs as Architects and Critics for Multi-Source Opinion Summarization
Anuj Attri, Arnav Attri, Suman Banerjee et al.
Atomic Calibration of LLMs in Long-Form Generations
Caiqi Zhang, Ruihan Yang, Zhisong Zhang et al.
Estimating Causal Effects of Text Interventions Leveraging LLMs
Siyi Guo, Myrl G Marmarelis, Fred Morstatter et al.
Smruti: Grammatical Error Correction for Gujarati using LLMs with Non-Parametric Memory
Vrund Dobariya, Jatayu Baxi, Bhavika Gambhava et al.
Emotion-Aware Dysarthric Speech Reconstruction: LLMs and Multimodal Evaluation with MCDS
Kaushal Attaluri, Radhika Mamidi, Sireesha Chittepu et al.
Learning from Hallucinations: Mitigating Hallucinations in LLMs via Internal Representation Intervention
Sora Kadotani, Kosuke Nishida, Kyosuke Nishida