Papers
5,479 papers found
Decode Like a Clinician: Enhancing LLM Fine-Tuning with Temporal Structured Data Representation
Daniel Fadlon, David Dov, Aviya Bennett et al.
The Confidence Paradox: Can LLM Know When It’s Wrong?
Sahil Tripathi, MD Tabrez Nafis, Imran Hussain et al.
Large Temporal Models: Unlocking Temporal Understanding in LLMs for Temporal Relation Classification
Omri Homburger, Kfir Bar
Interpreting the Effects of Quantization on LLMs
Manpreet Singh, Hassan Sajjad
What Would You Ask When You First Saw a2+b2=c2? Evaluating LLM on Curiosity-Driven Question Generation
Shashidhar Reddy Javaji, Zining Zhu
Can AI Validate Science? Benchmarking LLMs on Claim →Evidence Reasoning in AI Papers
Shashidhar Reddy Javaji, Yupeng Cao, Haohang Li et al.
More Than a Score: Probing the Impact of Prompt Specificity on LLM Code Generation
Yangtian Zi, Harshitha Menon, Arjun Guha
Pragmatic Theories Enhance Understanding of Implied Meanings in LLMs
Takuma Sato, Seiya Kawano, Koichiro Yoshino
Crypto-LLM: Two-Stage Language Model Pre-training with Ciphered and Natural Language Data
Yohei Kobashi, Fumiya Uchiyama, Takeshi Kojima et al.
From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning
Chalamalasetti Kranti, Sherzod Hakimov, David Schlangen
Small Changes, Large Consequences: Analyzing the Allocational Fairness of LLMs in Hiring Contexts
Preethi Seshadri, Hongyu Chen, Sameer Singh et al.
Revisiting Word Embeddings in the LLM Era
Yash Mahajan, Matthew Freestone, Naman Bansal et al.
Agnus LLM: Robust and Flexible Entity Disambiguation with decoder-only Language Models
Kristian Noullet, Ayoub Ourgani, Niklas Thomas Lakner et al.
Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate
Ashim Gupta, Maitrey Mehta, Zhichao Xu et al.
Do Persona-Infused LLMs Affect Performance in a Strategic Reasoning Game?
John Licato, Stephen Steinle
PII-Scope: A Comprehensive Study on Training Data Privacy Leakage in Pretrained LLMs
Krishna Kanth Nakka, Ahmed Frikha, Ricardo Mendes et al.
To Labor is Not to Suffer: Exploration of Polarity Association Bias in LLMs for Sentiment Analysis
Jiyu Chen, Sarvnaz Karimi, Diego Molla et al.
Gatsby without the ‘E’: Creating Lipograms with LLMs
Nitish Gokulakrishnan, Rohan Balasubramanian, Syeda Jannatus Saba et al.
Still Not There: Can LLMs Outperform Smaller Task-Specific Seq2Seq Models on the Poetry-to-Prose Conversion Task?
Kunal Kingkar Das, Manoj Balaji Jagadeeshan, Nallani Chakravartula Sahith et al.
Broken Words, Broken Performance: Effect of Tokenization on Performance of LLMs
Sachin Pawar, Manoj Apte, Kshitij Jadhav et al.
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
Chloe Li, Noah Y. Siegel
Improving LLM’s Attachment to External Knowledge In Dialogue Generation Tasks Through Entity Anonymization
Hadi Sheikhi, Chenyang Huang, Osmar Zaiane
Testing Simulation Theory in LLMs’ Theory of Mind
Koshiro Aoki, Daisuke Kawahara
Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning
Wendy Yaqiao Liu, Rui Jerry Huang, Anastasia Miin et al.
Two Step Automatic Post Editing of Patent Machine Translation based on Pre-trained Encoder Models and LLMs
Kosei Buma, Takehito Utsuro, Masaaki Nagata