Papers
2,781 papers found
Scaling Intent Understanding: A Framework for Classification with Clarification using Lightweight LLMs
Subhadip Nandi, Tanishka Agarwal, Anshika Singh et al.
LingVarBench: Benchmarking LLMs on Entity Recognitions and Linguistic Verbalization Patterns in Phone-Call Transcripts
Seyedali Mohammadi, Manas Paldhe, Amit Chhabra et al.
Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning
Minseok Kim, Jingxiang Chen, Seong-Gyun Leem et al.
ELO: Efficient Layer-Specific Optimization for Continual Pretraining of Multilingual LLMs
Hangyeol Yoo, ChangSu Choi, Minjun Kim et al.
Being Kind Isn’t Always Being Safe: Diagnosing Affective Hallucination in LLMs
Sewon Kim, Jiwon Kim, SeungWoo Shin et al.
Position Paper: How Should We Responsibly Adopt LLMs in the Peer Review Process?
Juhwan Choi, JungMin Yun, Changhun Kim et al.
Continual Pretraining on Encrypted Synthetic Data for Privacy-Preserving LLMs
Honghao Liu, Xuhui Jiang, Chengjin Xu et al.
VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy
Yu Cui, Sicheng Pan, Yifei Liu et al.
Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs
Paiheng Xu, Gang Wu, Xiang Chen et al.
Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
Zara Siddique, Irtaza Khalid, Liam Turner et al.
Harmful Factuality: LLMs Correcting What They Shouldn’t
Mingchen Li, Hanzhi Zhang, Heng Fan et al.
Toward Beginner-Friendly LLMs for Language Learning: Controlling Difficulty in Conversation
Meiqing Jin, Liam Dugan, Chris Callison-Burch
ATOM: AdapTive and OptiMized dynamic temporal knowledge graph construction using LLMs
Yassir Lairgi, Ludovic Moncla, Khalid Benabdeslem et al.
Where do LLMs currently stand on biomedical NER in both clean and noisy settings ?
Christophe Ye, Cassie S. Mitchell
The Unintended Trade-off of AI Alignment: Balancing Hallucination Mitigation and Safety in LLMs
Omar Mahmoud, Ali Khalil, Thommen George Karimpanal et al.
The Model’s Language Matters: A Comparative Privacy Analysis of LLMs
Abhishek Kumar Mishra, Antoine Boutet, Lucas Magnana
LLMs Faithfully and Iteratively Compute Answers During CoT: A Systematic Analysis With Multi-step Arithmetics
Keito Kudo, Yoichi Aoki, Tatsuki Kuribayashi et al.
Breaking the Illusion of Reasoning in Polish LLMs: Quality over Quantity of Thought
Dzmitry Pihulski, Mikołaj Langner, Jan Eliasz et al.
Feature Drift: How Fine-Tuning Repurposes Representations in LLMs
Andrey V. Galichin, Anton Korznikov, Alexey Dontsov et al.
MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Dialogue Evaluators
John Mendonça, Alon Lavie, Isabel Trancoso
Aggregating Crowd of LLMs for Cost-Effective Data Annotation
Jiacheng Liu, Xiaofeng Hou
Can LLMs Reason Like Doctors? Exploring the Limits of Large Language Models in Complex Medical Reasoning
Flavio Merenda, Jose Manuel Gomez-Perez, German Rigau
Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish
Cedric Lothritz, Jordi Cabot, Laura Bernardy
Unveiling Decision-Making in LLMs for Text Classification : Extraction of influential and interpretable concepts with Sparse Autoencoders
Mathis Le Bail, Jérémie Dentan, Davide Buscaldi et al.
Are Multimodal LLMs Movie Buffs?
Carlo Bretti, Pascal Mettes, Nanne Van Noord