Papers
Scaling Intent Understanding: A Framework for Classification with Clarification using Lightweight LLMs
Subhadip Nandi, Tanishka Agarwal, Anshika Singh et al.
Beyond IVR: Benchmarking Customer Support LLM Agents for Business-Adherence
Sumanth Balaji, Piyush Mishra, Aashraya Sachdeva et al.
LingVarBench: Benchmarking LLMs on Entity Recognitions and Linguistic Verbalization Patterns in Phone-Call Transcripts
Seyedali Mohammadi, Manas Paldhe, Amit Chhabra et al.
The Subtle Art of Defection: Understanding Uncooperative Behaviors in LLM based Multi-Agent Systems
Devang Kulshreshtha, Wanyu Du, Raghav Jain et al.
Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning
Minseok Kim, Jingxiang Chen, Seong-Gyun Leem et al.
ELO: Efficient Layer-Specific Optimization for Continual Pretraining of Multilingual LLMs
Hangyeol Yoo, ChangSu Choi, Minjun Kim et al.
A Hybrid Supervised-LLM Pipeline for Actionable Suggestion Mining in Unstructured Customer Reviews
Aakash Trivedi, Aniket Upadhyay, Pratik Narang et al.
Balanced Accuracy: The Right Metric for Evaluating LLM Judges - Explained through Youden’s J statistic
Stephane Collot, Colin Fraser, Justin Zhao et al.
Being Kind Isn’t Always Being Safe: Diagnosing Affective Hallucination in LLMs
Sewon Kim, Jiwon Kim, SeungWoo Shin et al.
Position Paper: How Should We Responsibly Adopt LLMs in the Peer Review Process?
Juhwan Choi, JungMin Yun, Changhun Kim et al.
Continual Pretraining on Encrypted Synthetic Data for Privacy-Preserving LLMs
Honghao Liu, Xuhui Jiang, Chengjin Xu et al.
Do Diacritics Matter? Evaluating the Impact of Arabic Diacritics on Tokenization and LLM Benchmarks
Go Inoue, Bashar Alhafni, Nizar Habash et al.
VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy
Yu Cui, Sicheng Pan, Yifei Liu et al.
Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs
Paiheng Xu, Gang Wu, Xiang Chen et al.
Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
Zara Siddique, Irtaza Khalid, Liam Turner et al.
Harmful Factuality: LLMs Correcting What They Shouldn’t
Mingchen Li, Hanzhi Zhang, Heng Fan et al.
Toward Beginner-Friendly LLMs for Language Learning: Controlling Difficulty in Conversation
Meiqing Jin, Liam Dugan, Chris Callison-Burch
CodeGuard: Improving LLM Guardrails in CS Education
Nishat Raihan, Noah Erdachew, Jayoti Devi et al.
ATOM: AdapTive and OptiMized dynamic temporal knowledge graph construction using LLMs
Yassir Lairgi, Ludovic Moncla, Khalid Benabdeslem et al.
Where do LLMs currently stand on biomedical NER in both clean and noisy settings ?
Christophe Ye, Cassie S. Mitchell
The Unintended Trade-off of AI Alignment: Balancing Hallucination Mitigation and Safety in LLMs
Omar Mahmoud, Ali Khalil, Thommen George Karimpanal et al.
The Model’s Language Matters: A Comparative Privacy Analysis of LLMs
Abhishek Kumar Mishra, Antoine Boutet, Lucas Magnana
LLMs Faithfully and Iteratively Compute Answers During CoT: A Systematic Analysis With Multi-step Arithmetics
Keito Kudo, Yoichi Aoki, Tatsuki Kuribayashi et al.
Intention-Adaptive LLM Fine-Tuning for Text Revision Generation
Zhexiong Liu, Diane Litman
Don’t Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation
Jiwon Moon, Yerin Hwang, Dongryeol Lee et al.