Papers
Attacker’s Noise Can Manipulate Your Audio-based LLM in the Real World
Vinu Sankar Sadasivan, Soheil Feizi, Rajiv Mathews et al.
Say It Another Way: Auditing LLMs with a User-Grounded Automated Paraphrasing Framework
Clea Chataigner, Rebecca Ma, Prakhar Ganesh et al.
AutoBool: Reinforcement-Learned LLM for Effective Automatic Systematic Reviews Boolean Query Generation
Shuai Wang, Harrisen Scells, Bevan Koopman et al.
Improving LLM Domain Certification with Pretrained Guide Models
Jiaqian Zhang, Zhaozhi Qian, Faroq AL-Tam et al.
Coordinates from Context: Using LLMs to Ground Complex Location References
Tessa Masis, Brendan O'Connor
SearchLLM: Detecting LLM Paraphrased Text by Measuring the Similarity with Regeneration of the Candidate Source via Search Engine
Hoang-Quoc Nguyen-Son, Minh-Son Dao, Koji Zettsu
Unraveling LLM Jailbreaks Through Safety Knowledge Neurons
Chongwen Zhao, Yutong Ke, Kaizhu Huang
Knowledge Extraction on Semi-Structured Content: Does It Remain Relevant for Question Answering in the Era of LLMs?
Kai Sun, Yin Huang, Srishti Mehra et al.
Don’t Judge a Book by its Cover: Testing LLMs’ Robustness Under Logical Obfuscation
Abhilekh Borah, Shubhra Ghosh, Kedar Joshi et al.
Reasoning or Knowledge: Stratified Evaluation of Biomedical LLMs
Rahul Thapa, Qingyang Wu, Kevin Wu et al.
AfriVox: Probing Multilingual and Accent Robustness of Speech LLMs
Busayo Awobade, Mardhiyah Sanni, Tassallah Abdullahi et al.
PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs
Manuel Frank, Haithem Afli
How Good Are LLMs at Processing Tool Outputs?
Kiran Kate, Yara Rizk, Poulami Ghosh et al.
Tug-of-war between idioms’ figurative and literal interpretations in LLMs
Soyoung Oh, Xinting Huang, Mathis Pink et al.
Do LLM hallucination detectors suffer from low-resource effect?
Debtanu Datta, Mohan Kishore Chilukuri, Yash Kumar et al.
MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
Arkadiusz Modzelewski, Witold Sosnowski, Eleni Papadopulos et al.
Polyglots or Multitudes? Multilingual LLM Answers to Value-laden Multiple-Choice Questions
Léo Labat, Etienne Ollion, François Yvon
Martingale Foresight Sampling: A Principled Approach to Inference-Time LLM Decoding
Huayu Li, ZhengXiao He, Siyuan Tian et al.
Is This LLM Library Learning? Evaluation Must Account For Compute and Behaviour
Ian Berlot-Attwell, Tobias Sesterhenn, Frank Rudzicz et al.
Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools
Ha Min Son, Huan Ren, Xin Liu et al.
Word Surprisal Correlates with Sentential Contradiction in LLMs
Ning Shi, Bradley Hauer, David Basil et al.
Where Do LLMs Compose Meaning? A Layerwise Analysis of Compositional Robustness
Nura Aljaafari, Danilo Carvalho, Andre Freitas
Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
Wafaa Mohammed, Vlad Niculae, Chrysoula Zerva