Papers
2,781 papers found
LLMs as Cultural Archives: Cultural Commonsense Knowledge Graph Extraction
Junior Cedric Tonga, Chen Cecilia Liu, Iryna Gurevych et al.
Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs
Pranav Bhandari, Nicolas Fay, Sanjeevan Selvaganapathy et al.
KG-CRAFT: Knowledge Graph-based Contrastive Reasoning with LLMs for Enhancing Automated Fact-checking
Vítor Lourenço, Aline Paes, Tillman Weyde et al.
Elections go bananas: A First Large-scale Multilingual Study of Pluralia Tantum using LLMs
Elena Spaziani, Kamyar Zeinalipour, Pierluigi Cassotti et al.
How Do LLMs Generate Contrastive Sentiments? A Mechanistic Perspective
Van Bach Nguyen, Jörg Schlötterer, Christin Seifert
H3Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Selim Furkan Tekin, Fatih Ilhan, Sihao Hu et al.
Beyond Understanding: Evaluating the Pragmatic Gap in LLMs’ Cultural Processing of Figurative Language
Mena Attia, Aashiq Muhamed, Mai Alkhamissi et al.
Do You See Me : A Multidimensional Benchmark for Evaluating Visual Perception in Multimodal LLMs
Aditya Sanjiv Kanade, Tanuja Ganu
A Review of Incorporating Psychological Theories in LLMs
Zizhou Liu, Ziwei Gong, Lin Ai et al.
How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities
Aly M. Kassem, Bernhard Schölkopf, Zhijing Jin
Tokenizer-Aware Cross-Lingual Adaptation of Decoder-Only LLMs through Embedding Relearning and Swapping
Fan Jiang, Honglin Yu, Grace Y Chung et al.
Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs
James Beetham, Souradip Chakraborty, Mengdi Wang et al.
Are All Prompt Components Value-Neutral? Understanding the Heterogeneous Adversarial Robustness of Dissected Prompt in LLMs
Yujia Zheng, Tianhao Li, Haotian Huang et al.
What Does Infect Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs
Xinlan Yan, Di Wu, Yibin Lei et al.
Redefining Retrieval Evaluation in the Era of LLMs
Giovanni Trappolini, Florin Cuconasu, Simone Filice et al.
Korean Canonical Legal Benchmark: Toward Knowledge-Independent Evaluation of LLMs’ Legal Reasoning Capabilities
Hongseok Oh, Wonseok Hwang, Kyoung-Woon On
Measuring Linguistic Competence of LLMs on Indigenous Languages of the Americas
Justin Vasselli, Arturo Mp, Frederikus Hudi et al.
Beyond Tokens: Concept-Level Training Objectives for LLMs
Laya Iyer, Pranav Somani, Alice Guo et al.
Persuasion Tokens for Editing Factual Knowledge in LLMs
Paul Youssef, Christin Seifert, Jörg Schlötterer
Funny or Persuasive, but Not Both: Evaluating Fine-Grained Multi-Concept Control in LLMs
Arya Labroo, Ivaxi Sheth, Vyas Raina et al.
LLMs Know More About Numbers than They Can Say
Fengting Yuchi, Li Du, Jason Eisner
From Detection to Explanation: Modeling Fine-Grained Emotional Social Influence Techniques with LLMs and Human Preferences
Maciej Markiewicz, Wiktoria Mieleszczenko-Kowszewicz, Beata Bajcar et al.
Evaluating Cost-Efficiency of LLMs in a RAG Setup on Polish Wikipedia: Quality vs. Energy Consumption
Patrycja Smits, Tomasz Walkowiak
Evaluating the Pre-Consultation Ability of LLMs using Diagnostic Guidelines
Jean Seo, Gibaeg Kim, Kihun Shin et al.