Papers

2,781 papers found

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

Zorik Gekhman, Gal Yona, Roee Aharoni et al.

2024 EMNLP

Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification

Pritish Sahu, Karan Sikka, Ajay Divakaran

2024 EMNLP

Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel

Brendan King, Jeffrey Flanigan

2024 EMNLP

Humans or LLMs as the Judge? A Study on Judgement Bias

Guiming Hardy Chen, Shunian Chen, Ziche Liu et al.

2024 EMNLP

Knowledge Conflicts for LLMs: A Survey

Rongwu Xu, Zehan Qi, Zhijiang Guo et al.

2024 EMNLP

A Thorough Examination of Decoding Methods in the Era of LLMs

Chufan Shi, Haoran Yang, Deng Cai et al.

2024 EMNLP

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Liyan Tang, Philippe Laban, Greg Durrett

2024 EMNLP

Learning to Correct for QA Reasoning with Black-box LLMs

Jaehyung Kim, Dongyoung Kim, Yiming Yang

2024 EMNLP

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs

Sheridan Feucht, David Atkinson, Byron C Wallace et al.

2024 EMNLP

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Philippe Laban, Alexander Fabbri, Caiming Xiong et al.

2024 EMNLP

ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs

Changchun Liu, Kai Zhang, Junzhe Jiang et al.

2024 EMNLP

LLMs Are Prone to Fallacies in Causal Inference

Nitish Joshi, Abulhair Saparov, Yixin Wang et al.

2024 EMNLP

Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations

Nicolò Penzo, Maryam Sajedinia, Bruno Lepri et al.

2024 EMNLP

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs

Haritz Puerto, Martin Tutek, Somak Aditya et al.

2024 EMNLP

PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation

Christoph Leiter, Steffen Eger

2024 EMNLP

Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs

Houman Mehrafarin, Arash Eshghi, Ioannis Konstas

2024 EMNLP

Pragmatic Norms Are All You Need – Why The Symbol Grounding Problem Does Not Apply to LLMs

Reto Gubelmann

2024 EMNLP

Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications

Weize Liu, Yinlong Xu, Hongxia Xu et al.

2024 EMNLP

TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts

Ruida Wang, Jipeng Zhang, Yizhen Jia et al.

2024 EMNLP

Subword Segmentation in LLMs: Looking at Inflection and Consistency

Marion Di Marco, Alexander Fraser

2024 EMNLP

Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models

Yu Yuan, Lili Zhao, Kai Zhang et al.

2024 EMNLP

Subjective Topic meets LLMs: Unleashing Comprehensive, Reflective and Creative Thinking through the Negation of Negation

Fangrui Lv, Kaixiong Gong, Jian Liang et al.

2024 EMNLP

Why Does New Knowledge Create Messy Ripple Effects in LLMs?

Jiaxin Qin, Zixuan Zhang, Chi Han et al.

2024 EMNLP

“Global is Good, Local is Bad?”: Understanding Brand Bias in LLMs

Mahammed Kamruzzaman, Hieu Minh Nguyen, Gene Louis Kim

2024 EMNLP

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

John Dang, Arash Ahmadian, Kelly Marchisio et al.

2024 EMNLP