Research Explorer

FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs

Shamik Roy, Sailik Sengupta, Daniele Bonadiman et al.

2024 NAACL

E5: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit and Extrapolate

Zhehao Zhang, Yan Gao, Jian-Guang Lou

2024 NAACL

Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings

Chen Cecilia Liu, Fajri Koto, Timothy Baldwin et al.

2024 NAACL

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning

Xuansheng Wu, Wenlin Yao, Jianshu Chen et al.

2024 NAACL

How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities

Lingbo Mo, Boshi Wang, Muhao Chen et al.

2024 NAACL

Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks

Ting-Yun Chang, Jesse Thomason, Robin Jia

2024 NAACL

Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications

Yanchen Liu, Srishti Gautam, Jiaqi Ma et al.

2024 NAACL

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks

Chonghua Wang, Haodong Duan, Songyang Zhang et al.

2024 NAACL

On-the-fly Definition Augmentation of LLMs for Biomedical NER

Monica Munnangi, Sergey Feldman, Byron Wallace et al.

2024 NAACL

Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey

Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi et al.

2024 NAACL

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Liyan Tang, Igor Shalyminov, Amy Wong et al.

2024 NAACL

Flames: Benchmarking Value Alignment of LLMs in Chinese

Kexin Huang, Xiangyang Liu, Qianyu Guo et al.

2024 NAACL

Fake Alignment: Are LLMs Really Aligned Well?

Yixu Wang, Yan Teng, Kexin Huang et al.

2024 NAACL

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs

Yassir Fathullah, Chunyang Wu, Egor Lakomkin et al.

2024 NAACL

Do Large Language Models Rank Fairly? An Empirical Study on the Fairness of LLMs as Rankers

Yuan Wang, Xuyang Wu, Hsin-Tai Wu et al.

2024 NAACL

TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition

Md Mahadi Hasan Nahid, Davood Rafiei

2024 NAACL

DialogBench: Evaluating LLMs as Human-like Dialogue Systems

Jiao Ou, Junda Lu, Che Liu et al.

2024 NAACL

Beyond Performance: Quantifying and Mitigating Label Bias in LLMs

Yuval Reif, Roy Schwartz

2024 NAACL

Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method

Yukun Zhao, Lingyong Yan, Weiwei Sun et al.

2024 NAACL

Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval

Nandan Thakur, Jianmo Ni, Gustavo Hernandez Abrego et al.

2024 NAACL

Actively Learn from LLMs with Uncertainty Propagation for Generalized Category Discovery

Jinggui Liang, Lizi Liao, Hao Fei et al.

2024 NAACL

SKICSE: Sentence Knowable Information Prompted by LLMs Improves Contrastive Sentence Embeddings

Fangwei Ou, Jinan Xu

2024 NAACL

Unveiling Divergent Inductive Biases of LLMs on Temporal Data

Sindhu Kishore, Hangfeng He

2024 NAACL

Llama meets EU: Investigating the European political spectrum through the lens of LLMs

Ilias Chalkidis, Stephanie Brandl

2024 NAACL

CPopQA: Ranking Cultural Concept Popularity by LLMs

Ming Jiang, Mansi Joshi

2024 NAACL

Papers