Research Explorer

Balancing Forget Quality and Model Utility: A Reverse KL-Divergence Knowledge Distillation Approach for Better Unlearning in LLMs

Bichen Wang, Yuzhe Zi, Yixin Sun et al.

2025 NAACL

Can LLMs Convert Graphs to Text-Attributed Graphs?

Zehong Wang, Sidney Liu, Zheyuan Zhang et al.

2025 NAACL

What Did I Do Wrong? Quantifying LLMs’ Sensitivity and Consistency to Prompt Engineering

Federico Errica, Davide Sanvito, Giuseppe Siracusano et al.

2025 NAACL

SafetyQuizzer: Timely and Dynamic Evaluation on the Safety of LLMs

Zhichao Shi, Shaoling Jing, Yi Cheng et al.

2025 NAACL

The Impact of Inference Acceleration on Bias of LLMs

Elisabeth Kirsten, Ivan Habernal, Vedant Nanda et al.

2025 NAACL

Fine-Tuned LLMs are “Time Capsules” for Tracking Societal Bias Through Books

Sangmitra Madhusudan, Robert Morabito, Skye Reid et al.

2025 NAACL

Have LLMs Reopened the Pandora’s Box of AI-Generated Fake News?

Xinyu Wang, Wenbo Zhang, Sai Koneru et al.

2025 NAACL

Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs?

So Young Lee, Russell Scheinberg, Amber Shore et al.

2025 NAACL

An Efficient Gloss-Free Sign Language Translation Using Spatial Configurations and Motion Dynamics with LLMs

Eui Jun Hwang, Sukmin Cho, Junmyeong Lee et al.

2025 NAACL

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

Yifan Song, Guoyin Wang, Sujian Li et al.

2025 NAACL

MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria

Wentao Ge, Shunian Chen, Hardy Chen et al.

2025 NAACL

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Yu Zhao, Alessio Devoto, Giwon Hong et al.

2025 NAACL

CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs

Amey Hengle, Aswini Kumar Padhi, Anil Bandhakavi et al.

2025 NAACL

RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models

Bang An, Shiyue Zhang, Mark Dredze

2025 NAACL

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Daniil Moskovskiy, Nikita Sushko, Sergey Pletenev et al.

2025 NAACL

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

Chung-En Sun, Xiaodong Liu, Weiwei Yang et al.

2025 NAACL

Rethinking the Role of LLMs for Document-level Relation Extraction: a Refiner with Task Distribution and Probability Fusion

Fu Zhang, Xinlong Jin, Jingwei Cheng et al.

2025 NAACL

CharacterBox: Evaluating the Role-Playing Capabilities of LLMs in Text-Based Virtual Worlds

Lei Wang, Jianxun Lian, Yi Huang et al.

2025 NAACL

TRANSIENTTABLES: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables

Abhilash Shankarampeta, Harsh Mahajan, Tushar Kataria et al.

2025 NAACL

JRE-L: Journalist, Reader, and Editor LLMs in the Loop for Science Journalism for the General Audience

Gongyao Jiang, Xinran Shi, Qiong Luo

2025 NAACL

Wav2Prompt: End-to-End Speech Prompt Learning and Task-based Fine-tuning for Text-based LLMs

Keqi Deng, Guangzhi Sun, Phil Woodland

2025 NAACL

How to Make the Most of LLMs’ Grammatical Knowledge for Acceptability Judgments

Yusuke Ide, Yuto Nishida, Justin Vasselli et al.

2025 NAACL

A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization

Haoxin Liu, Chenghao Liu, B. Aditya Prakash

2025 NAACL

Towards Robust Knowledge Representations in Multilingual LLMs for Equivalence and Inheritance based Consistent Reasoning

Gaurav Arora, Srujana Merugu, Shreya Jain et al.

2025 NAACL

LLMs as Meta-Reviewers’ Assistants: A Case Study

Eftekhar Hossain, Sanjeev Kumar Sinha, Naman Bansal et al.

2025 NAACL

Papers