Research Explorer

Evaluating ChatNetZero, an LLM-Chatbot to Demystify Climate Pledges

Angel Hsu, Mason Laney, Ji Zhang et al.

2024 ACL

Human-Centered Design Recommendations for LLM-as-a-judge

Qian Pan, Zahra Ashktorab, Michael Desmond et al.

2024 ACL

Improving LLM-based KGQA for multi-hop Question Answering with implicit reasoning in few-shot examples

Mili Shah, Joyce Cahoon, Mirco Milletari et al.

2024 ACL

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs

Xiangwen Wang, Jie Peng, Kaidi Xu et al.

2024 ACL

SRCB at #SMM4H 2024: Making Full Use of LLM-based Data Augmentation in Adverse Drug Event Extraction and Normalization

Hongyu Li, Yuming Zhang, Yongwei Zhang et al.

2024 ACL

UTRad-NLP at #SMM4H 2024: Why LLM-Generated Texts Fail to Improve Text Classification Models

Yosuke Yamagishi, Yuta Nakamura

2024 ACL

PolyuCBS at SMM4H 2024: LLM-based Medical Disorder and Adverse Drug Event Detection with Low-rank Adaptation

Zhai Yu, Xiaoyi Bao, Emmanuele Chersoni et al.

2024 ACL

LHS712_ADENotGood at #SMM4H 2024 Task 1: Deep-LLMADEminer: A deep learning and LLM pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter

Yifan Zheng, Jun Gong, Shushun Ren et al.

2024 ACL

LLM-Powered Test Case Generation for Detecting Bugs in Plausible Programs

Kaibo Liu, Zhenpeng Chen, Yiyang Liu et al.

2025 ACL

Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions

Hang Li, Tianlong Xu, Kaiqi Yang et al.

2025 ACL

CompileAgent: Automated Real-World Repo-Level Compilation with Tool-Integrated LLM-based Agent System

Li Hu, Guoqiang Chen, Xiuwei Shang et al.

2025 ACL

INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent

Haohang Li, Yupeng Cao, Yangyang Yu et al.

2025 ACL

Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents

Fanhang Man, Huandong Wang, Jianjie Fang et al.

2025 ACL

Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation

Mingxuan Xia, Haobo Wang, Yixuan Li et al.

2025 ACL

TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge

Cheng-Han Chiang, Hung-yi Lee, Michal Lukasik

2025 ACL

MAPS: Motivation-Aware Personalized Search via LLM-Driven Consultation Alignment

Weicong Qin, Yi Xu, Weijie Yu et al.

2025 ACL

Text is All You Need: LLM-enhanced Incremental Social Event Detection

Zitai Qiu, Congbo Ma, Jia Wu et al.

2025 ACL

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Qiyuan Zhang, Yufei Wang, Yuxin Jiang et al.

2025 ACL

Learning to Rewrite: Generalized LLM-Generated Text Detection

Wei Hao, Ran Li, Weiliang Zhao et al.

2025 ACL

G-Safeguard: A Topology-Guided Security Lens and Treatment on LLM-based Multi-agent Systems

Shilong Wang, Guibin Zhang, Miao Yu et al.

2025 ACL

AXIS: Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents

Junting Lu, Zhiyang Zhang, Fangkai Yang et al.

2025 ACL

TripleFact: Defending Data Contamination in the Evaluation of LLM-driven Fake News Detection

Cheng Xu, Nan Yan

2025 ACL

Comparing LLM-generated and human-authored news text using formal syntactic theory

Olga Zamaraeva, Dan Flickinger, Francis Bond et al.

2025 ACL

Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings

Austin Xu, Srijan Bansal, Yifei Ming et al.

2025 ACL

Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents

Tao Wu, Jingyuan Chen, Wang Lin et al.

2025 ACL

Papers