Research Explorer

LLM Factoscope: Uncovering LLMs’ Factual Discernment through Measuring Inner States

Jinwen He, Yujia Gong, Zijin Lin et al.

2024 ACL

Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm

Yuanzhen Xie, Xinzhou Jin, Tao Xie et al.

2024 ACL

On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey

Lin Long, Rui Wang, Ruixuan Xiao et al.

2024 ACL

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization

Zhiyu Yang, Zihan Zhou, Shuo Wang et al.

2024 ACL

Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications

Junlin Wang, Tianyi Yang, Roy Xie et al.

2024 ACL

Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection

Barah Fazili, Ashish Agrawal, Preethi Jyothi

2024 ACL

CToolEval: A Chinese Benchmark for LLM-Powered Agent Evaluation in Real-World API Interactions

Zishan Guo, Yufei Huang, Deyi Xiong

2024 ACL

Evaluating ChatNetZero, an LLM-Chatbot to Demystify Climate Pledges

Angel Hsu, Mason Laney, Ji Zhang et al.

2024 ACL

Human-Centered Design Recommendations for LLM-as-a-judge

Qian Pan, Zahra Ashktorab, Michael Desmond et al.

2024 ACL

Improving LLM-based KGQA for multi-hop Question Answering with implicit reasoning in few-shot examples

Mili Shah, Joyce Cahoon, Mirco Milletari et al.

2024 ACL

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs

Xiangwen Wang, Jie Peng, Kaidi Xu et al.

2024 ACL

SRCB at #SMM4H 2024: Making Full Use of LLM-based Data Augmentation in Adverse Drug Event Extraction and Normalization

Hongyu Li, Yuming Zhang, Yongwei Zhang et al.

2024 ACL

UTRad-NLP at #SMM4H 2024: Why LLM-Generated Texts Fail to Improve Text Classification Models

Yosuke Yamagishi, Yuta Nakamura

2024 ACL

PolyuCBS at SMM4H 2024: LLM-based Medical Disorder and Adverse Drug Event Detection with Low-rank Adaptation

Zhai Yu, Xiaoyi Bao, Emmanuele Chersoni et al.

2024 ACL

LHS712_ADENotGood at #SMM4H 2024 Task 1: Deep-LLMADEminer: A deep learning and LLM pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter

Yifan Zheng, Jun Gong, Shushun Ren et al.

2024 ACL

LLM-Powered Test Case Generation for Detecting Bugs in Plausible Programs

Kaibo Liu, Zhenpeng Chen, Yiyang Liu et al.

2025 ACL

Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions

Hang Li, Tianlong Xu, Kaiqi Yang et al.

2025 ACL

CompileAgent: Automated Real-World Repo-Level Compilation with Tool-Integrated LLM-based Agent System

Li Hu, Guoqiang Chen, Xiuwei Shang et al.

2025 ACL

INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent

Haohang Li, Yupeng Cao, Yangyang Yu et al.

2025 ACL

Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents

Fanhang Man, Huandong Wang, Jianjie Fang et al.

2025 ACL

Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation

Mingxuan Xia, Haobo Wang, Yixuan Li et al.

2025 ACL

TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge

Cheng-Han Chiang, Hung-yi Lee, Michal Lukasik

2025 ACL

MAPS: Motivation-Aware Personalized Search via LLM-Driven Consultation Alignment

Weicong Qin, Yi Xu, Weijie Yu et al.

2025 ACL

Text is All You Need: LLM-enhanced Incremental Social Event Detection

Zitai Qiu, Congbo Ma, Jia Wu et al.

2025 ACL

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Qiyuan Zhang, Yufei Wang, Yuxin Jiang et al.

2025 ACL

Papers