Research Explorer

MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark

Hongwei Liu, Zilong Zheng, Yuxuan Qiao et al.

2024 ACL

Debiasing In-Context Learning by Instructing LLMs How to Follow Demonstrations

Lvxue Li, Jiaqi Chen, Xinyu Lu et al.

2024 ACL

Penetrative AI: Making LLMs Comprehend the Physical World

Huatao Xu, Liying Han, Qirui Yang et al.

2024 ACL

An Empirical Study of In-context Learning in LLMs for Machine Translation

Pranjal Chitale, Jay Gala, Raj Dabre

2024 ACL

ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs

Lei Sun, Zhengwei Tao, Youdi Li et al.

2024 ACL

LLMCrit: Teaching Large Language Models to Use Criteria

Weizhe Yuan, Pengfei Liu, Matthias Gallé

2024 ACL

Ranking Entities along Conceptual Space Dimensions with LLMs: An Analysis of Fine-Tuning Strategies

Nitesh Kumar, Usashi Chatterjee, Steven Schockaert

2024 ACL

ULTRA: Unleash LLMs’ Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Self-Refinement

Xinliang Frederick Zhang, Carter Blum, Temma Choji et al.

2024 ACL

Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games

Dekun Wu, Haochen Shi, Zhiyuan Sun et al.

2024 ACL

Improving LLM Generations via Fine-Grained Self-Endorsement

Ante Wang, Linfeng Song, Baolin Peng et al.

2024 ACL

Simplifying Translations for Children: Iterative Simplification Considering Age of Acquisition with LLMs

Masashi Oshika, Makoto Morishita, Tsutomu Hirao et al.

2024 ACL

TempCompass: Do Video LLMs Really Understand Videos?

Yuanxin Liu, Shicheng Li, Yi Liu et al.

2024 ACL

Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models

Mahammed Kamruzzaman, Md. Shovon, Gene Kim

2024 ACL

Unexpected Phenomenon: LLMs’ Spurious Associations in Information Extraction

Weiyan Zhang, Wanpeng Lu, Jiacheng Wang et al.

2024 ACL

Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data

Xiao Liu, Zirui Wu, Xueqing Wu et al.

2024 ACL

On the Vulnerability of Safety Alignment in Open-Access LLMs

Jingwei Yi, Rui Ye, Qisi Chen et al.

2024 ACL

Pushing the Limits of Low-Resource NER Using LLM Artificial Data Generation

Joan Santoso, Patrick Sutanto, Billy Cahyadi et al.

2024 ACL

Understanding and Patching Compositional Reasoning in LLMs

Zhaoyi Li, Gangwei Jiang, Hong Xie et al.

2024 ACL

Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling

Shenzhi Wang, Chang Liu, Zilong Zheng et al.

2024 ACL

LLM Performance Predictors are good initializers for Architecture Search

Ganesh Jawahar, Muhammad Abdul-Mageed, Laks Lakshmanan et al.

2024 ACL

DORY: Deliberative Prompt Recovery for LLM

Lirong Gao, Ru Peng, Yiming Zhang et al.

2024 ACL

Data Contamination Calibration for Black-box LLMs

Wentao Ye, Jiaqi Hu, Liyao Li et al.

2024 ACL

PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs

An Liu, Zonghan Yang, Zhenhe Zhang et al.

2024 ACL

Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM

Zijin Hong, Zheng Yuan, Hao Chen et al.

2024 ACL

KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge

Jiyoung Lee, Minwoo Kim, Seungho Kim et al.

2024 ACL
