AutoPenBench: A Vulnerability Testing Benchmark for Generative Agents

Luca Gioacchini, Alexander Delsanto, Idilio Drago et al.

2025 EMNLP

Auto prompting without training labels: An LLM cascade for product quality assessment in e-commerce catalogs

Soham Satyadharma, Fatemeh Sheikholeslami, Swati Kaul et al.

2025 EMNLP

AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment

Xiaochong Lan, Jie Feng, Yinxing Liu et al.

2025 EMNLP

AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists

Yifei Li, Hanane Nour Moussa, Ziru Chen et al.

2025 EMNLP

Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant

Lei Shen, Xiaoyu Shen

2025 EMNLP

AutoSpec: An Agentic Framework for Automatically Drafting Patent Specification

Ryan Shea, Zhou Yu

2025 EMNLP

Auto-Weighted Group Relative Preference Optimization for Multi-Objective Text Generation Tasks

Yuki Ichihara, Yuu Jinnai

2025 EMNLP

Averroes at ImageEval 2025 Shared Task: Advancing Arabic Image Captioning with Augmentation and Two-Stage Generation

Mariam Saeed, Sarah Elshabrawy, Abdelrahman Hagrass et al.

2025 EMNLP

Avoidance Decoding for Diverse Multi-Branch Story Generation

Kyeongman Park, Nakyeong Yang, Kyomin Jung

2025 EMNLP

Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition

Yi Liu, Xiangrong Zhu, Xiangyu Liu et al.

2025 EMNLP

AYA at PalmX 2025: Modeling Cultural and Islamic Knowledge in LLMs

Jannatul Tajrin, Bir Ballav Roy, Firoj Alam

2025 EMNLP

AyahVerse at MAHED Shared Task: Fine-Tuning ArabicBERT with Preprocessing for Hope and Hate Detection

Ibad-ur-Rehman Rashid, Muhammad Hashir Khalil

2025 EMNLP

A Zero-Shot Neuro-Symbolic Approach for Complex Knowledge Graph Question Answering

Prerna Agarwal, Srikanta Bedathur

2025 EMNLP

AZLU at ImagEval Shared Task: Bridging Linguistics and Cultural Gaps in Arabic Image Captioning

Sarah Yassine

2025 EMNLP

Babies Learn to Look Ahead: Multi-Token Prediction in Small LMs

Ansar Aynetdinov, Alan Akbik

2025 EMNLP

BabyLM’s First Constructions: Causal interventions provide a signal of learning

Joshua Rozner, Leonie Weissweiler, Cory Shain

2025 EMNLP

Back Attention: Understanding and Enhancing Multi-Hop Reasoning in Large Language Models

Zeping Yu, Yonatan Belinkov, Sophia Ananiadou

2025 EMNLP

Backdoor-Powered Prompt Injection Attacks Nullify Defense Methods

Yulin Chen, Haoran Li, Yuan Sui et al.

2025 EMNLP

BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

Qinzhuo Wu, Pengzhi Gao, Wei Liu et al.

2025 EMNLP

BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text

Ibrahim Al Azher, Miftahul Jannat Mokarrama, Zhishuai Guo et al.

2025 EMNLP

Bag of Tricks for Sparse Mixture-of-Experts: A Benchmark Across Reasoning, Efficiency, and Safety

Mufan Qiu, Zheyu Shen, Pingzhi Li et al.

2025 EMNLP

Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models

Masahiro Kaneko, Alham Fikri Aji, Timothy Baldwin

2025 EMNLP

Balancing Quality and Variation: Spam Filtering Distorts Data Label Distributions

Eve Fleisig, Matthias Orlikowski, Philipp Cimiano et al.

2025 EMNLP

Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models

Benyamin Jamialahmadi, Parsa Kavehzadeh, Mehdi Rezagholizadeh et al.

2025 EMNLP

BALSAM: A Platform for Benchmarking Arabic Large Language Models

Rawan Nasser Almatham, Kareem Mohamed Darwish, Raghad Al-Rasheed et al.

2025 EMNLP
