Research Explorer

AgentSwift: Efficient LLM Agent Design via Value-Guided Hierarchical Search

Yu Li, Lehui Li, Zhihao Wu et al.

2026 AAAI

Can I trust You? LLMs as conversational agents

Marc Döbler, Raghavendran Mahendravarman, Anna Moskvina et al.

2024 EACL

SmartPlay: A Benchmark for LLMs as Intelligent Agents

Yue Wu, Xuan Tang, Tom Mitchell et al.

2024 ICLR

From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Andrew Szot, Bogdan Mazoure, Omar Attia et al.

2025 CVPR

Automated test generation to evaluate tool-augmented LLMs as conversational AI agents

Samuel Arcadinho, David Oliveira Aparicio, Mariana S. C. Almeida

2024 EMNLP

Language Agents Meet Causality -- Bridging LLMs and Causal World Models

John Gkountouras, Matthias Lindemann, Phillip Lippe et al.

2025 ICLR

A Metacognitive Architecture for Correcting LLM Errors in AI Agents

Jisu Kim, Mahimul Islam, Ashok Goel

2026 AAAI

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors

Jian Wang, Yinpei Dai, Yichi Zhang et al.

2025 ACL

Mirror in the Model: Ad Banner Image Generation via Reflective Multi-LLM and Multi-modal Agents

Zhao Wang, Bowen Chen, Yotaro Shimose et al.

2025 EMNLP

Aligned LLMs Are Not Aligned Browser Agents

Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.

2025 ICLR

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

Chan Hee Song, Jiaman Wu, Clayton Washington et al.

2023 ICCV

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

Yu Gu, Yiheng Shu, Hao Yu et al.

2024 EMNLP

Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents

Bandhav Veluri, Benjamin N Peloquin, Bokai Yu et al.

2024 EMNLP

From Standard Transformers to Modern LLMs: Bringing Dialogue Models, RAG, and Agents to the Classroom

Maria Tikhonova, Viktoriia A. Chekalina, Artem Chervyakov et al.

2026 EACL

AgentSense: Virtual Sensor Data Generation Using LLM Agents in Simulated Home Environments

Zikang Leng, Megha Thukral, Yaqi Liu et al.

2026 AAAI

Attack the Messages, Not the Agents: A Multi-round Adaptive Stealthy Tampering Framework for LLM-MAS

Bingyu Yan, Xiaoming Zhang, Ziyi Zhou et al.

2026 AAAI

Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents

Zihao Wang, Shaofei Cai, Guanzhou Chen et al.

2023 NeurIPS

Agents Under Siege: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks

Rana Shahroz, Zhen Tan, Sukwon Yun et al.

2025 ACL

MemoryART: Enhancing LLMs via Multi-Memory Models with Adaptive Resonance Theory for Healthcare Agents

Renke Dai, Hebin Hu, Jiahui Zhang et al.

2026 AAAI
