Papers

219 papers found
Can I trust You? LLMs as conversational agents
Marc Döbler, Raghavendran Mahendravarman, Anna Moskvina et al.
2024 EACL
SmartPlay: A Benchmark for LLMs as Intelligent Agents
Yue Wu, Xuan Tang, Tom Mitchell et al.
2024 ICLR
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons
Andrew Szot, Bogdan Mazoure, Omar Attia et al.
2025 CVPR
Automated test generation to evaluate tool-augmented LLMs as conversational AI agents
Samuel Arcadinho, David Oliveira Aparicio, Mariana S. C. Almeida
2024 EMNLP
Language Agents Meet Causality -- Bridging LLMs and Causal World Models
John Gkountouras, Matthias Lindemann, Phillip Lippe et al.
2025 ICLR
2026 AAAI
Aligned LLMs Are Not Aligned Browser Agents
Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.
2025 ICLR
2023 ICCV
Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents
Bandhav Veluri, Benjamin N Peloquin, Bokai Yu et al.
2024 EMNLP
2026 EACL