Babak Damavandi

12 papers · 2016–2026 · 5 conferences · across top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🏃 Academic Marathon (9) 🐝 Cross-Pollinator (14)

🏃 Academic Marathon (9) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🧬 Topic Evolution 🤝 Dynamic Duo (10) 🗃️ Keyword Collector (68) 📈 Trend Setter 🔥 Unstoppable (5) 💎 Century Club (12) 🚀 Conference Pioneer

Conferences

EMNLP (6) · EACL (3) · ACL (1) · CVPR (1) · INTERSPEECH (1)

Top co-authors

Seungwhan Moon (10) · Zhaojiang Lin (5) · Andrea Madotto (5) · Xin Luna Dong (3) · Satwik Kottur (3) · Adel Ahmadyan (2) · Lambert Mathias (2) · Lu Zhang (2) · Amy Bearman (2) · Anuj Kumar (2)

Keywords

multimodal learning (4) · visual grounding (3) · multimodal dialog (2) · multimodal large language model (2) · sensor fusion (2) · large language model (2) · task-oriented dialog (2) · modality alignment (1) · information retrieval (1) · visual question answering (1) · visual reasoning (1) · task-oriented dialogue (1) · egocentric vision (1) · dialogue generation (1) · symbolic computation (1) · bipartite matching (1) · human attention (1) · language model (1) · synthetic datum (1) · parameter efficient (1)

Papers

VideoMind: Thinking in Steps for Long Video Understanding · EACL 2026
SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code · EACL 2026
TRACE: A Framework for Analyzing and Enhancing Stepwise Reasoning in Vision-Language Models · EACL 2026
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos · EMNLP 2025
SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM · EMNLP 2024
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model · EMNLP 2024
SIMMC-VR: A Task-oriented Multimodal Dialog Dataset with Situated and Immersive VR Streams · ACL 2023
IMU2CLIP: Language-grounded Motion Sensor Translation with Multimodal Contrastive Learning · EMNLP 2023
Navigating Connected Memories with a Task-oriented Dialog System · EMNLP 2022
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations · EMNLP 2021
Connecting What To Say With Where To Look by Modeling Human Attention Traces · CVPR 2021
NN-Grams: Unifying Neural Network and n-Gram Language Models for Speech Recognition · INTERSPEECH 2016