Babak Damavandi
12 papers · 2016–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🏃 Academic Marathon (9) 🐝 Cross-Pollinator (14)
🏃
Academic Marathon
(9)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🧬
Topic Evolution
🤝
Dynamic Duo
(10)
🗃️
Keyword Collector
(68)
📈
Trend Setter
🔥
Unstoppable
(5)
💎
Century Club
(12)
🚀
Conference Pioneer
Conferences
EMNLP (6)
EACL (3)
ACL (1)
CVPR (1)
INTERSPEECH (1)
Top co-authors
Keywords
multimodal learning
(4)
visual grounding
(3)
multimodal dialog
(2)
multimodal large language model
(2)
sensor fusion
(2)
large language model
(2)
task-oriented dialog
(2)
modality alignment
(1)
information retrieval
(1)
visual question answering
(1)
visual reasoning
(1)
task-oriented dialogue
(1)
egocentric vision
(1)
dialogue generation
(1)
symbolic computation
(1)
bipartite matching
(1)
human attention
(1)
language model
(1)
synthetic datum
(1)
parameter efficient
(1)
Papers
VideoMind: Thinking in Steps for Long Video Understanding
EACL 2026
SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code
EACL 2026
TRACE: A Framework for Analyzing and Enhancing Stepwise Reasoning in Vision-Language Models
EACL 2026
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos
EMNLP 2025
SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
EMNLP 2024
AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
EMNLP 2024
SIMMC-VR: A Task-oriented Multimodal Dialog Dataset with Situated and Immersive VR Streams
ACL 2023
IMU2CLIP: Language-grounded Motion Sensor Translation with Multimodal Contrastive Learning
EMNLP 2023
Navigating Connected Memories with a Task-oriented Dialog System
EMNLP 2022
SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
EMNLP 2021
Connecting What To Say With Where To Look by Modeling Human Attention Traces
CVPR 2021
NN-Grams: Unifying Neural Network and n-Gram Language Models for Speech Recognition
INTERSPEECH 2016