Mahmoud Ahmed
4 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (4)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🐝 Cross-Pollinator (15)
Conferences
EMNLP (1)
ICCV (1)
ICLR (1)
NeurIPS (1)
Top co-authors
Keywords
representation learning (1)
temporal reasoning (1)
question answering (1)
point cloud (1)
3d vision (1)
video understanding (1)
spatial reasoning (1)
long video understanding (1)
long-form video (1)
multimodal benchmark (1)
part segmentation (1)
multi-modal model (1)
compositional understanding (1)
shape retrieval (1)
video-language evaluation (1)
grounding-based skill (1)
3d multimodal (1)
part-aware segmentation (1)
grounded description (1)
Papers
InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
EMNLP 2025
Kestrel: 3D Multimodal LLM for Part-Aware Grounded Description
ICCV 2025
3DCoMPaT200: Language Grounded Large-Scale 3D Vision Dataset for Compositional Recognition
NeurIPS 2024
CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding
ICLR 2024