Md Mohaiminul Islam
11 papers · 2022–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (22) π Renaissance Researcher (5) π Conference Polyglot (4) π Interdisciplinary Bridge π§ Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(4)
π€
Dynamic Duo
(11)
π₯
Mega-Team
(100)
β‘
Prolific Year
(5)
π
Century Club
(11)
π₯
Unstoppable
(5)
Conferences
CVPR (5)
ECCV (3)
EMNLP (2)
WACV (1)
Top co-authors
Keywords
video understanding
(5)
multimodal learning
(3)
large language model
(3)
temporal grounding
(2)
state-space model
(2)
video question answering
(2)
egocentric vision
(1)
video analysis
(1)
curriculum learning
(1)
hand pose estimation
(1)
chain-of-thought reasoning
(1)
activity recognition
(1)
3d pose estimation
(1)
hierarchical structure
(1)
vision-language model
(1)
language model
(1)
multi-modal learning
(1)
video language model
(1)
long-form video
(1)
video captioning
(1)
Papers
TimeRefine: Temporal Grounding with Time Refining Video LLM
WACV 2026
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning
EMNLP 2025
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering
CVPR 2025
ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
CVPR 2025
A Simple LLM Framework for Long-Range Video Question-Answering
EMNLP 2024
Video ReCap: Recursive Captioning of Hour-Long Videos
CVPR 2024
"Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos"
ECCV 2024
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024
RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos
ECCV 2024
Efficient Movie Scene Detection Using State-Space Transformers
CVPR 2023
Long Movie Clip Classification with State-Space Video Models
ECCV 2022