Shraman Pramanick
11 papers · 2021–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (24) π Conference Polyglot (8) π Renaissance Researcher (6) π Interdisciplinary Bridge π§ Keyword Pioneer
π
Conference Polyglot
(8)
π
Renaissance Researcher
(6)
π₯
Mega-Team
(100)
π
Century Club
(11)
β
The Questioner
π₯
Unstoppable
(5)
Conferences
ICCV (3)
CVPR (2)
ACL (1)
ECCV (1)
EMNLP (1)
IJCNLP (1)
NIPS (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(4)
video temporal grounding
(2)
vision-language model
(2)
video understanding
(2)
feature extraction
(1)
optimal transport
(1)
image segmentation
(1)
sarcasm detection
(1)
pose estimation
(1)
egocentric vision
(1)
humor detection
(1)
hand pose estimation
(1)
multiple instance learning
(1)
instruction tuning
(1)
meme analysis
(1)
transfer learning
(1)
activity recognition
(1)
3d pose estimation
(1)
video summarization
(1)
zero-shot learning
(1)
Papers
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs
ICCV 2025
Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model
CVPR 2024
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
CVPR 2024
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
NIPS 2024
UniVTG: Towards Unified Video-Language Temporal Grounding
ICCV 2023
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
ICCV 2023
Multimodal Learning Using Optimal Transport for Sarcasm and Humor Detection
WACV 2022
Where in the World Is This Image? Transformer-Based Geo-Localization in the Wild
ECCV 2022
MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets
EMNLP 2021
Detecting Harmful Memes and Their Targets
IJCNLP 2021
Detecting Harmful Memes and Their Targets
ACL 2021