Majid Rabbani
8 papers · 2024–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (6) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (19) π§ Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
EMNLP (2)
ICLR (2)
CVPR (1)
ECCV (1)
ICCV (1)
NIPS (1)
Top co-authors
Keywords
embedding space
(2)
text-video retrieval
(2)
preference optimization
(1)
multimodal learning
(1)
video understanding
(1)
cross-modal retrieval
(1)
sequential prediction
(1)
diffusion model
(1)
autoregressive model
(1)
modality gap
(1)
vision-language model
(1)
multi-modal alignment
(1)
contrastive loss
(1)
vision-language modeling
(1)
similarity metric
(1)
error accumulation
(1)
joint embedding
(1)
dialogue modeling
(1)
joint embedding space
(1)
text-to-video retrieval
(1)
Papers
Re-Imagining Multimodal Instruction Tuning: A Representation View
ICLR 2025
X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning
EMNLP 2025
Visual Self-Refinement for Autoregressive Models
EMNLP 2025
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue
ICCV 2025
Image Translation as Diffusion Visual Programmers
ICLR 2024
Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
CVPR 2024
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
ECCV 2024
Diffusion-Inspired Truncated Sampler for Text-Video Retrieval
NIPS 2024