Yatai Ji
10 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Cross-Pollinator (15) π Academic Marathon (5) π§ Keyword Pioneer π Conference Polyglot (6) π Renaissance Researcher (5)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(30)
π
Century Club
(10)
ποΈ
Keyword Collector
(51)
Conferences
CVPR (3)
EMNLP (2)
AAAI (1)
ECCV (1)
ICCV (1)
ICLR (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(3)
vision-language pre-training
(2)
contrastive learning
(2)
image-text retrieval
(2)
video generation
(1)
visual question answering
(1)
online learning
(1)
attention mechanism
(1)
hierarchical planning
(1)
preference alignment
(1)
prompt engineering
(1)
direct preference optimization
(1)
multi-modal learning
(1)
visual reasoning
(1)
diffusion model
(1)
multimodal interaction
(1)
cross-modal alignment
(1)
multi-modal large language model
(1)
vision-language model
(1)
preference learning
(1)
Papers
Align Video Diffusion Model with Online Video-Centric Preference Optimization
WACV 2026
Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology
AAAI 2026
CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space
EMNLP 2025
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
ICCV 2025
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
ICLR 2025
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection
CVPR 2024
Taming Lookup Tables for Efficient Image Retouching
ECCV 2024
Seeing What You Miss: Vision-Language Pre-Training With Semantic Completion Learning
CVPR 2023
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-Training Model
CVPR 2023
MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering
EMNLP 2021