Yiyuan Zhang
11 papers · 2022–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (25) π Conference Polyglot (3) π Renaissance Researcher (5) π Interdisciplinary Bridge π§ Keyword Pioneer
π
Conference Polyglot
(3)
π
Renaissance Researcher
(5)
π
Keyword Champion
(2)
π
Century Club
(11)
ποΈ
Keyword Collector
(52)
β‘
Prolific Year
(5)
Conferences
ICCV (5)
CVPR (4)
ECCV (2)
Top co-authors
Keywords
multimodal learning
(5)
large language model
(3)
diffusion model
(2)
image recognition
(2)
foundation model
(2)
audio recognition
(2)
video understanding
(2)
modality alignment
(1)
weakly supervised learning
(1)
semantic segmentation
(1)
instruction following
(1)
dynamic routing
(1)
text-to-image generation
(1)
instruction tuning
(1)
pseudo labeling
(1)
object detection
(1)
convolutional neural network
(1)
state space model
(1)
vision language model
(1)
cross-modal learning
(1)
Papers
Learning Beyond Still Frames: Scaling Vision-Language Models with Video
ICCV 2025
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions
ICCV 2025
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities
ICCV 2025
Breaking the Encoder Barrier for Seamless Video-Language Understanding
ICCV 2025
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
ICCV 2025
Online Vectorized HD Map Construction using Geometry
ECCV 2024
OneLLM: One Framework to Align All Modalities with Language
CVPR 2024
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
CVPR 2024
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio Video Point Cloud Time-Series and Image Recognition
CVPR 2024
Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors
CVPR 2024
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification
ECCV 2022