conftrace_

Yiyuan Zhang

11 papers · 2022–2025 · 3 conferences · across top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (25) 🌍 Conference Polyglot (3) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏆 Keyword Champion (2) 💎 Century Club (11) 🗃️ Keyword Collector (52) ⚡ Prolific Year (5)

Conferences

ICCV (5) CVPR (4) ECCV (2)

Top co-authors

Xiangyu Yue (8) Xiaohan Ding (3) Handong Li (3) Kaixiong Gong (3) Jing Liu (3) Yixiao Ge (2) Ying Shan (2) Yuhao Kang (1) Zibin Wang (1) Yu Qiao (1)

Keywords

multimodal learning (5) large language model (3) diffusion model (2) image recognition (2) foundation model (2) audio recognition (2) video understanding (2) modality alignment (1) weakly supervised learning (1) semantic segmentation (1) instruction following (1) dynamic routing (1) text-to-image generation (1) instruction tuning (1) pseudo labeling (1) object detection (1) convolutional neural network (1) state space model (1) vision language model (1) cross-modal learning (1)

Papers

Learning Beyond Still Frames: Scaling Vision-Language Models with Video · ICCV 2025
FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions · ICCV 2025
Scaling Omni-modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities · ICCV 2025
Breaking the Encoder Barrier for Seamless Video-Language Understanding · ICCV 2025
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing · ICCV 2025
Online Vectorized HD Map Construction using Geometry · ECCV 2024
OneLLM: One Framework to Align All Modalities with Language · CVPR 2024
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities · CVPR 2024
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition · CVPR 2024
Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors · CVPR 2024
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification · ECCV 2022