Yucheng Zhao

18 papers · 2020–2025 · 8 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🏃 Academic Marathon (5) 🌍 Conference Polyglot (8) 🗺️ Taxonomy Completionist (44)

🗺️ Taxonomy Completionist (44) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 📛 The Namer 🤝 Dynamic Duo (11) 🧬 Topic Evolution 💎 Century Club (18) 🗃️ Keyword Collector (97) 🔥 Unstoppable (6) ❓ The Questioner ⚡ Prolific Year (5)

Conferences

AAAI (5) CVPR (3) ICCV (3) INTERSPEECH (2) NIPS (2) ECCV (1) ICLR (1) IJCAI (1)

Top co-authors

Chong Luo (11) Xiangyu Zhang (6) Tiancai Wang (6) Chuanxin Tang (6) Wenjun Zeng (5) Haochen Wang (3) Zheng-Jun Zha (3) Yingfei Liu (3) Fan Jia (3) Guangting Wang (3)

Keywords

vision transformer (4) image classification (3) contrastive learning (2) attention mechanism (2) transformer architecture (2) video understanding (2) bird's-eye view (2) autonomous driving (2) image segmentation (1) sparse representation (1) semantic segmentation (1) dense matching (1) video generation (1) event camera (1) speech separation (1) image generation (1) voice conversion (1) data augmentation (1) self-supervised learning (1) action recognition (1)

Papers

ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance AAAI 2025 Reconstructive Visual Instruction Tuning ICLR 2025 Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness ICCV 2025 Holistic Tokenizer for Autoregressive Image Generation ICCV 2025 SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control AAAI 2025 MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds AAAI 2025 Panacea: Panoramic and Controllable Video Generation for Autonomous Driving CVPR 2024 Stream Query Denoising for Vectorized HD-Map Construction ECCV 2024 Streaming Video Model CVPR 2023 Look Before You Match: Instance Understanding Matters in Video Object Segmentation CVPR 2023 RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion INTERSPEECH 2022 Peripheral Vision Transformer NIPS 2022 Sparse MLP for Image Recognition: Is Self-Attention Really Necessary? AAAI 2022 When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism AAAI 2022 OmniVL: One Foundation Model for Image-Language and Video-Language Tasks NIPS 2022 Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration INTERSPEECH 2021 Self-Supervised Visual Representations Learning by Contrastive Mask Prediction ICCV 2021 Multi-Scale Group Transformer for Long Sequence Modeling in Speech Separation IJCAI 2020