Yucheng Zhao
18 papers · 2020–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (5) π Academic Marathon (5) π Conference Polyglot (8) πΊοΈ Taxonomy Completionist (44)
πΊοΈ
Taxonomy Completionist
(44)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
The Namer
π€
Dynamic Duo
(11)
π§¬
Topic Evolution
π
Century Club
(18)
ποΈ
Keyword Collector
(97)
π₯
Unstoppable
(6)
β
The Questioner
β‘
Prolific Year
(5)
Conferences
AAAI (5)
CVPR (3)
ICCV (3)
INTERSPEECH (2)
NIPS (2)
ECCV (1)
ICLR (1)
IJCAI (1)
Top co-authors
Keywords
vision transformer
(4)
image classification
(3)
contrastive learning
(2)
attention mechanism
(2)
transformer architecture
(2)
video understanding
(2)
bird's-eye view
(2)
autonomous driving
(2)
image segmentation
(1)
sparse representation
(1)
semantic segmentation
(1)
dense matching
(1)
video generation
(1)
event camera
(1)
speech separation
(1)
image generation
(1)
voice conversion
(1)
data augmentation
(1)
self-supervised learning
(1)
action recognition
(1)
Papers
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
AAAI 2025
Reconstructive Visual Instruction Tuning
ICLR 2025
Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
ICCV 2025
Holistic Tokenizer for Autoregressive Image Generation
ICCV 2025
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
AAAI 2025
MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds
AAAI 2025
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
CVPR 2024
Stream Query Denoising for Vectorized HD-Map Construction
ECCV 2024
Streaming Video Model
CVPR 2023
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
CVPR 2023
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
INTERSPEECH 2022
Peripheral Vision Transformer
NIPS 2022
Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?
AAAI 2022
When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism
AAAI 2022
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks
NIPS 2022
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
INTERSPEECH 2021
Self-Supervised Visual Representations Learning by Contrastive Mask Prediction
ICCV 2021
Multi-Scale Group Transformer for Long Sequence Modeling in Speech Separation
IJCAI 2020