Yehao Li
24 papers · 2016–2025 · 6 top CS/AI conferences
Achievements
Cross-Pollinator (10)
Academic Marathon (9)
Conference Polyglot (6)
Keyword Pioneer
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (42)
Dynamic Duo (24)
Conference Pioneer
Trend Setter
Prolific Year (5)
Century Club (24)
Keyword Collector (88)
Unstoppable (10)
Conferences
CVPR (11)
ECCV (6)
ICCV (3)
AAAI (2)
ICML (1)
IJCAI (1)
Keywords
image captioning (6)
image generation (3)
diffusion model (3)
long short-term memory (2)
video captioning (2)
multimodal learning (2)
encoder-decoder network (2)
transfer learning (2)
convolutional neural network (2)
autoregressive transformer (1)
visual question answering (1)
object recognition (1)
deformable convolution (1)
semantic segmentation (1)
embedding space (1)
novel object recognition (1)
temporal modeling (1)
text generation (1)
domain adaptation (1)
KL divergence (1)
Papers
Denoising Token Prediction in Masked Autoregressive Models
ICCV 2025
Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots
ICML 2025
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
ECCV 2024
Improving Text-guided Object Inpainting with Semantic Pre-inpainting
ECCV 2024
Improving Virtual Try-On with Garment-focused Diffusion Models
ECCV 2024
Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
CVPR 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
CVPR 2024
HGNet: Learning Hierarchical Geometry From Points, Edges, and Surfaces
CVPR 2023
Semantic-Conditional Diffusion Networks for Image Captioning
CVPR 2023
Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning
ECCV 2022
Comprehending and Ordering Semantics for Image Captioning
CVPR 2022
SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement
ECCV 2022
Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network
AAAI 2021
Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation
CVPR 2020
X-Linear Attention Networks for Image Captioning
CVPR 2020
Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
AAAI 2019
Transferrable Prototypical Networks for Unsupervised Domain Adaptation
CVPR 2019
Pointing Novel Objects in Image Captioning
CVPR 2019
Hierarchy Parsing for Image Captioning
ICCV 2019
Jointly Localizing and Describing Events for Dense Video Captioning
CVPR 2018
Exploring Visual Relationship for Image Captioning
ECCV 2018
Boosting Image Captioning With Attributes
ICCV 2017
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects
CVPR 2017
Learning Deep Intrinsic Video Representation by Exploring Temporal Coherence and Graph Structure
IJCAI 2016