Yanghao Li
31 papers · 2017–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (10) π Academic Marathon (8) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (6)
π
Cross-Pollinator
(6)
π
Renaissance Researcher
(7)
πΊοΈ
Taxonomy Completionist
(51)
π€
Dynamic Duo
(11)
π₯
Mega-Team
(85)
π§¬
Topic Evolution
π
Grand Slam
π
Conference Pioneer
ποΈ
Keyword Collector
(115)
π
Century Club
(30)
β‘
Prolific Year
(7)
π₯
Unstoppable
(7)
β
The Questioner
π
Trend Setter
Conferences
CVPR (10)
ICLR (5)
ICCV (4)
ACL (3)
NIPS (3)
IJCAI (2)
AAAI (1)
ECCV (1)
ICML (1)
JMLR (1)
Top co-authors
Keywords
vision transformer
(7)
image classification
(4)
masked autoencoder
(4)
object detection
(4)
self-supervised learning
(3)
egocentric video
(3)
video recognition
(3)
video representation
(2)
model scaling
(2)
domain adaptation
(2)
transfer learning
(2)
representation learning
(2)
video understanding
(2)
contrastive learning
(2)
efficient computing
(2)
activity recognition
(2)
temporal modeling
(2)
video classification
(2)
video segmentation
(1)
benchmark evaluation
(1)
Papers
RSMeM: Knowledge-Enhanced Memory Evolution for Remote Sensing Agents with Systematic Evaluation
ACL 2026
EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing
ICLR 2025
MMEgo: Towards Building Egocentric Multimodal LLMs for Video QA
ICLR 2025
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
ICLR 2025
Improve Vision Language Model Chain-of-thought Reasoning
ACL 2025
Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
ACL 2025
SEP: A General Lossless Compression Framework with Semantics Enhancement and Multi-Stream Pipelines
IJCAI 2025
R-MAE: Regions Meet Masked Autoencoders
ICLR 2024
Idempotence and Perceptual Image Compression
ICLR 2024
Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization
CVPR 2023
Idempotent Learned Image Compression with Right-Inverse
NIPS 2023
MAViL: Masked Audio-Video Learners
NIPS 2023
Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos
CVPR 2023
Scaling Language-Image Pre-Training via Masking
CVPR 2023
Diffusion Models as Masked Autoencoders
ICCV 2023
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
ICML 2023
Masked Autoencoders As Spatiotemporal Learners
NIPS 2022
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
CVPR 2022
Masked Autoencoders Are Scalable Vision Learners
CVPR 2022
Reversible Vision Transformers
CVPR 2022
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
CVPR 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022
Exploring Plain Vision Transformer Backbones for Object Detection
ECCV 2022
Multiscale Vision Transformers
ICCV 2021
Ego-Exo: Transferring Visual Representations From Third-Person to First-Person Videos
CVPR 2021
Ego-Topo: Environment Affordances From Egocentric Video
CVPR 2020
Scale-Aware Trident Networks for Object Detection
ICCV 2019
SimpleDet: A Simple and Versatile Distributed Framework for Object Detection and Instance Recognition
JMLR 2019
Temporal Bilinear Networks for Video Action Recognition
AAAI 2019
Factorized Bilinear Models for Image Recognition
ICCV 2017
Demystifying Neural Style Transfer
IJCAI 2017