Yanghao Li

31 papers · 2017–2026 · 10 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌍 Conference Polyglot (10) 🏃 Academic Marathon (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (6)

🐝 Cross-Pollinator (6) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (51) 🤝 Dynamic Duo (11) 👥 Mega-Team (85) 🧬 Topic Evolution 🏆 Grand Slam 🚀 Conference Pioneer 🗃️ Keyword Collector (115) 💎 Century Club (30) ⚡ Prolific Year (7) 🔥 Unstoppable (7) ❓ The Questioner 📈 Trend Setter

Conferences

CVPR (10) ICLR (5) ICCV (4) ACL (3) NIPS (3) IJCAI (2) AAAI (1) ECCV (1) ICML (1) JMLR (1)

Top co-authors

Christoph Feichtenhofer (11) haoqi fan (9) Jitendra Malik (7) Karttikeya Mangalam (6) Bo Xiong (5) Naiyan Wang (4) Kaiming He (4) Bowen Zhang (4) Yinfei Yang (3) Jiaying Liu (3)

Keywords

vision transformer (7) image classification (4) masked autoencoder (4) object detection (4) self-supervised learning (3) egocentric video (3) video recognition (3) video representation (2) model scaling (2) domain adaptation (2) transfer learning (2) representation learning (2) video understanding (2) contrastive learning (2) efficient computing (2) activity recognition (2) temporal modeling (2) video classification (2) video segmentation (1) benchmark evaluation (1)

Papers

RSMeM: Knowledge-Enhanced Memory Evolution for Remote Sensing Agents with Systematic Evaluation ACL 2026 EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing ICLR 2025 MMEgo: Towards Building Egocentric Multimodal LLMs for Video QA ICLR 2025 MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning ICLR 2025 Improve Vision Language Model Chain-of-thought Reasoning ACL 2025 Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs ACL 2025 SEP: A General Lossless Compression Framework with Semantics Enhancement and Multi-Stream Pipelines IJCAI 2025 R-MAE: Regions Meet Masked Autoencoders ICLR 2024 Idempotence and Perceptual Image Compression ICLR 2024 Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization CVPR 2023 Idempotent Learned Image Compression with Right-Inverse NIPS 2023 MAViL: Masked Audio-Video Learners NIPS 2023 Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos CVPR 2023 Scaling Language-Image Pre-Training via Masking CVPR 2023 Diffusion Models as Masked Autoencoders ICCV 2023 Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles ICML 2023 Masked Autoencoders As Spatiotemporal Learners NIPS 2022 MViTv2: Improved Multiscale Vision Transformers for Classification and Detection CVPR 2022 Masked Autoencoders Are Scalable Vision Learners CVPR 2022 Reversible Vision Transformers CVPR 2022 MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition CVPR 2022 Ego4D: Around the World in 3,000 Hours of Egocentric Video CVPR 2022 Exploring Plain Vision Transformer Backbones for Object Detection ECCV 2022 Multiscale Vision Transformers ICCV 2021 Ego-Exo: Transferring Visual Representations From Third-Person to First-Person Videos CVPR 2021 Ego-Topo: Environment Affordances From Egocentric Video CVPR 2020 Scale-Aware Trident Networks for Object Detection ICCV 2019 SimpleDet: A Simple and Versatile Distributed Framework for Object Detection and Instance Recognition JMLR 2019 Temporal Bilinear Networks for Video Action Recognition AAAI 2019 Factorized Bilinear Models for Image Recognition ICCV 2017 Demystifying Neural Style Transfer IJCAI 2017