De-An Huang
35 papers · 2013–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π£ Hot Topic Early Bird π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π Conference Polyglot (8)
π
Interdisciplinary Bridge
π
Academic Marathon
(12)
π§
Keyword Pioneer
π
Keyword Trendsetter Combo
(4)
π€
Dynamic Duo
(12)
π₯
Mega-Team
(25)
π§¬
Topic Evolution
π₯
Unstoppable
(9)
ποΈ
Keyword Collector
(156)
π
Conference Pioneer
β
The Questioner
β‘
Prolific Year
(5)
π
Century Club
(35)
π
Trend Setter
Conferences
CVPR (13)
NIPS (6)
ECCV (5)
ICLR (4)
ICCV (3)
ICML (2)
EMNLP (1)
WACV (1)
Top co-authors
Research topics
Keywords
visual grounding
(3)
video understanding
(3)
few-shot learning
(3)
object detection
(2)
reference resolution
(2)
multimodal large language model
(2)
multimodal learning
(2)
knowledge distillation
(2)
zero-shot generalization
(2)
action recognition
(2)
instructional video
(2)
visual language model
(2)
temporal alignment
(2)
motion dynamics
(2)
unsupervised learning
(2)
robotic manipulation
(1)
imitation learning
(1)
representation learning
(1)
sequential decision-making
(1)
interactive decision-making
(1)
Papers
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought
CVPR 2025
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
ICLR 2025
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching
ICLR 2025
NVILA: Efficient Frontier Visual Language Models
CVPR 2025
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
CVPR 2025
Differentially Private Video Activity Recognition
WACV 2024
PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees
CVPR 2024
LITA: Language Instructed Temporal-Localization Assistant
ECCV 2024
Eureka: Human-Level Reward Design via Coding Large Language Models
ICLR 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
ICLR 2024
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
EMNLP 2023
I$^2$SB: Image-to-Image SchrΓΆdinger Bridge
ICML 2023
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
NIPS 2022
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
NIPS 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
NIPS 2022
Pre-Trained Language Models for Interactive Decision-Making
NIPS 2022
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
ICML 2021
Spatio-Temporal Graph for Video Captioning With Knowledge Distillation
CVPR 2020
Procedure Planning in Instructional Videos
ECCV 2020
Regression Planning Networks
NIPS 2019
Imitation Learning for Human Pose Prediction
ICCV 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
CVPR 2019
Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration
CVPR 2019
Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos
CVPR 2018
What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets
CVPR 2018
Dynamic Task Prioritization for Multitask Learning
ECCV 2018
Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos
ECCV 2018
Neural Graph Matching Networks for Fewshot 3D Action Recognition
ECCV 2018
Learning to Decompose and Disentangle Representations for Video Prediction
NIPS 2018
Unsupervised Learning of Long-Term Motion Dynamics for Videos
CVPR 2017
Forecasting Interactive Dynamics of Pedestrians With Fictitious Play
CVPR 2017
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos
CVPR 2017
Visual Forecasting by Imitating Dynamics in Natural Sequences
ICCV 2017
How Do We Use Our Hands? Discovering a Diverse Set of Common Grasps
CVPR 2015
Coupled Dictionary and Feature Space Learning with Applications to Cross-Domain Image Synthesis and Recognition
ICCV 2013