De-An Huang

35 papers · 2013–2025 · 8 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (8)

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12) 🧭 Keyword Pioneer 🌟 Keyword Trendsetter Combo (4) 🤝 Dynamic Duo (12) 👥 Mega-Team (25) 🧬 Topic Evolution 🔥 Unstoppable (9) 🗃️ Keyword Collector (156) 🚀 Conference Pioneer ❓ The Questioner ⚡ Prolific Year (5) 💎 Century Club (35) 📈 Trend Setter

Conferences

CVPR (13) NIPS (6) ECCV (5) ICLR (4) ICCV (3) ICML (2) EMNLP (1) WACV (1)

Top co-authors

Li Fei-fei (12) Juan Carlos Niebles (11) Anima Anandkumar (10) Zhiding Yu (9) Yuke Zhu (7) Weili Nie (5) Chaowei Xiao (5) Linxi Fan (5) Jan Kautz (4) Serena Yeung (3)

Research topics

Differential Privacy (1)

Keywords

visual grounding (3) video understanding (3) few-shot learning (3) object detection (2) reference resolution (2) multimodal large language model (2) multimodal learning (2) knowledge distillation (2) zero-shot generalization (2) action recognition (2) instructional video (2) visual language model (2) temporal alignment (2) motion dynamics (2) unsupervised learning (2) robotic manipulation (1) imitation learning (1) representation learning (1) sequential decision-making (1) interactive decision-making (1)

Papers

Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought CVPR 2025 Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders ICLR 2025 T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching ICLR 2025 NVILA: Efficient Frontier Visual Language Models CVPR 2025 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks CVPR 2025 Differentially Private Video Activity Recognition WACV 2024 PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees CVPR 2024 LITA: Language Instructed Temporal-Localization Assistant ECCV 2024 Eureka: Human-Level Reward Design via Coding Large Language Models ICLR 2024 Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition ICLR 2024 Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning EMNLP 2023 I$^2$SB: Image-to-Image Schrödinger Bridge ICML 2023 MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training NIPS 2022 Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models NIPS 2022 MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge NIPS 2022 Pre-Trained Language Models for Interactive Decision-Making NIPS 2022 SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies ICML 2021 Spatio-Temporal Graph for Video Captioning With Knowledge Distillation CVPR 2020 Procedure Planning in Instructional Videos ECCV 2020 Regression Planning Networks NIPS 2019 Imitation Learning for Human Pose Prediction ICCV 2019 D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation CVPR 2019 Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration CVPR 2019 Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos CVPR 2018 What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets CVPR 2018 Dynamic Task Prioritization for Multitask Learning ECCV 2018 Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos ECCV 2018 Neural Graph Matching Networks for Fewshot 3D Action Recognition ECCV 2018 Learning to Decompose and Disentangle Representations for Video Prediction NIPS 2018 Unsupervised Learning of Long-Term Motion Dynamics for Videos CVPR 2017 Forecasting Interactive Dynamics of Pedestrians With Fictitious Play CVPR 2017 Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos CVPR 2017 Visual Forecasting by Imitating Dynamics in Natural Sequences ICCV 2017 How Do We Use Our Hands? Discovering a Diverse Set of Common Grasps CVPR 2015 Coupled Dictionary and Feature Space Learning with Applications to Cross-Domain Image Synthesis and Recognition ICCV 2013