Jonathan Huang

34 papers · 2007–2025 · 11 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (12) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (11)

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (11) 🗺️ Taxonomy Completionist (12) 🌟 Keyword Trendsetter Combo (7) 🧬 Topic Evolution 🏆 Keyword Champion 🌱 Topic Pioneer 👥 Mega-Team (31) ⚡ Prolific Year (5) 🗃️ Keyword Collector (144) 💎 Century Club (34) ❓ The Questioner 📈 Trend Setter 🔥 Unstoppable (11) 🚀 Conference Pioneer

Conferences

CVPR (8) ECCV (5) NIPS (5) ICCV (4) ICLR (3) INTERSPEECH (3) ICML (2) ACL (1) JMLR (1) NAACL (1) WACV (1)

Top co-authors

Kevin Murphy (8) Vivek Rathod (7) Leonidas Guibas (4) Tobias Bocklet (3) Carlos Guestrin (3) Sara Beery (3) Zhichao Lu (3) Saining Xie (2) Li Fei-fei (2) Sergio Guadarrama (2)

Research topics

Models (1) Probability (1) Education (1)

Keywords

object detection (4) recurrent neural network (3) permutation distribution (2) instance segmentation (2) generative model (2) graphical model (2) knowledge distillation (2) model compression (2) video understanding (2) fourier analysis (2) variational inference (2) fourier decomposition (2) attention mechanism (2) convolutional neural network (2) multimodal learning (2) probabilistic inference (2) weakly supervised learning (2) speaker recognition (2) probabilistic modeling (2) multi-person tracking (2)

Papers

Visually Consistent Hierarchical Image Classification ICLR 2025 Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think ICLR 2025 Principles of Visual Tokens for Efficient Video Understanding ICCV 2025 Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition ECCV 2024 VideoPoet: A Large Language Model for Zero-Shot Video Generation ICML 2024 Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors ECCV 2024 DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model NIPS 2023 PERF-Net: Pose Empowered RGB-Flow Net WACV 2022 The Auto Arborist Dataset: A Large-Scale Benchmark for Multiview Urban Forest Monitoring Under Domain Shift CVPR 2022 The Surprising Impact of Mask-Head Architecture on Novel Class Segmentation ICCV 2021 Length- and Noise-Aware Training Techniques for Short-Utterance Speaker Recognition INTERSPEECH 2020 Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection CVPR 2020 RetinaTrack: Online Single Stage Joint Detection and Tracking CVPR 2020 Compact Speaker Embedding: lrx-Vector INTERSPEECH 2020 Diverse Generation for Multi-Agent Sports Games CVPR 2019 Uncertainty-Aware Audiovisual Activity Recognition Using Deep Bayesian Variational Inference ICCV 2019 Intel Far-Field Speaker Recognition System for VOiCES Challenge 2019 INTERSPEECH 2019 Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification ECCV 2018 Progressive Neural Architecture Search ECCV 2018 Multimodal Relational Tensor Network for Sentiment and Emotion Classification ACL 2018 Learning to Segment via Cut-and-Paste ECCV 2018 Generative Models of Visually Grounded Imagination ICLR 2018 Spatially Adaptive Computation Time for Residual Networks CVPR 2017 Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors CVPR 2017 Detecting Events and Key Actors in Multi-Person Videos CVPR 2016 Generation and Comprehension of Unambiguous Object Descriptions CVPR 2016 Im2Calories: Towards an Automated Mobile Vision Food Diary ICCV 2015 Learning Program Embeddings to Propagate Feedback on Student Code ICML 2015 What’s Cookin’? Interpreting Cooking Videos using Text, Speech and Vision NAACL 2015 Deep Knowledge Tracing NIPS 2015 Probabilistic Event Cascades for Alzheimer's disease NIPS 2012 Fourier Theoretic Probabilistic Inference over Permutations JMLR 2009 Riffled Independence for Ranked Data NIPS 2009 Efficient Inference for Distributions on Permutations NIPS 2007