Jonathan Huang
34 papers · 2007–2025 · across 11 top CS/AI conferences
Achievements
Keyword Pioneer
Taxonomy Completionist (12)
Interdisciplinary Bridge
Renaissance Researcher (5)
Conference Polyglot (11)
Keyword Trendsetter Combo (7)
Topic Evolution
Keyword Champion
Topic Pioneer
Mega-Team (31)
Prolific Year (5)
Keyword Collector (144)
Century Club (34)
The Questioner
Trend Setter
Unstoppable (11)
Conference Pioneer
Conferences
CVPR (8)
ECCV (5)
NIPS (5)
ICCV (4)
ICLR (3)
INTERSPEECH (3)
ICML (2)
ACL (1)
JMLR (1)
NAACL (1)
WACV (1)
Research topics
Keywords
object detection (4)
recurrent neural network (3)
permutation distribution (2)
instance segmentation (2)
generative model (2)
graphical model (2)
knowledge distillation (2)
model compression (2)
video understanding (2)
fourier analysis (2)
variational inference (2)
fourier decomposition (2)
attention mechanism (2)
convolutional neural network (2)
multimodal learning (2)
probabilistic inference (2)
weakly supervised learning (2)
speaker recognition (2)
probabilistic modeling (2)
multi-person tracking (2)
Papers
Visually Consistent Hierarchical Image Classification
ICLR 2025
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
ICLR 2025
Principles of Visual Tokens for Efficient Video Understanding
ICCV 2025
Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition
ECCV 2024
VideoPoet: A Large Language Model for Zero-Shot Video Generation
ICML 2024
Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
ECCV 2024
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
NIPS 2023
PERF-Net: Pose Empowered RGB-Flow Net
WACV 2022
The Auto Arborist Dataset: A Large-Scale Benchmark for Multiview Urban Forest Monitoring Under Domain Shift
CVPR 2022
The Surprising Impact of Mask-Head Architecture on Novel Class Segmentation
ICCV 2021
Length- and Noise-Aware Training Techniques for Short-Utterance Speaker Recognition
INTERSPEECH 2020
Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection
CVPR 2020
RetinaTrack: Online Single Stage Joint Detection and Tracking
CVPR 2020
Compact Speaker Embedding: lrx-Vector
INTERSPEECH 2020
Diverse Generation for Multi-Agent Sports Games
CVPR 2019
Uncertainty-Aware Audiovisual Activity Recognition Using Deep Bayesian Variational Inference
ICCV 2019
Intel Far-Field Speaker Recognition System for VOiCES Challenge 2019
INTERSPEECH 2019
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
ECCV 2018
Progressive Neural Architecture Search
ECCV 2018
Multimodal Relational Tensor Network for Sentiment and Emotion Classification
ACL 2018
Learning to Segment via Cut-and-Paste
ECCV 2018
Generative Models of Visually Grounded Imagination
ICLR 2018
Spatially Adaptive Computation Time for Residual Networks
CVPR 2017
Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors
CVPR 2017
Detecting Events and Key Actors in Multi-Person Videos
CVPR 2016
Generation and Comprehension of Unambiguous Object Descriptions
CVPR 2016
Im2Calories: Towards an Automated Mobile Vision Food Diary
ICCV 2015
Learning Program Embeddings to Propagate Feedback on Student Code
ICML 2015
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision
NAACL 2015
Deep Knowledge Tracing
NIPS 2015
Probabilistic Event Cascades for Alzheimer's disease
NIPS 2012
Fourier Theoretic Probabilistic Inference over Permutations
JMLR 2009
Riffled Independence for Ranked Data
NIPS 2009
Efficient Inference for Distributions on Permutations
NIPS 2007