Derek Hoiem

37 papers · 2013–2025 · 8 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🏃 Academic Marathon (12) 🌍 Conference Polyglot (8) 🗺️ Taxonomy Completionist (68)

🏃 Academic Marathon (12) 🗺️ Taxonomy Completionist (68) 🌈 Renaissance Researcher (6) 🌟 Keyword Trendsetter Combo (3) 🏆 Keyword Champion 🤝 Dynamic Duo (10) 💎 Century Club (37) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (5) 🗃️ Keyword Collector (149) 🔥 Unstoppable (11)

Conferences

CVPR (15) ICCV (7) ECCV (5) NIPS (3) WACV (3) EMNLP (2) ICML (1) NAACL (1)

Top co-authors

Tanmay Gupta (10) Saurabh Singh (5) Joseph DeGol (4) Zhizhong Li (4) Chuhang Zou (4) Michal Shlapentokh-Rothman (4) Aniruddha Kembhavi (4) David Forsyth (3) Alexander Schwing (3) Heng Ji (3)

Keywords

depth estimation (5) object detection (4) 3d reconstruction (4) multimodal learning (4) zero-shot learning (3) surface normal (2) synthetic datum (2) 3d shape prediction (2) image synthesis (2) video understanding (2) landmark detection (2) convolutional neural network (2) vision-language model (2) transfer learning (2) semantic segmentation (2) in-context learning (2) ensemble learning (2) visual question answering (2) object localization (2) normal estimation (2)

Papers

Visual Program Distillation with Template-Based Augmentation EMNLP 2025 RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations CVPR 2025 Consistent Multimodal Generation via a Unified GAN Framework WACV 2024 Region-Based Representations Revisited CVPR 2024 Anytime Continual Learning for Open Vocabulary Classification ECCV 2024 WebWISE: Unlocking Web Interface Control for LLMs via Sequential Exploration NAACL 2024 Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action CVPR 2024 ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation EMNLP 2023 StyleGAN knows Normal, Depth, Albedo, and More NIPS 2023 Webly Supervised Concept Expansion for General Purpose Vision Models ECCV 2022 Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners NIPS 2022 Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture CVPR 2022 Learning Curves for Analysis of Deep Networks ICML 2021 PatchMatch-RL: Deep MVS With Pixelwise Depth, Normal, and Visibility ICCV 2021 Task-Assisted Domain Adaptation With Anchor Tasks WACV 2021 Silhouette Guided Point Cloud Reconstruction beyond Occlusion WACV 2020 Improving Confidence Estimates for Unfamiliar Examples CVPR 2020 Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion CVPR 2020 Contrastive Learning for Weakly Supervised Phrase Grounding ECCV 2020 ViCo: Word Embeddings From Visual Co-Occurrences ICCV 2019 No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques ICCV 2019 Improved Structure from Motion Using Fiducial Marker Matching ECCV 2018 LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image CVPR 2018 Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction CVPR 2018 Imagine This! Scripts to Compositions to Videos ECCV 2018 Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks ICCV 2017 3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks ICCV 2017 ChromaTag: A Colored Marker and Fast Detection Algorithm ICCV 2017 Learning to Localize Little Landmarks CVPR 2016 Swapout: Learning an ensemble of deep architectures NIPS 2016 Where to Look: Focus Regions for Visual Question Answering CVPR 2016 Geometry-Informed Material Recognition CVPR 2016 Learning a Sequential Search for Landmarks CVPR 2015 Completing 3D Object Shape From One Depth Image CVPR 2015 Learning Collections of Part Models for Object Recognition CVPR 2013 Boundary Cues for 3D Object Shape Recovery CVPR 2013 Support Surface Prediction in Indoor Scenes ICCV 2013