Derek Hoiem
37 papers · 2013–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (6) π Academic Marathon (12) π Conference Polyglot (8) πΊοΈ Taxonomy Completionist (68)
π
Academic Marathon
(12)
πΊοΈ
Taxonomy Completionist
(68)
π
Renaissance Researcher
(6)
π
Keyword Trendsetter Combo
(3)
π
Keyword Champion
π€
Dynamic Duo
(10)
π
Century Club
(37)
π
Trend Setter
π
Conference Pioneer
β‘
Prolific Year
(5)
ποΈ
Keyword Collector
(149)
π₯
Unstoppable
(11)
Conferences
CVPR (15)
ICCV (7)
ECCV (5)
NIPS (3)
WACV (3)
EMNLP (2)
ICML (1)
NAACL (1)
Top co-authors
Keywords
depth estimation
(5)
object detection
(4)
3d reconstruction
(4)
multimodal learning
(4)
zero-shot learning
(3)
surface normal
(2)
synthetic datum
(2)
3d shape prediction
(2)
image synthesis
(2)
video understanding
(2)
landmark detection
(2)
convolutional neural network
(2)
vision-language model
(2)
transfer learning
(2)
semantic segmentation
(2)
in-context learning
(2)
ensemble learning
(2)
visual question answering
(2)
object localization
(2)
normal estimation
(2)
Papers
Visual Program Distillation with Template-Based Augmentation
EMNLP 2025
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations
CVPR 2025
Consistent Multimodal Generation via a Unified GAN Framework
WACV 2024
Region-Based Representations Revisited
CVPR 2024
Anytime Continual Learning for Open Vocabulary Classification
ECCV 2024
WebWISE: Unlocking Web Interface Control for LLMs via Sequential Exploration
NAACL 2024
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action
CVPR 2024
ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation
EMNLP 2023
StyleGAN knows Normal, Depth, Albedo, and More
NIPS 2023
Webly Supervised Concept Expansion for General Purpose Vision Models
ECCV 2022
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
NIPS 2022
Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture
CVPR 2022
Learning Curves for Analysis of Deep Networks
ICML 2021
PatchMatch-RL: Deep MVS With Pixelwise Depth, Normal, and Visibility
ICCV 2021
Task-Assisted Domain Adaptation With Anchor Tasks
WACV 2021
Silhouette Guided Point Cloud Reconstruction beyond Occlusion
WACV 2020
Improving Confidence Estimates for Unfamiliar Examples
CVPR 2020
Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion
CVPR 2020
Contrastive Learning for Weakly Supervised Phrase Grounding
ECCV 2020
ViCo: Word Embeddings From Visual Co-Occurrences
ICCV 2019
No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques
ICCV 2019
Improved Structure from Motion Using Fiducial Marker Matching
ECCV 2018
LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image
CVPR 2018
Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction
CVPR 2018
Imagine This! Scripts to Compositions to Videos
ECCV 2018
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
ICCV 2017
3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks
ICCV 2017
ChromaTag: A Colored Marker and Fast Detection Algorithm
ICCV 2017
Learning to Localize Little Landmarks
CVPR 2016
Swapout: Learning an ensemble of deep architectures
NIPS 2016
Where to Look: Focus Regions for Visual Question Answering
CVPR 2016
Geometry-Informed Material Recognition
CVPR 2016
Learning a Sequential Search for Landmarks
CVPR 2015
Completing 3D Object Shape From One Depth Image
CVPR 2015
Learning Collections of Part Models for Object Recognition
CVPR 2013
Boundary Cues for 3D Object Shape Recovery
CVPR 2013
Support Surface Prediction in Indoor Scenes
ICCV 2013