Anton van den Hengel
103 papers · 2012–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
🌍 Conference Polyglot (12) 🏃 Academic Marathon (13) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (7)
🧭
Keyword Pioneer
🌈
Renaissance Researcher
(14)
🏃
Academic Marathon
(13)
🌟
Keyword Trendsetter Combo
(3)
🏠
Conference Loyalist
(60)
🤝
Dynamic Duo
(44)
🏆
Grand Slam
🌱
Topic Pioneer
🔬
Deep Specialist
(16)
🏆
Keyword Champion
⚡
Prolific Year
(14)
📈
Trend Setter
🚀
Conference Pioneer
❓
The Questioner
(4)
🔥
Unstoppable
(14)
🗃️
Keyword Collector
(436)
💎
Century Club
(102)
Conferences
CVPR (60)
ICCV (14)
NIPS (7)
ECCV (6)
ICLR (5)
AAAI (2)
EACL (2)
MICCAI (2)
WACV (2)
ICML (1)
IJCAI (1)
JMLR (1)
Top co-authors
Research topics
Keywords
visual question answering
(16)
multimodal learning
(8)
convolutional neural network
(8)
representation learning
(7)
attention mechanism
(6)
image classification
(6)
visual grounding
(4)
reinforcement learning
(4)
object detection
(4)
semantic segmentation
(4)
bayesian inference
(4)
conditional random field
(4)
knowledge base
(4)
visual navigation
(3)
domain generalization
(3)
visual reasoning
(3)
deep learning
(3)
active learning
(3)
vision-language navigation
(3)
domain adaptation
(3)
Papers
The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
EACL 2026
RandLoRA: Full rank parameter-efficient fine-tuning of large models
ICLR 2025
Open-World Objectness Modeling Unifies Novel Object Detection
CVPR 2025
Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering
CVPR 2025
Towards Higher Effective Rank in Parameter-Efficient Fine-tuning using Khatri-Rao Product
ICCV 2025
Seeing the Trees for the Forest: Rethinking Weakly-Supervised Medical Visual Grounding
ICCV 2025
Looking in the Mirror: A Faithful Counterfactual Explanation Method for Interpreting Deep Image Classification Models
ICCV 2025
Interactive Medical Image Analysis with Concept-based Similarity Reasoning
CVPR 2025
Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing
CVPR 2025
EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
CVPR 2025
PedCLIP: A Vision-Language model for Pediatric X-rays with Mixture of Body part Experts
MICCAI 2025
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering
CVPR 2025
Primitive Vision: Improving Diagram Understanding in MLLMs
ICML 2025
Analytic DAG Constraints for Differentiable DAG Learning
ICLR 2025
Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling
ICLR 2025
Weakly Supervised Video Individual Counting
CVPR 2024
Knowledge Composition using Task Vectors with Learned Anisotropic Scaling
NIPS 2024
BLiRF: Bandlimited Radiance Fields for Dynamic Scene Modeling
AAAI 2024
LipAT: Beyond Style Transfer for Controllable Neural Simulation of Lipstick Using Cosmetic Attributes
WACV 2024
AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis
MICCAI 2024
Improving the Convergence of Dynamic NeRFs via Optimal Transport
ICLR 2024
Identifiable Latent Polynomial Causal Models through the Lens of Change
ICLR 2024
CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation
CVPR 2024
ViewFusion: Towards Multi-View Consistency via Interpolated Denoising
CVPR 2024
Distributionally Robust Bayesian Optimization with $\varphi$-divergences
NIPS 2023
RanPAC: Random Projections and Pre-trained Models for Continual Learning
NIPS 2023
Domain Generalization via Rationale Invariance
ICCV 2023
Knowledge Combination To Learn Rotated Detection Without Rotated Annotation
CVPR 2023
Semi-Supervised Semantic Segmentation under Label Noise via Diverse Learning Groups
ICCV 2023
Learning Common Rationale To Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems
CVPR 2023
Learning Bayesian Sparse Networks With Full Experience Replay for Continual Learning
CVPR 2022
Active Learning by Feature Mixing
CVPR 2022
Retrieval Augmented Classification for Long-Tail Visual Recognition
CVPR 2022
Poseur: Direct Human Pose Regression with Transformers
ECCV 2022
ForeSI: Success-Aware Visual Navigation Agent
WACV 2022
PointInst3D: Segmenting 3D Instances by Points
ECCV 2022
Evading the Simplicity Bias: Training a Diverse Set of Models Discovers Solutions With Superior OOD Generalization
CVPR 2022
DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution
CVPR 2021
Unshuffling Data for Improved Generalization in Visual Question Answering
ICCV 2021
The Road To Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
ICCV 2021
Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge
EACL 2021
Memory-Augmented Dynamic Neural Relational Inference
ICCV 2021
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
CVPR 2020
On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law
NIPS 2020
Counterfactual Vision-and-Language Navigation: Unravelling the Unseen
NIPS 2020
V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices
AAAI 2020
Gold Seeker: Information Gain From Policy Distributions for Goal-Oriented Vision-and-Langauge Reasoning
CVPR 2020
Counterfactual Vision and Language Learning
CVPR 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
CVPR 2020
Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection
CVPR 2020
Object-and-Action Aware Model for Visual Language Navigation
ECCV 2020
Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision
ECCV 2020
Attention-Guided Network for Ghost-Free High Dynamic Range Imaging
CVPR 2019
Memorizing Normality to Detect Anomaly: Memory-Augmented Deep Autoencoder for Unsupervised Anomaly Detection
ICCV 2019
What's to Know? Uncertainty as a Guide to Asking Goal-Oriented Questions
CVPR 2019
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks
CVPR 2019
A Generative Adversarial Density Estimator
CVPR 2019
Visual Question Answering as Reading Comprehension
CVPR 2019
Actively Seeking and Learning From Live Data
CVPR 2019
Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries
CVPR 2018
Tips and Tricks for Visual Question Answering: Learnings From the 2017 Challenge
CVPR 2018
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
CVPR 2018
Visual Question Answering as a Meta Learning Task
ECCV 2018
Goal-Oriented Visual Question Generation via Intermediate Rewards
ECCV 2018
Visual Question Answering With Memory-Augmented Networks
CVPR 2018
Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning
CVPR 2018
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions
CVPR 2017
Graph-Structured Representations for Visual Question Answering
CVPR 2017
Self-Paced Kernel Estimation for Robust Blind Image Deblurring
ICCV 2017
Explicit Knowledge-based Reasoning for Visual Question Answering
IJCAI 2017
When Unsupervised Domain Adaptation Meets Tensor Representations
ICCV 2017
Infinite Variational Autoencoder for Semi-Supervised Learning
CVPR 2017
Multi-Attention Network for One Shot Learning
CVPR 2017
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur
CVPR 2017
Sequential Person Recognition in Photo Albums With a Recurrent Network
CVPR 2017
What's Wrong With That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution
CVPR 2016
Proximal Riemannian Pursuit for Large-Scale Trace-Norm Minimization
CVPR 2016
Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge From External Sources
CVPR 2016
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation
CVPR 2016
Less Is More: Zero-Shot Learning From Online Textual Documents With Noise Suppression
CVPR 2016
Blind Image Deconvolution by Automatic Gradient Activation
CVPR 2016
Pairwise Matching Through Max-Weight Bipartite Belief Propagation
CVPR 2016
What Value Do Explicit High Level Concepts Have in Vision to Language Problems?
CVPR 2016
Part-Based Modelling of Compound Scenes From Images
CVPR 2015
Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs
CVPR 2015
Mid-Level Deep Pattern Mining
CVPR 2015
The Treasure Beneath Convolutional Layers: Cross-Convolutional-Layer Pooling for Image Classification
CVPR 2015
Deeply Learning the Messages in Message Passing Inference
NIPS 2015
Learning Graph Structure for Multi-Label Image Classification via Clique Generation
CVPR 2015
Efficient SDP Inference for Fully-Connected CRFs Based on Low-Rank Decomposition
CVPR 2015
Robust Multiple Homography Estimation: An Ill-Solved Problem
CVPR 2015
Learning to Rank in Person Re-Identification With Metric Ensembles
CVPR 2015
Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors
NIPS 2014
Fast Supervised Hashing with Decision Trees for High-Dimensional Data
CVPR 2014
Part-Based Visual Tracking with Online Latent Structural Learning
CVPR 2013
Learning Compact Binary Codes for Visual Tracking
CVPR 2013
A Fast Semidefinite Approach to Solving Binary Quadratic Problems
CVPR 2013
Contextual Hypergraph Modeling for Salient Object Detection
ICCV 2013
Bilinear Programming for Human Activity Recognition with Unknown MRF Graphs
CVPR 2013
Inductive Hashing on Manifolds
CVPR 2013
A General Two-Step Approach to Learning-Based Hashing
ICCV 2013
Efficient Pedestrian Detection by Directly Optimizing the Partial Area under the ROC Curve
ICCV 2013
Positive Semidefinite Metric Learning Using Boosting-like Algorithms
JMLR 2012