Dimitris N. Metaxas
64 papers · 2013–2026 · 14 top CS/AI conferences
Achievements
Conference Polyglot (14)
Academic Marathon (13)
Keyword Pioneer
Interdisciplinary Bridge
Cross-Pollinator (9)
Renaissance Researcher (7)
Taxonomy Completionist (90)
Grand Slam
Topic Evolution
Dynamic Duo (13)
Keyword Champion (2)
Triple Crown
Conference Pioneer
Century Club (62)
Keyword Collector (231)
Trend Setter
Unstoppable (12)
The Questioner
Prolific Year (13)
Conferences
CVPR (16)
ICCV (12)
ECCV (6)
ICLR (6)
WACV (5)
ACL (3)
ICML (3)
IJCAI (3)
AAAI (2)
AISTATS (2)
COLING (2)
NIPS (2)
JMLR (1)
MIDL (1)
Top co-authors
Research topics
Keywords
image generation (5)
diffusion model (5)
object detection (4)
data augmentation (3)
contrastive learning (3)
semantic segmentation (3)
multimodal large language model (3)
knowledge distillation (3)
domain adaptation (3)
image editing (2)
representation learning (2)
autonomous driving (2)
point cloud (2)
sign language translation (2)
medical imaging (2)
attention mechanism (2)
sparse recovery (2)
feature learning (2)
self-supervised learning (2)
3d reconstruction (2)
Papers
Stable Signer: Hierarchical Sign Language Generative Model
ACL 2026
Large Sign Language Models: Toward 3D American Sign Language Translation
WACV 2026
DICE: Discrete Inversion Enabling Controllable Editing for Masked Generative Models
WACV 2026
Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
WACV 2026
Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety
ACL 2026
LUCAS: Layered Universal Codec Avatars
CVPR 2025
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation
AAAI 2025
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models Via Visual Information Steering
ICML 2025
Improved Training Technique for Latent Consistency Models
ICLR 2025
Implicit In-context Learning
ICLR 2025
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
ICLR 2025
FlowChef: Steering of Rectified Flow Models for Controlled Generations
ICCV 2025
VISIAR: Empower MLLM for Visual Story Ideation
ACL 2025
MLLM-as-a-Judge for Image Safety without Human Labeling
CVPR 2025
Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction
CVPR 2025
Show and Segment: Universal Medical Image Segmentation via In-Context Learning
CVPR 2025
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
CVPR 2025
Generating Enhanced Negatives for Training Language-Based Object Detectors
CVPR 2024
DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation
NIPS 2024
Diffusion Models for Sign Language Video Anonymization
COLING 2024
A Multimodal Spatio-Temporal GCN Model with Enhancements for Isolated Sign Recognition
COLING 2024
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
NIPS 2024
Instantaneous Perception of Moving Objects in 3D
CVPR 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
CVPR 2024
Taming Self-Training for Open-Vocabulary Object Detection
CVPR 2024
Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction
ECCV 2024
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment
ECCV 2024
DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models
ICLR 2024
How to Trace Latent Generative Model Generated Images without Artificial Watermark?
ICML 2024
Steering Prototypes With Prompt-Tuning for Rehearsal-Free Continual Learning
WACV 2024
Learning Articulated Shape With Keypoint Pseudo-Labels From Web Images
CVPR 2023
SINE: SINgle Image Editing With Text-to-Image Diffusion Models
CVPR 2023
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
CVPR 2023
DeFormer: Integrating Transformers with Deformable Models for 3D Shape Abstraction from a Single Image
ICCV 2023
More Than Just Attention: Improving Cross-Modal Attentions With Contrastive Constraints for Image-Text Matching
WACV 2023
A Manifold View of Adversarial Risk
AISTATS 2022
Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning
ECCV 2022
Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations
ECCV 2022
Exploiting Unlabeled Data with Vision and Language Models for Object Detection
ECCV 2022
Learning Transferable Reward for Query Object Localization with Policy Adaptation
ICLR 2022
CrossNorm and SelfNorm for Generalization Under Distribution Shifts
ICCV 2021
Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images With Artificial Neural Networks
ICCV 2021
Stochastic Transformer Networks With Linear Competing Units: Application To End-to-End SL Translation
ICCV 2021
A Good Image Generator Is What You Need for High-Resolution Video Synthesis
ICLR 2021
Dual Projection Generative Adversarial Networks for Conditional Image Generation
ICCV 2021
Knowledge As Priors: Cross-Modal Knowledge Generalization for Datasets Without Superior Knowledge
CVPR 2020
Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data
CVPR 2020
MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps
CVPR 2020
Dual Iterative Hard Thresholding
JMLR 2020
Object-Guided Instance Segmentation for Biological Images
AAAI 2020
Learning Trailer Moments in Full-Length Movies with Co-Contrastive Attention
ECCV 2020
Distributed Inexact Newton-type Pursuit for Non-convex Sparse Learning
AISTATS 2019
AdaTransform: Adaptive Data Transformation
ICCV 2019
Semantic Graph Convolutional Networks for 3D Human Pose Regression
CVPR 2019
Sharpen Focus: Learning With Attention Separability and Consistency
ICCV 2019
Weakly Supervised Deep Nuclei Segmentation using Points Annotation in Histopathology Images
MIDL 2019
CR-GAN: Learning Complete Representations for Multi-view Generation
IJCAI 2018
StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks
ICCV 2017
Reconstruction-Based Disentanglement for Pose-Invariant Face Recognition
ICCV 2017
Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization
ICML 2017
Visual Tracking with Reliable Memories
IJCAI 2016
Nonlinear Hierarchical Part-Based Regression for Unconstrained Face Alignment
IJCAI 2016
PIEFA: Personalized Incremental and Ensemble Face Alignment
ICCV 2015
Pose-Free Facial Landmark Fitting via Optimized Part Mixtures and Cascaded Deformable Shape Model
ICCV 2013