Dimitris N. Metaxas

64 papers · 2013–2026 · 14 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (14) 🏃 Academic Marathon (13) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (9) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (90) 🏆 Grand Slam 🧬 Topic Evolution 🤝 Dynamic Duo (13) 🏆 Keyword Champion (2) 👑 Triple Crown 🚀 Conference Pioneer 💎 Century Club (62) 🗃️ Keyword Collector (231) 📈 Trend Setter 🔥 Unstoppable (12) ❓ The Questioner ⚡ Prolific Year (13)

Conferences

CVPR (16) ICCV (12) ECCV (6) ICLR (6) WACV (5) ACL (3) ICML (3) IJCAI (3) AAAI (2) AISTATS (2) COLING (2) NIPS (2) JMLR (1) MIDL (1)

Top co-authors

Di Liu (13) Long Zhao (10) Ligong Han (10) Yuxiao Chen (8) Zhenting Wang (7) Xi Peng (7) Bo Liu (7) Yu Tian (6) Zhuowei Li (6) Shaoting Zhang (6)

Research topics

Learning Paradigms (1)

Keywords

image generation (5) diffusion model (5) object detection (4) data augmentation (3) contrastive learning (3) semantic segmentation (3) multimodal large language model (3) knowledge distillation (3) domain adaptation (3) image editing (2) representation learning (2) autonomous driving (2) point cloud (2) sign language translation (2) medical imaging (2) attention mechanism (2) sparse recovery (2) feature learning (2) self-supervised learning (2) 3d reconstruction (2)

Papers

Stable Signer: Hierarchical Sign Language Generative Model ACL 2026
Large Sign Language Models: Toward 3D American Sign Language Translation WACV 2026
DICE: Discrete Inversion Enabling Controllable Editing for Masked Generative Models WACV 2026
Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars WACV 2026
Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety ACL 2026
LUCAS: Layered Universal Codec Avatars CVPR 2025
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Image Generation AAAI 2025
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models Via Visual Information Steering ICML 2025
Improved Training Technique for Latent Consistency Models ICLR 2025
Implicit In-context Learning ICLR 2025
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation ICLR 2025
FlowChef: Steering of Rectified Flow Models for Controlled Generations ICCV 2025
VISIAR: Empower MLLM for Visual Story Ideation ACL 2025
MLLM-as-a-Judge for Image Safety without Human Labeling CVPR 2025
Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction CVPR 2025
Show and Segment: Universal Medical Image Segmentation via In-Context Learning CVPR 2025
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device CVPR 2025
Generating Enhanced Negatives for Training Language-Based Object Detectors CVPR 2024
DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation NIPS 2024
Diffusion Models for Sign Language Video Anonymization COLING 2024
A Multimodal Spatio-Temporal GCN Model with Enhancements for Isolated Sign Recognition COLING 2024
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate NIPS 2024
Instantaneous Perception of Moving Objects in 3D CVPR 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models CVPR 2024
Taming Self-Training for Open-Vocabulary Object Detection CVPR 2024
Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction ECCV 2024
Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment ECCV 2024
DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models ICLR 2024
How to Trace Latent Generative Model Generated Images without Artificial Watermark? ICML 2024
Steering Prototypes With Prompt-Tuning for Rehearsal-Free Continual Learning WACV 2024
Learning Articulated Shape With Keypoint Pseudo-Labels From Web Images CVPR 2023
SINE: SINgle Image Editing With Text-to-Image Diffusion Models CVPR 2023
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens CVPR 2023
DeFormer: Integrating Transformers with Deformable Models for 3D Shape Abstraction from a Single Image ICCV 2023
More Than Just Attention: Improving Cross-Modal Attentions With Contrastive Constraints for Image-Text Matching WACV 2023
A Manifold View of Adversarial Risk AISTATS 2022
Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning ECCV 2022
Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations ECCV 2022
Exploiting Unlabeled Data with Vision and Language Models for Object Detection ECCV 2022
Learning Transferable Reward for Query Object Localization with Policy Adaptation ICLR 2022
CrossNorm and SelfNorm for Generalization Under Distribution Shifts ICCV 2021
Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images With Artificial Neural Networks ICCV 2021
Stochastic Transformer Networks With Linear Competing Units: Application To End-to-End SL Translation ICCV 2021
A Good Image Generator Is What You Need for High-Resolution Video Synthesis ICLR 2021
Dual Projection Generative Adversarial Networks for Conditional Image Generation ICCV 2021
Knowledge As Priors: Cross-Modal Knowledge Generalization for Datasets Without Superior Knowledge CVPR 2020
Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data CVPR 2020
MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird's Eye View Maps CVPR 2020
Dual Iterative Hard Thresholding JMLR 2020
Object-Guided Instance Segmentation for Biological Images AAAI 2020
Learning Trailer Moments in Full-Length Movies with Co-Contrastive Attention ECCV 2020
Distributed Inexact Newton-type Pursuit for Non-convex Sparse Learning AISTATS 2019
AdaTransform: Adaptive Data Transformation ICCV 2019
Semantic Graph Convolutional Networks for 3D Human Pose Regression CVPR 2019
Sharpen Focus: Learning With Attention Separability and Consistency ICCV 2019
Weakly Supervised Deep Nuclei Segmentation using Points Annotation in Histopathology Images MIDL 2019
CR-GAN: Learning Complete Representations for Multi-view Generation IJCAI 2018
StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks ICCV 2017
Reconstruction-Based Disentanglement for Pose-Invariant Face Recognition ICCV 2017
Dual Iterative Hard Thresholding: From Non-convex Sparse Minimization to Non-smooth Concave Maximization ICML 2017
Visual Tracking with Reliable Memories IJCAI 2016
Nonlinear Hierarchical Part-Based Regression for Unconstrained Face Alignment IJCAI 2016
PIEFA: Personalized Incremental and Ensemble Face Alignment ICCV 2015
Pose-Free Facial Landmark Fitting via Optimized Part Mixtures and Cascaded Deformable Shape Model ICCV 2013