Vishal M. Patel

114 papers · 2013–2026 · 10 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🌍 Conference Polyglot (10) 🏃 Academic Marathon (13) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5)

🗺️ Taxonomy Completionist (126) 🌍 Conference Polyglot (10) 🌈 Renaissance Researcher (9) 🏠 Conference Loyalist (25) 🌟 Keyword Trendsetter Combo (4) 🤝 Dynamic Duo (12) 🏆 Grand Slam 🔬 Deep Specialist (21) 🧬 Topic Evolution 🏆 Keyword Champion (5) 🚀 Conference Pioneer 💎 Century Club (108) ⚡ Prolific Year (6) 🔥 Unstoppable (10) 🗃️ Keyword Collector (399) 📈 Trend Setter

Conferences

CVPR (43) WACV (25) ECCV (10) ICCV (10) MIDL (10) AAAI (6) MICCAI (4) NIPS (3) ICLR (2) ICML (1)

Top co-authors

Vibashan VS (12) Rajeev Yasarla (10) Jeya Maria Jose Valanarasu (9) Vishwanath A. Sindagi (8) Nithin Gopalakrishnan Nair (8) Rama Chellappa (8) Poojan Oza (8) Jay N. Paranjape (7) Shameema Sikder (6) Kangfu Mei (6)

Research topics

Application Areas (1)

Keywords

diffusion model (21) domain adaptation (11) image restoration (11) image generation (8) self-supervised learning (6) change detection (5) object detection (5) attention mechanism (5) face recognition (5) remote sensing (5) deep learning (5) contrastive learning (4) dictionary learning (4) convolutional neural network (4) weakly supervised learning (4) transfer learning (4) knowledge distillation (4) semantic segmentation (4) image synthesis (4) feature fusion (4)

Papers

F-ViTA: Foundation Model Guided Visible to Infrared Translation WACV 2026 GHOST: Getting to the Bottom of Hallucinations with A Multi-round Consistency Benchmark WACV 2026 Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection WACV 2026 DiffRegCD: Integrated Registration and Change Detection with Diffusion Features WACV 2026 Referring Change Detection in Remote Sensing Imagery WACV 2026 GenVOG-DiT: A Transformer-Based Diffusion Model for Pose-Driven, Patient-Agnostic Nystagmus VOG Video Generation MIDL 2026 CatVLM: Enhancing Temporal Understanding in Cataract Surgery Videos with Boundary-Aware VLM MIDL 2026 ProCrop: Learning Aesthetic Image Cropping from Professional Compositions AAAI 2026 PETALface: Parameter Efficient Transfer Learning for Low-Resolution Face Recognition WACV 2025 SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing AAAI 2025 SegFace: Face Segmentation of Long-Tail Classes AAAI 2025 AWRaCLe: All-Weather Image Restoration Using Visual In-Context Learning AAAI 2025 SINR: Sparsity Driven Compressed Implicit Neural Representations CVPR 2025 Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning CVPR 2025 Distilling Multi-modal Large Language Models for Autonomous Driving CVPR 2025 Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models CVPR 2025 MIRE: Matched Implicit Neural Representations CVPR 2025 STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models CVPR 2025 Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset CVPR 2025 GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration CVPR 2025 The Power of Context: How Multimodality Improves Image Super-Resolution CVPR 2025 UniRes: Universal Image Restoration for Complex Degradations ICCV 2025 HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss ICCV 2025 FaceXFormer: A Unified Transformer for Facial Analysis ICCV 2025 Scaling Transformer-Based Novel View Synthesis with Models Token Disentanglement and Synthetic Data ICCV 2025 PIN: Prolate Spheroidal Wave Function-based Implicit Neural Representations ICLR 2025 Field-DiT: Diffusion Transformer on Unified Video, 3D, and Game Field Generation ICLR 2025 Perception in Reflection ICML 2025 StepAL: Step-aware Active Learning for Cataract Surgical Videos MICCAI 2025 Active Learning for Vision Language Models WACV 2025 A Mamba-Based Siamese Network for Remote Sensing Change Detection WACV 2025 ReBotNet: Fast Real-Time Video Enhancement WACV 2025 Frame by Familiar Frame: Understanding Replication in Video Diffusion Models WACV 2025 Deep Metric Learning for Unsupervised Remote Sensing Change Detection WACV 2025 MambaRecon: MRI Reconstruction with Structured State Space Models WACV 2025 I2I-Galip: Unsupervised Medical Image Translation Using Generative Adversarial CLIP MIDL 2025 A Vision Foundation Model for Cataract Surgery Using Joint-Embedding Predictive Architecture MIDL 2025 MedCL: Learn Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation MIDL 2025 Federated Black-Box Adaptation for Semantic Segmentation NIPS 2024 Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections NIPS 2024 ReGS: Reference-based Controllable Scene Stylization with Gaussian Splatting NIPS 2024 Entropic Open-Set Active Learning AAAI 2024 LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation CVPR 2024 Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image CVPR 2024 View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network CVPR 2024 3SD: Self-Supervised Saliency Detection With No Labels WACV 2024 Self-Supervised Denoising Transformer With Gaussian Process WACV 2024 Attentive Prototypes for Source-Free Unsupervised Domain Adaptive 3D Object Detection WACV 2024 Latent Feature-Guided Diffusion Models for Shadow Removal WACV 2024 Diffuse and Restore: A Region-Adaptive Diffusion Model for Identity-Preserving Blind Face Restoration WACV 2024 JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation CVPR 2024 CrowdDiff: Multi-hypothesis Crowd Density Estimation using Diffusion Models CVPR 2024 MonoDiff: Monocular 3D Object Detection and Pose Estimation with Diffusion Models CVPR 2024 CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation CVPR 2024 Target and task specific source-free domain adaptive image segmentation MIDL 2024 Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training MIDL 2024 UltraMAE: Multi-modal Masked Autoencoder for Ultrasound Pre-training MIDL 2024 S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation MICCAI 2024 ModelMix: A New Model-Mixup Strategy to Minimize Vicinal Risk across Tasks for Few-scribble based Cardiac Segmentation MICCAI 2024 Black-Box Adaptation for Medical Image Segmentation MICCAI 2024 Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection ECCV 2024 Ambiguous Medical Image Segmentation Using Diffusion Models CVPR 2023 Towards Online Domain Adaptive Object Detection WACV 2023 Fine-Context Shadow Detection Using Shadow Removal WACV 2023 AT-DDPM: Restoring Faces Degraded by Atmospheric Turbulence Using Denoising Diffusion Probabilistic Models WACV 2023 Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis ICCV 2023 On-the-Fly Test-time Adaptation for Medical Image Segmentation MIDL 2023 Reference-based MRI Reconstruction Using Texture Transformer MIDL 2023 Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models CVPR 2023 Spatio-Temporal Pixel-Level Contrastive Learning-Based Source-Free Domain Adaptation for Video Semantic Segmentation CVPR 2023 Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations CVPR 2023 LightPainter: Interactive Portrait Relighting With Freehand Scribble CVPR 2023 AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders CVPR 2023 SceneComposer: Any-Level Semantic Image Synthesis CVPR 2023 Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection CVPR 2023 Auto-FedRL: Federated Hyperparameter Optimization for Multi-Institutional Medical Image Segmentation ECCV 2022 ART-SS: An Adaptive Rejection Technique for Semi-Supervised Restoration for Adverse Weather-Affected Images ECCV 2022 Deep Semantic Statistics Matching (D2SM) Denoising Network ECCV 2022 Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection Using Meta-Learning WACV 2022 Multimodal Learning Using Optimal Transport for Sarcasm and Humor Detection WACV 2022 Enhancing Adversarial Robustness for Deep Metric Learning CVPR 2022 SketchEdit: Mask-Free Local Image Manipulation With Partial Sketches CVPR 2022 TransWeather: Transformer-Based Restoration of Images Degraded by Adverse Weather Conditions CVPR 2022 Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination CVPR 2022 HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening CVPR 2022 Completely Self-Supervised Crowd Counting via Distribution Matching ECCV 2022 Multi-Institutional Collaborations for Improving Deep Learning-Based Magnetic Resonance Image Reconstruction Using Federated Learning CVPR 2021 MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection CVPR 2021 CR-Fill: Generative Image Inpainting With Auxiliary Contextual Reconstruction ICCV 2021 Deep Image Compositing WACV 2021 A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset WACV 2021 Overcomplete Deep Subspace Clustering Networks WACV 2021 Multiple Class Novelty Detection Under Data Distribution Shift ECCV 2020 Open-set Adversarial Defense ECCV 2020 Syn2Real Transfer Learning for Image Deraining Using Gaussian Processes CVPR 2020 Generative-Discriminative Feature Representations for Open-Set Recognition CVPR 2020 Prior-based Domain Adaptive Object Detection for Hazy and Rainy Conditions ECCV 2020 Utilizing Patch-level Category Activation Patterns for Multiple Class Novelty Detection ECCV 2020 Learning to Count in the Crowd from Limited Labeled Data ECCV 2020 Disentangled Variational Representation for Heterogeneous Face Recognition AAAI 2019 C2AE: Class Conditioned Auto-Encoder for Open-Set Recognition CVPR 2019 Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition With Multimodal Training CVPR 2019 Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method ICCV 2019 Uncertainty Guided Multi-Scale Residual Learning-Using a Cycle Spinning CNN for Single Image De-Raining CVPR 2019 Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting ICCV 2019 Deep Transfer Learning for Multiple Class Novelty Detection CVPR 2019 Densely Connected Pyramid Dehazing Network CVPR 2018 Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network CVPR 2018 Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs ICCV 2017 Hierarchical Multimodal Metric Learning for Multimodal Classification CVPR 2017 Matrix Completion for Resolving Label Ambiguity CVPR 2015 Generalized Domain-Adaptive Dictionaries CVPR 2013 Latent Space Sparse Subspace Clustering ICCV 2013 Dictionary Learning from Ambiguously Labeled Data CVPR 2013