Vishal M. Patel
114 papers · 2013–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π Conference Polyglot (10) π Academic Marathon (13) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (5)
πΊοΈ
Taxonomy Completionist
(126)
π
Conference Polyglot
(10)
π
Renaissance Researcher
(9)
π
Conference Loyalist
(25)
π
Keyword Trendsetter Combo
(4)
π€
Dynamic Duo
(12)
π
Grand Slam
π¬
Deep Specialist
(21)
π§¬
Topic Evolution
π
Keyword Champion
(5)
π
Conference Pioneer
π
Century Club
(108)
β‘
Prolific Year
(6)
π₯
Unstoppable
(10)
ποΈ
Keyword Collector
(399)
π
Trend Setter
Conferences
CVPR (43)
WACV (25)
ECCV (10)
ICCV (10)
MIDL (10)
AAAI (6)
MICCAI (4)
NIPS (3)
ICLR (2)
ICML (1)
Top co-authors
Research topics
Keywords
diffusion model
(21)
domain adaptation
(11)
image restoration
(11)
image generation
(8)
self-supervised learning
(6)
change detection
(5)
object detection
(5)
attention mechanism
(5)
face recognition
(5)
remote sensing
(5)
deep learning
(5)
contrastive learning
(4)
dictionary learning
(4)
convolutional neural network
(4)
weakly supervised learning
(4)
transfer learning
(4)
knowledge distillation
(4)
semantic segmentation
(4)
image synthesis
(4)
feature fusion
(4)
Papers
F-ViTA: Foundation Model Guided Visible to Infrared Translation
WACV 2026
GHOST: Getting to the Bottom of Hallucinations with A Multi-round Consistency Benchmark
WACV 2026
Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection
WACV 2026
DiffRegCD: Integrated Registration and Change Detection with Diffusion Features
WACV 2026
Referring Change Detection in Remote Sensing Imagery
WACV 2026
GenVOG-DiT: A Transformer-Based Diffusion Model for Pose-Driven, Patient-Agnostic Nystagmus VOG Video Generation
MIDL 2026
CatVLM: Enhancing Temporal Understanding in Cataract Surgery Videos with Boundary-Aware VLM
MIDL 2026
ProCrop: Learning Aesthetic Image Cropping from Professional Compositions
AAAI 2026
PETALface: Parameter Efficient Transfer Learning for Low-Resolution Face Recognition
WACV 2025
SyncNoise: Geometrically Consistent Noise Prediction for Instruction-based 3D Editing
AAAI 2025
SegFace: Face Segmentation of Long-Tail Classes
AAAI 2025
AWRaCLe: All-Weather Image Restoration Using Visual In-Context Learning
AAAI 2025
SINR: Sparsity Driven Compressed Implicit Neural Representations
CVPR 2025
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
CVPR 2025
Distilling Multi-modal Large Language Models for Autonomous Driving
CVPR 2025
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
CVPR 2025
MIRE: Matched Implicit Neural Representations
CVPR 2025
STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models
CVPR 2025
Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset
CVPR 2025
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
CVPR 2025
The Power of Context: How Multimodality Improves Image Super-Resolution
CVPR 2025
UniRes: Universal Image Restoration for Complex Degradations
ICCV 2025
HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss
ICCV 2025
FaceXFormer: A Unified Transformer for Facial Analysis
ICCV 2025
Scaling Transformer-Based Novel View Synthesis with Models Token Disentanglement and Synthetic Data
ICCV 2025
PIN: Prolate Spheroidal Wave Function-based Implicit Neural Representations
ICLR 2025
Field-DiT: Diffusion Transformer on Unified Video, 3D, and Game Field Generation
ICLR 2025
Perception in Reflection
ICML 2025
StepAL: Step-aware Active Learning for Cataract Surgical Videos
MICCAI 2025
Active Learning for Vision Language Models
WACV 2025
A Mamba-Based Siamese Network for Remote Sensing Change Detection
WACV 2025
ReBotNet: Fast Real-Time Video Enhancement
WACV 2025
Frame by Familiar Frame: Understanding Replication in Video Diffusion Models
WACV 2025
Deep Metric Learning for Unsupervised Remote Sensing Change Detection
WACV 2025
MambaRecon: MRI Reconstruction with Structured State Space Models
WACV 2025
I2I-Galip: Unsupervised Medical Image Translation Using Generative Adversarial CLIP
MIDL 2025
A Vision Foundation Model for Cataract Surgery Using Joint-Embedding Predictive Architecture
MIDL 2025
MedCL: Learn Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation
MIDL 2025
Federated Black-Box Adaptation for Semantic Segmentation
NIPS 2024
Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections
NIPS 2024
ReGS: Reference-based Controllable Scene Stylization with Gaussian Splatting
NIPS 2024
Entropic Open-Set Active Learning
AAAI 2024
LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation
CVPR 2024
Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image
CVPR 2024
View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network
CVPR 2024
3SD: Self-Supervised Saliency Detection With No Labels
WACV 2024
Self-Supervised Denoising Transformer With Gaussian Process
WACV 2024
Attentive Prototypes for Source-Free Unsupervised Domain Adaptive 3D Object Detection
WACV 2024
Latent Feature-Guided Diffusion Models for Shadow Removal
WACV 2024
Diffuse and Restore: A Region-Adaptive Diffusion Model for Identity-Preserving Blind Face Restoration
WACV 2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
CVPR 2024
CrowdDiff: Multi-hypothesis Crowd Density Estimation using Diffusion Models
CVPR 2024
MonoDiff: Monocular 3D Object Detection and Pose Estimation with Diffusion Models
CVPR 2024
CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation
CVPR 2024
Target and task specific source-free domain adaptive image segmentation
MIDL 2024
Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training
MIDL 2024
UltraMAE: Multi-modal Masked Autoencoder for Ultrasound Pre-training
MIDL 2024
S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation
MICCAI 2024
ModelMix: A New Model-Mixup Strategy to Minimize Vicinal Risk across Tasks for Few-scribble based Cardiac Segmentation
MICCAI 2024
Black-Box Adaptation for Medical Image Segmentation
MICCAI 2024
Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection
ECCV 2024
Ambiguous Medical Image Segmentation Using Diffusion Models
CVPR 2023
Towards Online Domain Adaptive Object Detection
WACV 2023
Fine-Context Shadow Detection Using Shadow Removal
WACV 2023
AT-DDPM: Restoring Faces Degraded by Atmospheric Turbulence Using Denoising Diffusion Probabilistic Models
WACV 2023
Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis
ICCV 2023
On-the-Fly Test-time Adaptation for Medical Image Segmentation
MIDL 2023
Reference-based MRI Reconstruction Using Texture Transformer
MIDL 2023
Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models
CVPR 2023
Spatio-Temporal Pixel-Level Contrastive Learning-Based Source-Free Domain Adaptation for Video Semantic Segmentation
CVPR 2023
Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations
CVPR 2023
LightPainter: Interactive Portrait Relighting With Freehand Scribble
CVPR 2023
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders
CVPR 2023
SceneComposer: Any-Level Semantic Image Synthesis
CVPR 2023
Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection
CVPR 2023
Auto-FedRL: Federated Hyperparameter Optimization for Multi-Institutional Medical Image Segmentation
ECCV 2022
ART-SS: An Adaptive Rejection Technique for Semi-Supervised Restoration for Adverse Weather-Affected Images
ECCV 2022
Deep Semantic Statistics Matching (D2SM) Denoising Network
ECCV 2022
Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection Using Meta-Learning
WACV 2022
Multimodal Learning Using Optimal Transport for Sarcasm and Humor Detection
WACV 2022
Enhancing Adversarial Robustness for Deep Metric Learning
CVPR 2022
SketchEdit: Mask-Free Local Image Manipulation With Partial Sketches
CVPR 2022
TransWeather: Transformer-Based Restoration of Images Degraded by Adverse Weather Conditions
CVPR 2022
Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination
CVPR 2022
HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening
CVPR 2022
Completely Self-Supervised Crowd Counting via Distribution Matching
ECCV 2022
Multi-Institutional Collaborations for Improving Deep Learning-Based Magnetic Resonance Image Reconstruction Using Federated Learning
CVPR 2021
MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection
CVPR 2021
CR-Fill: Generative Image Inpainting With Auxiliary Contextual Reconstruction
ICCV 2021
Deep Image Compositing
WACV 2021
A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset
WACV 2021
Overcomplete Deep Subspace Clustering Networks
WACV 2021
Multiple Class Novelty Detection Under Data Distribution Shift
ECCV 2020
Open-set Adversarial Defense
ECCV 2020
Syn2Real Transfer Learning for Image Deraining Using Gaussian Processes
CVPR 2020
Generative-Discriminative Feature Representations for Open-Set Recognition
CVPR 2020
Prior-based Domain Adaptive Object Detection for Hazy and Rainy Conditions
ECCV 2020
Utilizing Patch-level Category Activation Patterns for Multiple Class Novelty Detection
ECCV 2020
Learning to Count in the Crowd from Limited Labeled Data
ECCV 2020
Disentangled Variational Representation for Heterogeneous Face Recognition
AAAI 2019
C2AE: Class Conditioned Auto-Encoder for Open-Set Recognition
CVPR 2019
Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition With Multimodal Training
CVPR 2019
Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method
ICCV 2019
Uncertainty Guided Multi-Scale Residual Learning-Using a Cycle Spinning CNN for Single Image De-Raining
CVPR 2019
Multi-Level Bottom-Top and Top-Bottom Feature Fusion for Crowd Counting
ICCV 2019
Deep Transfer Learning for Multiple Class Novelty Detection
CVPR 2019
Densely Connected Pyramid Dehazing Network
CVPR 2018
Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network
CVPR 2018
Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs
ICCV 2017
Hierarchical Multimodal Metric Learning for Multimodal Classification
CVPR 2017
Matrix Completion for Resolving Label Ambiguity
CVPR 2015
Generalized Domain-Adaptive Dictionaries
CVPR 2013
Latent Space Sparse Subspace Clustering
ICCV 2013
Dictionary Learning from Ambiguously Labeled Data
CVPR 2013