Papers
4,428 papers found
DOTGraph: CLIP-Driven Feature Disentanglement and Optimal Transport based Graph Learning for Few-Shot Segmentation
Shreya Biswas, Zhaozheng Yin
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji, Taojun Lin, Hongdong Li
Dragonite: Single-Step Drag-based Image Editing with Geometric-Semantic Guidance
Meng-Ting Jhong, Tai-Ming Huang, Shang-Fu Chen et al.
DreamAnywhere: Object-Centric Panoramic 3D Scene Generation
Edoardo A. Dominici, Jozef Hladký, Floor Verhoeven et al.
DreamCatcher: Efficient Multi-Concept Customization via Representation Finetuning
Jungwon Lee, Changhun Lee, Eunhyeok Park
DREAM: Dynamic Prompts and GuidedMix for Efficient Continual Adaptation of Visual-Language Models
Evelyn Chee, Mong Li Lee, Wynne Hsu
DreamMakeup: Face Makeup Customization using Latent Diffusion Models
Geon Yeong Park, Inhwa Han, Serin Yang et al.
Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel NeRA Adapter for Enhanced Feature Adaptation
Gayatri Deshmukh, Somsubhra De, Chirag Sehgal et al.
Dronaquatics: Real-time Swimming Analytics Using Drone Captured Imagery
Thu Tran, Harold Abraham Joseph, Kichang Lee et al.
DRWKV: Focusing on Object Edges for Low-Light Image Enhancement
Xuecheng Bai, Yuxiang Wang, Boyu Hu et al.
DTMIR-Pro: Domain Translation with Prompt-based Latent-Space Generalization for Multi-Weather Image Restoration
Ashutosh Kulkarni, Prashant W. Patil, Santosh Kumar Vipparthi et al.
Dual-Domain Multimodal Hyperbolic Fusion for Cardiopulmonary Disease Diagnosis in Emergency Care
Ke Nan, Maggie Samaan, Benjamin Burns et al.
DualRes: Production-ready Dynamic Object Detection
Jibril El Hassani, Thomas Verelst
DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation
Beomseok Kang, Niluthpol Chowdhury Mithun, Abhinav Rajvanshi et al.
DuPLUS: Dual-Prompt Vision-Language Model for Universal Medical Image Segmentation and Prognosis
Numan Saeed, Tausifa Jan Saleem, Fadillah Maani et al.
DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes
Runfa Blark Li, Mahdi Shaghaghi, Keito Suzuki et al.
Edge-Aware Image Manipulation via Diffusion Models with a Novel Structure-Preservation Loss
Minsu Gong, Nuri Ryu, Jungseul Ok et al.
Eff-GRot: Efficient and Generalizable Rotation Estimation with Transformers
Fanis Mathioulakis, Gorjan Radevski, Tinne Tuytelaars
Efficient Text-Guided Convolutional Adapter for the Diffusion Model
Aryan Das, Koushik Biswas, Swalpa Kumar Roy et al.
Efficient Vision Transformers via Token Merging with Head-wise Attention Correction
Yuki Ichikawa, Masato Motomura, Thiem Van Chu et al.
Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance
Francesco Ragusa, Michele Mazzamuto, Rosario Forte et al.
EllipssianNet: Image-guided Sampling of 2D Gaussians for Gaussian Splatting
MyoungGon Kim, JeongHyeon Ahn, Seohyeon Park et al.
EmojiDiff: Advanced Facial Expression Control with High Identity Preservation in Portrait Generation
Liangwei Jiang, Ruida Li, Zhifeng Zhang et al.
Empowering Source-Free Domain Adaptation via MLLM-Guided Reliability-Based Curriculum Learning
Dongjie Chen, Kartik Patwari, Zhengfeng Lai et al.
Enabling High-Quality In-the-Wild Imaging from Severely Aberrated Metalens Bursts
Debabrata Mandal, Zhihan Peng, Yujie Wang et al.