Papers
4,428 papers found
ENCORE: A Neural Collapse Perspective on Out-of-Distribution Detection in Deep Neural Networks
A. Q. M. Sazzad Sayyed, Nathaniel D. Bastian, Francesco Restuccia
EndoPBR: Photorealistic Synthetic Data for Surgical 3D Vision via Physically-based Rendering
John J. Han, Jie Ying Wu
End-to-End Fine-Tuning of 3D Texture Generation using Differentiable Rewards
AmirHossein Zamani, Tianhao Xie, Amir G. Aghdam et al.
Enhanced Back-Projection of Vision Features for 3D Symmetry Detection
Isaac Aguirre, Ivan Sipiran
Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors
Giorgos Karvounas, Nikolaos Kyriazis, Iason Oikonomidis et al.
Enhancing Object Detection Training via Joint Image-Annotation Generation
Roy Uziel, Oded Bialer
Enhancing Reverse Distillation with Core Exemplar Learning for Unified Multi-Class Anomaly Detection
Heechul Lim, Min-Soo Kim, Hyun-Boo Lee et al.
Enhancing Vision Language Corruption Robustness using Cross-Distribution & Prompted Denoisers
Sameer Shafayet Latif, Sadab Shiper, K. M. Rahiduzzaman Kiran et al.
Enhancing Visual Planning with Auxiliary Tasks and Multi-token Prediction
Ce Zhang, Yale Song, Ruta Desai et al.
Equivariant Sampling for Improving Diffusion Model-based Image Restoration
Chenxu Wu, Qingpeng Kong, Peiang Zhao et al.
Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Frechet Distance
Jaywon Koo, Jefferson Hernandez, Moayed Haji-Ali et al.
Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation
Huaying Zhang, Atsushi Hashimoto, Tosho Hirasawa
Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection
Aayush Atul Verma, Arpitsinh Vaghela, Bharatesh Chakravarthi et al.
EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models
Wenhui Zhu, Xiwen Chen, Zhipeng Wang et al.
ExDDV: A New Dataset for Explainable Deepfake Detection in Video
Vlad Hondru, Eduard Hogea, Darian Onchis et al.
Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining Disasters
Mizanur Rahman Jewel, Mohamed Elmahallawy, Sanjay Madria et al.
Exploiting Label-Independent Regularization from Spatial Patterns for Whole Slide Image Analysis
Weiyi Wu, Xinwen Xu, Chongyang Gao et al.
Exploring Automated Recognition of Instructional Activity and Discourse from Multimodal Classroom Data
Ivo Bueno, Ruikun Hou, Babette Bühler et al.
Exploring the Boundaries of Diffusion Models for Offline Writer Identification with Sparse and Intra-Variable Data
Aritra Dey, Chandranath Adak, Kumari Priya et al.
Extreme Amodal Face Detection
Changlin Song, Yunzhong Hou, Michael Randall Barnes et al.
Eye-for-an-eye: Appearance Transfer with Dense Semantic Correspondence in Diffusion Models
Sooyeon Go, Kyungmook Choi, Minjung Shin et al.
Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Ashutosh Chaubey, Xulang Guan, Mohammad Soleymani
FAE-Net: Fashion Attribute Editing via Disentangled Latent Conditioning in Diffusion Models
P. Rajith Bhargav, Gaurab Bhattacharya, B S Vivek et al.
FairScene: Learning Class-Disentangled 2D/3D Representations for Semantic Scene Completion
Dian Jia, Pei Yu, Wei Tang
FAIR-SIGHT: Fairness Assurance in Image Recognition via Simultaneous Conformal Thresholding and Dynamic Output Repair
Arya Fayyazi, Mehdi Kamal, Massoud Pedram