Papers
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Tobias Christian Nauen, Sebastian Palacio, Federico Raue et al.
Who Brings the Frisbee: Probing Hidden Hallucination Factors in Large Vision-Language Model via Causality Analysis
Po-Hsuan Huang, Jeng-Lin Li, Chin-Po Chen et al.
WiGNet: Windowed Vision Graph Neural Network
Gabriele Spadaro, Marco Grangetto, Attilio Fiandrotti et al.
WINE: Wavelet-Guided GAN Inversion and Editing for High-Fidelity Refinement
Chaewon Kim, Seung Jun Moon, Gyeong-Moon Park
XPose: Towards Extreme Low Light Hand Pose Estimation
Green Rosh, Meghana Shankar, Prateek Kukreja et al.
XR-MBT: Multi-Modal Full Body Tracking for XR through Self-Supervision with Learned Depth Point Cloud Registration
Denys Rozumnyi, Nadine Bertsch, Othman Sbai et al.
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset
Olaf Wysocki, Yue Tan, Thomas Froech et al.
ZeroComp: Zero-Shot Object Compositing from Image Intrinsics via Diffusion
Zitian Zhang, Frédéric Fortier-Chouinard, Mathieu Garon et al.
Zero-Shot Class Unlearning in CLIP with Synthetic Samples
Alexey Kravets, Vinay Namboodiri
Zero-Shot Detection of Out-of-Context Objects using Foundation Models
Anirban Roy, Adam Cobb, Ramneet Kaur et al.
2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation
Ozan Unal, Dengxin Dai, Lukas Hoyer et al.
360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View
Zhifeng Teng, Jiaming Zhang, Kailun Yang et al.
3D-Aware Talking-Head Video Motion Transfer
Haomiao Ni, Jiachen Liu, Yuan Xue et al.
3D Face Style Transfer With a Hybrid Solution of NeRF and Mesh Rasterization
Jianwei Feng, Prateek Singhal
3D Human Pose Estimation With Two-Step Mixed-Training Strategy
Yingfeng Wang, Zhengwei Wang, Muyu Li et al.
3D Reconstruction of Interacting Multi-Person in Clothing From a Single Image
Junuk Cha, Hansol Lee, Jaewon Kim et al.
3D Super-Resolution Model for Vehicle Flow Field Enrichment
Thanh Luan Trinh, Fangge Chen, Takuya Nanri et al.
3SD: Self-Supervised Saliency Detection With No Labels
Rajeev Yasarla, Renliang Weng, Wongun Choi et al.
4K-Resolution Photo Exposure Correction at 125 FPS With ~8K Parameters
Yijie Zhou, Chao Li, Jin Liang et al.
A*: Atrous Spatial Temporal Action Recognition for Real Time Applications
Myeongjun Kim, Federica Spinola, Philipp Benz et al.
A Closer Look at Robustness of Vision Transformers to Backdoor Attacks
Akshayvarun Subramanya, Soroush Abbasi Koohpayegani, Aniruddha Saha et al.
A Coarse-To-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection
Anas Al-lahham, Nurbek Tastan, Muhammad Zaigham Zaheer et al.
Active Batch Sampling for Multi-Label Classification With Binary User Feedback
Debanjan Goswami, Shayok Chakraborty
Active Learning for Single-Stage Object Detection in UAV Images
Asma Yamani, Albandari Alyami, Hamzah Luqman et al.
Active Learning With Task Consistency and Diversity in Multi-Task Networks
Aral Hekimoglu, Michael Schmidt, Alvaro Marcos-Ramiro