Papers
Zero-Shot Domain Generalisation via Prompt-Driven Feature Refinement
Tingrui Qiao, Di Zhao, Caroline Walker et al.
Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention
Junhao Xing, Ryohei Miyakawa, Yang Yang et al.
Zero-Shot Table Extraction in Business Documents: A Unified Benchmark with Error Taxonomy and Ecological Analysis
Eliott Thomas, Mickael Coustaty, Aurélie Joseph et al.
Zero-Shot Video Deraining with Video Diffusion Models
Tuomas Varanka, Juan Luis Gonzalez, Hyeongwoo Kim et al.
ZonUI-3B: Competitive GUI Grounding with a 3B VLM Trained on a Single Consumer GPU
ZongHan Hsieh, ShengJing Yang, Tzer-Jen Wei
360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translation
Hai Wang, Jing-Hao Xue
3D Edge Sketch from Multiview Images
Yilin Zheng, Chiang-Heng Chien, Ricardo Fabbri et al.
3D Part Segmentation via Geometric Aggregation of 2D Visual Features
Marco Garosi, Riccardo Tedoldi, Davide Boscaini et al.
3D Shape Completion using Multi-Resolution Spectral Encoding
Pallabjyoti Deka, Saumik Bhattacharya, Debashis Sen et al.
3D Synthesis for Architectural Design
I-Ting Tsai, Bharath Hariharan
3D Understanding of Deformable Linear Objects: Datasets and Transferability Benchmark
Bare Luka Žagar, Mingyu Liu, Tim Hertel et al.
A 0-Shot Self-Attention Mechanism for Accelerated Diagonal Attention
Mario Viti, Nadiya Shvai, Arcadi Llanza et al.
ACE: Action Concept Enhancement of Video-Language Models in Procedural Videos
Reza Ghoddoosian, Nakul Agarwal, Isht Dwivedi et al.
ACE: Anatomically Consistent Embeddings in Composition and Decomposition
Ziyu Zhou, Haozhe Luo, Mohammad Reza Hosseinzadeh Taher et al.
Achieving Byzantine-Resilient Federated Learning via Layer-Adaptive Sparsified Model Aggregation
Jiahao Xu, Zikai Zhang, Rui Hu
AC-IND: Sparse CT Reconstruction Based on Attenuation Coefficient Estimation and Implicit Neural Distribution
Wangduo Xie, Richard Schoonhoven, Tristan van Leeuwen et al.
A Conflict-Guided Evidential Multimodal Fusion for Semantic Segmentation
Lucas Deregnaucourt, Hind Laghmara, Alexis Lechervy et al.
A Conic Transformation Approach for Solving the Perspective-Three-Point Problem
Haidong Wu, Snehal Bhayani, Janne Heikkilä
ActionDiffusion: An Action-Aware Diffusion Model for Procedure Planning in Instructional Videos
Lei Shi, Paul-Christian Bürkner, Andreas Bulling
Active Event Alignment for Monocular Distance Estimation
Nan Cai, Pia Bideau
Active Learning for Image Segmentation with Binary User Feedback
Debanjan Goswami, Shayok Chakraborty
Active Learning for Vision Language Models
Bardia Safaei, Vishal M. Patel
Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation
Fei Wu, Pablo Márquez Neila, Hedyeh Rafii-Tari et al.
Ad^2mix: Adversarial and Adaptive Mixup for Unsupervised Domain Adaptation
Lei Zhu, Yanyu Xu, Yong Liu et al.
AdaPrefix++: Integrating Adapters, Prefixes and Hypernetwork for Continual Learning
Sayanta Adhikari, Dupati Srikar Chandra, P. K. Srijith et al.