Papers
4,428 papers found
Improving Animal Pose Estimation through Species Similarity Measures and Rigorous Label Definition
Medhashree Parhy, Shaan Chanchani, Claire Kim et al.
Improving Out-of-Distribution Detection Using Segmented Images and Cross-View Attention Fusion
Alexander Politowicz, Sahisnu Mazumder, Bing Liu
Improvise, Adapt, Overcome -- Telescopic Adapters for Efficient Fine-tuning of Vision Language Models in Medical Imaging
Ujjwal Mishra, Vinita Shukla, Praful Hambarde et al.
Inpaint360GS: Efficient Object-Aware 3D Inpainting via Gaussian Splatting for 360deg Scenes
Shaoxiang Wang, Shihong Zhang, Christen Millerdurai et al.
Inpainting of Sparse Depth Maps from Monocular Depth-from-Focus on Pixel Processor Arrays
Maciej Lewandowski, Piotr Dudek
INRetouch: Context Aware Implicit Neural Representation for Photography Retouching
Omar Elezabi, Marcos V. Conde, Zongwei Wu et al.
Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification
Pengfei Gu, Huimin Li, Haoteng Tang et al.
InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation
Sreehari Rajan, Kunal Bhosikar, Charu Sharma
Interaction-via-Actions: Cattle Interaction Detection with Joint Learning of Action-Interaction Latent Space
Ren Nakagawa, Yang Yang, Risa Shinoda et al.
Interleaved Vision-and-Language Generation via Generative Voken
Kaizhi Zheng, Xuehai He, Xin Eric Wang
Intra-Class Probabilistic Embeddings for Uncertainty Estimation in Vision-Language Models
Zhenxiang Lin, Maryam Haghighat, Will Browne et al.
IPCD: Intrinsic Point-Cloud Decomposition
Shogo Sato, Takuhiro Kaneko, Shoichiro Takeda et al.
IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers
Gihwan Kim, Jemin Lee, Hyungshin Kim
ISALux: Illumination and Semantics-Aware Transformer Employing Mixture of Experts for Low Light Image Enhancement
Raul Balmez, Alexandru Brateanu, Ciprian Orhei et al.
Isolating the Role of Temporal Information in Video Saliency: A Controlled Experimental Analysis
Peter El-Jiz, Matthias Kuemmerer, Matthias Tangemann et al.
ITSELF: Attention Guided Fine-Grained Alignment for Vision-Language Retrieval
Tien-Huy Nguyen, Huu-Loc Tran, Thanh Duc Ngo
JOCA: Task-Driven Joint Optimisation of Camera Hardware and Adaptive Camera Control Algorithms
Chengyang Yan, Mitch Bryson, Donald G. Dansereau
Joint Modeling of Corruption-Driven and Information-Limited Uncertainty for Robust 3D Gaussian Splatting
Zeji Hui, Amirali Khodadadian Gostar, WeiQin Chuah et al.
Joint Optimization of Camera Model and Deep Neural Network for Image Recognition
Youta Noboru, Yuko Ozasa, Masayuki Tanaka
KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird's-Eye-View Segmentation
Wenke E, Yixin Sun, Jiaxu Liu et al.
KFS-Bench: Comprehensive Evaluation of Key Frame Sampling in Long Video Understanding
Zongyao Li, Kengo Ishida, Satoshi Yamazaki et al.
KMOPS: Keypoint-Driven Method for Multi-Object Pose and Metric Size Estimation from Stereo Images
Ying-Kun Wu, Yi Shen, Tzuhsuan Huang et al.
Knowledge to Sight: Reasoning over Visual Attributes via Knowledge Decomposition for Abnormality Grounding
Jun Li, Che Liu, Wenjia Bai et al.
LangPose: Language-Aligned Motion for Robust 3D Human Pose Estimation
Longyun Liao, Rong Zheng