Papers
4,428 papers found
Histopath-C: Towards Realistic Domain Shifts for Histopathology Vision-Language Adaptation
Mehrdad Noori, Gustavo A. Vargas Hakim, David Osowiechi et al.
HodgeFormer: Transformers for Learnable Operators on Triangular Meshes through Data-Driven Hodge Matrices
Akis Nousias, Stavros Nousias
HOLO: Holistic Lightweight Optimization for Scene Understanding with Auto-Annotation and Multimodal Learning
Xiaoyun Hu, Xiaohan Yan, Nan Wang et al.
How I Met Your Bias: Investigating Bias Amplification in Diffusion Models
Nathan Roos, Ekaterina Iakovleva, Ani Gjergji et al.
How to Design and Train Your Implicit Neural Representation for Video Compression
Matthew Gwilliam, Roy Zhang, Namitha Padmanabhan et al.
HumanBench: Two Heads, No Legs, But Mostly Human, the State of Generative Capabilities in T2I Models
Anubhooti Jain, Mayank Vatsa, Richa Singh
HumanGuideNet: Adapter-Based Alignment of Deep Neural Networks with Human Similarity Judgments
Xufu Liu, Yifan Yang, Zhengxin Zhang
Human Knowledge Integrated Multi-modal Learning for Single Source Domain Generalization
Ayan Banerjee, Kuntal Thakur, Sandeep Gupta
Human Pose Aggregation for Multi-View Temporal Video Alignment
Fabien Delattre, Tsung-Wei Huang, Guan-Ming Su et al.
Hybrid State Representation for Video Procedure Planning
Woo Suk Choi, Youwon Jang, Minsu Lee et al.
HyPCA-Net: Advancing Multimodal Fusion in Medical Image Analysis
Joy Dhar, Manish Kumar Pandey, Debashis Das Chakladar et al.
HyperPose: Hyper-pose Embeddings for 3D-Aware Generative Models with Self-Supervised Disentangling of Pose and Scene
Mijeong Kim, Namgi Kim, Bohyung Han
ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research
Gerhard Krumpl, Henning Avenhaus, Horst Possegger
IDEAL-M3D: Instance Diversity-Enriched Active Learning for Monocular 3D Detection
Johannes Meier, Florian Günther, Riccardo Marin et al.
Identity Verification from Human Scent using Channel Representation of 2D Gas Chromatography-Mass Spectrometry Data
Radim Spetlik, Jan Hlavsa, Jana Čechová et al.
Illuminating Darkness: Learning to Enhance Low-light Images In-the-Wild
S. M. A. Sharif, Abdur Rehman, Zain Ul Abidin et al.
ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models
Danae Sanchez Villegas, Ingo Ziegler, Desmond Elliott
Image-Guided Semantic Pseudo-LiDAR Point Generation for 3D Object Detection
Minseung Lee, Seokha Moon, Seung Joon Lee et al.
ImageNet-sES: A First Systematic Study of Sensor-Environment Simulation Anchored by Real Recaptures
Ji-yoon Kim, Eunsu Baek, Hyung-Sin Kim
Imitating the Functionality of Image-to-Image Models Using a Single Example
Nurit Spingarn, Tomer Michaeli
IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion
Shashank Mishra, Karan Patil, Didier Stricker et al.
iMotion-LLM: Instruction-Conditioned Trajectory Generation
Abdulwahab Felemban, Nussair Hroub, Jian Ding et al.
IMPACT: Interpretable Most Important Person Analysis and Classification using Transformer-based Models
Akshat Rampuria, Kamakshya Prasad Nayak, Kamalakar Vijay Thakare et al.
Improved Wildfire Spread Prediction with Time-Series Data and the WSTS+ Benchmark
Saad Lahrichi, Jake Bova, Jesse Johnson et al.