Papers
4,428 papers found
Focusing on What to Decode and What to Train: SOV Decoding with Specific Target Guided DeNoising and Vision Language Advisor
Junwen Chen, Yingcheng Wang, Keiji Yanai
Forensic Iris Image-Based Post-Mortem Interval Estimation
Rasel Ahmed Bhuiyan, Adam Czajka
FOR: Finetuning for Object Level Open Vocabulary Image Retrieval
Hila Levi, Guy Heller, Dan Levi
Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering
Sai Bhargav Rongali, Mohamad Hassan N C, Ankit Jha et al.
Foundation X: Integrating Classification Localization and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis
Nahid Ul Islam, DongAo Ma, Jiaxuan Pang et al.
Frame by Familiar Frame: Understanding Replication in Video Diffusion Models
Aimon Rahman, Malsha V. Perera, Vishal M. Patel
FRAUD-Net: Fraud News Detection using Sample Uncertainty & Domain Aware Generalized Network
Devendra Patel, Vikas Verma, Shreyas Kumar Tah et al.
Frequency-Domain Refinement of Vision Transformers for Robust Medical Image Segmentation under Degradation
Sanaz Karimijafarbigloo, Sina Ghorbani Kolahi, Reza Azad et al.
From Visual Explanations to Counterfactual Explanations with Latent Diffusion
Tung Luu, Nam Le, Duc Le et al.
FT2TF: First-Person Statement Text-To-Talking Face Generation
Xingjian Diao, Ming Cheng, Wayner Barrios et al.
FUN-AD: Fully Unsupervised Learning for Anomaly Detection with Noisy Training Data
Jiin Im, Yongho Son, Je Hyeong Hong
GaitCloud: Leveraging Spatial-Temporal Information for LiDAR-Base Gait Recognition with A True-3D Gait Representation
Shaoxiong Zhang, Hiromitsu Awano, Takashi Sato
GaitContour: Efficient Gait Recognition Based on a Contour-Pose Representation
Yuxiang Guo, Anshul Shah, Jiang Liu et al.
GANESH: Generalizable NeRF for Lensless Imaging
Rakesh Raj Madhavan, Akshat Kaimal, Badhrinarayanan K.V et al.
GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space
Souhaib Attaiki, Paul Guerrero, Duygu Ceylan et al.
GAUDA: Generative Adaptive Uncertainty-Guided Diffusion-Based Augmentation for Surgical Segmentation
Yannik Frisch, Christina Bornberg, Moritz Fuchs et al.
GauFRe: Gaussian Deformation Fields for Real-Time Dynamic Novel View Synthesis
Yiqing Liang, Numair Khan, Zhengqin Li et al.
GaussianBeV : 3D Gaussian Representation Meets Perception Models for BeV Segmentation
Florian Chabot, Nicolas Granger, Guillaume Lapouge
Gaussian Deja-vu: Creating Controllable 3D Gaussian Head-Avatars with Enhanced Generalization and Personalization Abilities
Peizhi Yan, Rabab Ward, Qiang Tang et al.
GazeSearch: Radiology Findings Search Benchmark
Trong Thang Pham, Tien-Phat Nguyen, Yuki Ikebe et al.
Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
Hung-Shuo Chang, Chien-Yao Wang, Richard Robert Wang et al.
Generalizable Single-Source Cross-Modality Medical Image Segmentation via Invariant Causal Mechanisms
Boqi Chen, Yuanzhi Zhu, Yunke Ao et al.
Generalizable Single-View Object Pose Estimation by Two-Side Generating and Matching
Yujing Sun, Caiyi Sun, Yuan Liu et al.
GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts
Sameer Ambekar, Zehao Xiao, Xiantong Zhen et al.
Generating Long-Take Videos via Effective Keyframes and Guidance
Hsin-Ping Huang, Yu-Chuan Su, Ming-Hsuan Yang