Papers
FlowMorph: Revealing an Optimizable Flow Latent Space for Controlled Image Morphing
Yan Zheng, Yi Yang, Lanqing Guo et al.
FlyPose: Towards Robust Human Pose Estimation From Aerial Views
Hassaan Farooq, Marvin Brenner, Peter Stütz
FNOPT: Resolution-Agnostic, Self-Supervised Cloth Simulation using Meta-Optimization with Fourier Neural Operators
Ruochen Chen, Thuy Tran, Shaifali Parashar
FocalComm: Hard Instance-Aware Multi-Agent Perception
Dereje Shenkut, Vijayakumar Bhagavatula
Food Image Generation on Multi-Noun Categories
Xinyue Pan, Yuhao Chen, Jiangpeng He et al.
ForestSplats: Deformable Transient Field for Gaussian Splatting in the Wild
Wongi Park, Myeongseok Nam, Siwon Kim et al.
Forget Less by Learning Together through Concept Consolidation
Arjun Ramesh Kaushik, Naresh Kumar Devulapally, Vishnu Suresh Lokhande et al.
FreeCond: Free Lunch in the Input Conditions of Text-Guided Inpainting
Teng-Fang Hsiao, Bo-Kai Ruan, Sung-Lin Tsai et al.
Frequency Is What You Need: Considering Word Frequency When Text Masking Benefits Vision-Language Model Pre-training
Mingliang Liang, Martha Larson
From Bands to Depth: Understanding Bathymetry Decisions on Sentinel-2
Satyaki Roy Chowdhury, Aswathnarayan Radhakrishnan, Hari Subramoni
From Cognitive Priors to Instance Semantics: A Unified Framework for Multi-task Affective Computing
Guanyu Hu, Dimitrios Kollias, Xinyu Yang
From Darkness to Detail: Frequency-Aware SSMs for Low-Light Vision
Eashan Adhikarla, Kai Zhang, Gong Chen et al.
From Detection to Anticipation: Online Understanding of Struggles across Various Tasks and Activities
Shijia Feng, Michael Wray, Walterio Mayol-Cuevas
From Few-Shot to Zero-Shot Pallet Load Recognition: A Deployed Embedding-Based Vision System for Industrial Logistics
Juan Jesús Losada del Olmo, Emilio Pardo Ballesteros, Pedro E. López-de-Teruel et al.
From Lightweight CNNs to SpikeNets: Benchmarking Accuracy-Energy Tradeoffs with Pruned Spiking SqueezeNet
Radib Bin Kabir, Tawsif Tashwar Dipto, Mehedi Ahamed et al.
From Prompt to Production: Automating Brand-Safe Marketing Imagery with Text-to-Image Models
Parmida Atighehchian, Henry Wang, Andrei Kapustin et al.
From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation
Shivanshu Agnihotri, Snehashis Majhi, Deepak Ranjan Nayak et al.
From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance
Jeongho Min, Dongyoung Kim, Jaehyup Lee
FSP-DETR: Few-Shot Prototypical Parasitic Ova Detection
Shubham Trehan, Udhav Ramachandran, Akash Rao et al.
FujiView: Multimodal Late-Fusion for Predicting Scenic Visibility
Bryceton Bible, Nehal Hasnaeen, Hairong Qi
FuLLaMa: Training-free Diffusion-based Object Removal with Context Preservation
Ilke Demir, Umur Aybars Ciftci
Fully Unsupervised Self-debiasing of Text-to-Image Diffusion Models
Korada Sri Vardhana, Shrikrishna Lolla, Soma Biswas
Fused Similarity Measure Based Alignment with Dual-Scale Adaptive Selection for Weakly Supervised Video Anomaly Detection
Yue-Gao Lu, Hong-Jie Xing, Chun-Guo Li
F-ViTA: Foundation Model Guided Visible to Infrared Translation
Jay Nitin Paranjape, Celso M De Melo, Vishal M. Patel
GAEA: A Geolocation Aware Conversational Assistant
Ron Campos, Ashmal Vayani, Parth Parag Kulkarni et al.