Papers
Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition
Kha Nhat Le, Hoang-Tuan Nguyen, Hung Tien Tran et al.
Street TryOn: Learning In-the-Wild Virtual Try-On from Unpaired Person Images
Aiyu Cui, Jay Mahajan, Viraj Shah et al.
STRIDE: Single-Video Based Temporally Continuous Occlusion-Robust 3D Pose Estimation
Rohit Lal, Saketh Bachu, Yash Garg et al.
Structure-Aware Human Body Reshaping with Adaptive Affinity-Graph Network
Qiwen Deng, Yangcen Liu
Structured Human Assessment of Text-to-Image Generative Models
Ciprian A. Corneanu, Qianli Feng, Aleix M. Martinez
Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language Models
Niloufar Alipour Talemi, Hossein Kashiani, Fatemeh Afghah
SUM: Saliency Unification through Mamba for Visual Attention Modeling
Alireza Hosseini, Amirhossein Kazerouni, Saeed Akhavan et al.
Sun Off Lights On: Photorealistic Monocular Nighttime Simulation for Robust Semantic Perception
Konstantinos Tzevelekakis, Shutong Zhang, Luc Van Gool et al.
Survival Prediction in Lung Cancer through Multi-Modal Representation Learning
Aiman Farooq, Deepak Mishra, Santanu Chaudhury
SV-data2vec: Guiding Video Representation Learning with Latent Skeleton Targets
Zorana Doždor, Tomislav Hrkac, Zoran Kalafatic
Swap Path Network for Robust Person Search Pre-Training
Lucas Jaffe, Avideh Zakhor
Swin-: Gradient-Based Image Restoration from Image Sequences using Video Swin-Transformers
Monika Kwiatkowski, Simon Matern, Olaf Hellwich
SwinIA: Self-Supervised Blind-Spot Image Denoising without Convolutions
Mikhail Papkov, Pavel Chizhov, Leopold Parts
SyncDiff: Diffusion-Based Talking Head Synthesis with Bottlenecked Temporal Visual Prior for Improved Synchronization
Xulin Fan, Heting Gao, Ziyi Chen et al.
SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering
Hiroki Nishizawa, Keitaro Tanaka, Asuka Hirata et al.
SynDRA: Synthetic Dataset for Railway Applications
Gianluca D'Amico, Federico Nesti, Giulio Rossolini et al.
SynDroneVision: A Synthetic Dataset for Image-Based Drone Detection
Tamara R. Lenhard, Andreas Weinmann, Kai Franke et al.
TACLE: Task and Class-Aware Exemplar-Free Semi-Supervised Class Incremental Learning
Jayateja Kalla, Rohit Kumar, Soma Biswas
TaCOS: Task-Specific Camera Optimization with Simulation
Chengyang Yan, Donald G. Dansereau
Talking Head Anime 4: Distillation for Real-Time Performance
Pramook Khungurn
TAM-VT: Transformation-Aware Multi-Scale Video Transformer for Segmentation and Tracking
Raghav Goyal, Wan-Cyuan Fan, Mennatullah Siam et al.
Task Configuration Impacts Annotation Quality and Model Training Performance in Crowdsourced Image Segmentation
Benjamin R Bauchwitz, Mary Cummings
TaxaBind: A Unified Embedding Space for Ecological Applications
Srikumar Sastry, Subash Khanal, Aayush Dhakal et al.
Temporal Dynamics in Visual Data: Analyzing the Impact of Time on Classification Accuracy
Tom Pégeot, Eva Feillet, Adrian Popescu et al.