Papers
4,428 papers found
SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense
Jiayang Liu, Daniel Ts'o, Yiming Bu et al.
Safe Vision-Language Models via Unsafe Weights Manipulation
Moreno D'incà, Elia Peruzzo, Xingqian Xu et al.
SAIL: Self-supervised Learning of Lighting-Invariant Representations from Real Images with Latent Diffusion
Hala Djeghim, Céline Loscos, Désiré Sidibé
Salience-SGG: Enhancing Unbiased Scene Graph Generation with Iterative Salience Estimation
Runfeng Qu, Ole Hall, Pia K Bideau et al.
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Aleksandr Gordeev, Vladimir Dokholyan, Irina Tolstykh et al.
SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation
Hu Cui, Wenqiang Hua, Renjing Huang et al.
SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance
Suzanne Stathatos, Michael Hobley, Pietro Perona et al.
SAVE: Sparse Autoencoder-Driven Visual Information Enhancement for Mitigating Object Hallucination
Sangha Park, Seungryong Yoo, Jisoo Mok et al.
Scalable Video Action Anticipation with Cross Linear Attentive Memory
Zeyun Zhong, Manuel Martin, David Schneider et al.
SCALEX: Scalable Concept and Latent Exploration for Diffusion Models
E. Zhixuan Zeng, Yuhao Chen, Alexander Wong
Scalpel: Fine-Grained Alignment of Attention Activation Manifolds via Mixture Gaussian Bridges to Mitigate Multimodal Hallucination
Ziqiang Shi, Rujie Liu, Shanshan Yu et al.
SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection
Chun-Jung Lin, Tat-Jun Chin, Sourav Garg et al.
SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis
Hou In Ivan Tam, Hou In Derek Pun, Austin T. Wang et al.
SceneProp: Combining Neural Network and Markov Random Field for Scene-Graph Grounding
Keita Otani, Tatsuya Harada
SceneShine: Illumination-aware Human Scene Gaussian Re-Splatting from Mobile Device Video
Xuqian Ren, Wenjia Wang, Mai Ngoc Nguyen et al.
ScoliGaitX: A Deep Multi-Modal Fusion Network for Scoliosis Assessment via Gait Video Analysis
Kaushik Vishwakarma, Aditya Nigam
ScoreNet: Netting Lightweight Quality Scores for Better Visual Assessment with Large Multi-Modality Models
Bahador Rashidi, Kiarash Aghakasiri, Shupei Zhang et al.
SCORE: Soft Label Compression-Centric Dataset Condensation via Coding Rate Optimization
Bowen Yuan, Yuxia Fu, Zijian Wang et al.
SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning
Ziwei Chen, Ziling Liu, Zitong Huang et al.
SD-CSFL: A Synthetic Data-Driven Conformity Scoring Framework for Robust Federated Learning
Ebtisaam Alharbi, Abdulrahman Kerim, Leandro Soriano Marcolino et al.
SDT-6D: Fully Sparse Depth-Transformer for Staged End-to-End 6D Pose Estimation in Industrial Multi-View Bin Picking
Nico Leuze, Maximilian Hoh, Samed Doğan et al.
Sea-CLIP: Mining Semantic-Aware Representations for Few-Shot Anomaly Detection with CLIP
Xiao Guo, Zhimin Chen, Carlos D. Castillo et al.
SeaClips: A Video Dataset for Maritime Object Detection.
Franziska Denk, Christian Rankl, Shaban Almouahed et al.
Seeing is Believing (and Predicting): Context-Aware Multi-Human Behavior Prediction with Vision Language Models
Utsav Panchal, Yuchen Liu, Luigi Palmieri et al.