Papers
Seeing Is Believing: Grounding Long-Video Understanding in Spatio-Temporal Visual Evidence
Zhaoyang Wei, Guoliang Wang, Guohua Gao et al.
Seeing Is Believing: Rich-Context Hallucination Detection for MLLMs via Backward Visual Grounding
Pinxue Guo, Chongruo Wu, Xinyu Zhou et al.
Seeing No Evil: Blinding Large Vision-Language Models to Safety Instructions via Adversarial Attention Hijacking
Jingru Li, Wei Ren, Tianqing Zhu
Seeing the Unseen: Zooming in the Dark with Event Cameras
Dachun Kai, Zeyu Xiao, Huyue Zhu et al.
Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems
Mengzhuo Chen, Junjie Wang, Fangwen Mu et al.
Seeing through the Conflict: Transparent Knowledge Conflict Handling in Retrieval-Augmented Generation
Hua Ye, Siyuan Chen, Ziqi Zhong et al.
Seeing Through the Rain: Resolving High-Frequency Conflicts in Deraining and Super-Resolution via Diffusion Guidance
Wenjie Li, Jinglei Shi, Jin Han et al.
Seeing Words Differently: Visual Embeddings for Robust English-Arabic Machine Translation
Mahdi Alshaikh Saleh, Irfan Ahmad
See More, Store Less: Memory-Efficient Resolution for Video Moment Retrieval
Mingyu Jeon, Sungjin Han, Jinkwon Hwang et al.
See, Record, Do: Automated Generation of UI Workflows from Tutorial Videos
Adam Beauchaine, Craig Shue
SEE: Signal Embedding Energy for Quantifying Noise Interference in Large Audio Language Models
Yuanhe Zhang, Jiayu Tian, Yibo Zhang et al.
See the Forest for the Trees: Loosely Speculative Decoding via Visual-Semantic Guidance for Efficient Inference of Video LLMs
Yicheng Ji, Jun Zhang, Jinpeng Chen et al.
See, Think, Learn: A Self-Taught Multimodal Reasoner
Sourabh Sharma, Sonam Gupta, Sadbhawna Sadbhawna
SEFEL: A Simple Yet Effective Framework for Fast Event Linking
Yinan Liu, Ziyang Zhang, Bin Wang et al.
SEG4SEG: Identifying Systematic Failure Modes in Segmentation by Subgroup Discovery Methods
Nina Weng, Eike Petersen, Alceu Bissoto et al.
SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features
Jinyuan Qu, Hongyang Li, Xingyu Chen et al.
SegMango: Early Deep Mango Yield Prediction based on Flower Segmentation and Weather Data
Janaksinh Ven, Charu Sharma, Azeemuddin Syed
SegMaST: Mamba-based Spatio-Temporal Modeling to Improve Longitudinal Disease Detection and Segmentation
Aswathi Varma, Jonas Weidner, Laurin Lux et al.
SegMem-RAG: Adaptive Memory for Retrieval-Augmented Generation in Open-Ended Knowledge Environments
Xuanbo Fan, Tianqi Zhao, Yi Cheng et al.
Segment and Matte Anything in a Unified Model
Zezhong Fan, Xiaohan Li, Topojoy Biswas et al.
Segment Anything Across Shots: A Method and Benchmark
Hengrui Hu, Kaining Ying, Henghui Ding
Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation
Aditi Agarwal, Anjali Jain, Nikita Saxena et al.
Segmentation Strategy Matters: Benchmarking Whisper on Persian YouTube Content
Reihaneh Iranmanesh, Rojin Ziaei, Joe Garman
Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing
Zifan Jiang, Youngjoon Jang, Liliane Momeni et al.