Papers
Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition
Yiming Rong, Yixin Zhang, Ziyi Wang et al.
Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception
Zhen Wan, Chao-Han Huck Yang, Jinchuan Tian et al.
SpeechLLM-as-Judges: Towards General and Interpretable Speech Quality Evaluation
Hui Wang, Jinghua Zhao, Yifan Yang et al.
SpeechMedAssist: Efficiently and Effectively Adapting Speech Language Models for Medical Consultation
Sirry Chen, Jieyi Wang, Wei Chen et al.
Speech Recognition Model Improves Text-to-Speech Synthesis Using Fine-Grained Reward
Guansu Wang, Peijie Sun
SPEED-Q: Staged Processing with Enhanced Distillation Towards Efficient Low-Bit On-Device VLM Quantization
Tianyu Guo, Shanwei Zhao, Shiai Zhu et al.
SPENCE: A Syntactic Probe for Detecting Contamination in NL2SQL Benchmarks
Mohammadtaher Safarzadeh, Hitesh Laxmichand Patel, Afshin Oroojlooy et al.
SphereDiff: Tuning-free 360° Static and Dynamic Panorama Generation via Spherical Latent Representation
Minho Park, Taewoong Kang, Jooyeol Yun et al.
SphereEdit: Spherical Semantic Editing in Diffusion Models
Salamata Konate, Hassan Hamidi, Elham Dolatabadi et al.
Spherical Geometry Diffusion: Generating High-quality 3D Face Geometry via Sphere-anchored Representations
Junyi Zhang, Yiming Wang, Yunhong Lu et al.
SpiderFlow: Efficient Topology-Aware Scheduling for LLM Training Across Decentralized GPU Clusters
Zihan Chang, Shuibing He, Bo Zhou et al.
SpiderGen: Towards Procedure Generation for Carbon Life Cycle Assessments with Generative AI
Anupama Sitaraman, Bharathan Balaji, Yuvraj Agarwal
SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation
Mahi Luthra, Jiayi Shen, Maxime Poli et al.
SpikCommander: A High-performance Spiking Transformer with Multi-view Learning for Efficient Speech Command Recognition
Jiaqi Wang, Liutao Yu, Xiongri Shen et al.
Spike Imaging Velocimetry: Dense Motion Estimation of Fluids Using Spike Streams
Yunzhong Zhang, You Zhou, Changqing Su et al.
SpikeRain: Towards Energy-Efficient Single Image Deraining with Spiking Neural Networks
Md Tanvir Islam, Inzamamul Alam, Sambit Bakshi et al.
Spike Stream Memory Transfer for Dynamic Scene Reconstruction
Yanchen Dong, Ruiqin Xiong, Rui Zhao et al.
Spiking-Aided Neural Architecture for Efficient and Robust WiFi Sensing
Yisha Lu, Liwen Jing, Jiangmao Zheng et al.
Spikingformer: A Key Foundation Model for Spiking Neural Networks
Chenlin Zhou, Liutao Yu, Zhaokun Zhou et al.
Spiking Heterogeneous Graph Attention Networks
Buqing Cao, Qian Peng, Xiang Xie et al.
SpikingIR: A Novel Converted Spiking Neural Network for Efficient Image Restoration
Yang Ouyang, Zihan Cheng, Xiaotong Luo et al.
SPIO: Ensemble and Selective Strategies via LLM-Based Multi-Agent Planning in Automated Data Science
Wonduk Seo, Juhyeon Lee, Yanjun Shao et al.
SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search
Yifan Zhang, Giridhar Ganapavarapu, Srideepika Jayaraman et al.
SPJFNet: Self-Mining Prior-Guided Joint Frequency Enhancement for Ultra-Efficient Dark Image Restoration
Tongshun Zhang, Pingping Liu, Zijian Zhang et al.
Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting
Hao-Jen Chien, Yi-Chuan Huang, Chung-Ho Wu et al.