Papers
8,761 papers found
Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech
Tobias Weise, Philipp Klumpp, Kubilay Can Demir et al.
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Hiroshi Sato, Takafumi Moriya, Masato Mimura et al.
Speaker Change Detection with Weighted-sum Knowledge Distillation based on Self-supervised Pre-trained Models
Hang Su, Yuxiang Kong, Lichun Fan et al.
Speaker Conditional Sinc-Extractor for Personal VAD
En-Lun Yu, Kuan-Hsun Ho, Jeih-weih Hung et al.
Speaker Detection by the Individual Listener and the Crowd: Parametric Models Applicable to Bonafide and Deepfake Speech
Tomi H. Kinnunen, Rosa Gonzalez Hautamäki, Xin Wang et al.
Speaker-Independent Acoustic-to-Articulatory Inversion through Multi-Channel Attention Discriminator
Woo-Jin Chung, Hong-Goo Kang
Speaker Personalization for Automatic Speech Recognition using Weight-Decomposed Low-Rank Adaptation
George Joseph, Arun Baby
Speaker-Smoothed kNN Speaker Adaptation for End-to-End ASR
Shaojun Li, Daimeng Wei, Hengchao Shang et al.
Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization
Xiang Li, Vivek Govindan, Rohit Paturi et al.
Speaking of Health: Leveraging Large Language Models to assess Exercise Motivation and Behavior of Rehabilitation Patients
Suhas BN, Amanda Rebar, Saeed Abdullah
Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation
Miseul Kim, Soo-Whan Chung, Youna Ji et al.
Specializing Self-Supervised Speech Representations for Speaker Segmentation
Séverin Baroudi, Thomas Pellegrini, Hervé Bredin
Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models
Bolaji Yusuf, Murali Karthick Baskar, Andrew Rosenberg et al.
Speech After Gender: A Trans-Feminine Perspective on Next Steps for Speech Science and Technology
Robin Netzorg, Alyssa Cote, Sumi Koshin et al.
Speech and Language Recognition with Low-rank Adaptation of Pretrained Models
Amrutha Prasad, Srikanth Madikeri, Driss Khalil et al.
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
Takaaki Saeki, Soumi Maiti, Shinnosuke Takamichi et al.
Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds
Hanbin Bae, Pavel Andreev, Azat Saginbaev et al.
Speech dereverberation constrained on room impulse response characteristics
Louis Bahrman, Mathieu Fontaine, Jonathan Le Roux et al.
Speech emotion recognition with deep learning beamforming on a distant human-robot interaction scenario
Ricardo García, Rodrigo Mahu, Nicolás Grágeda et al.
Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction
Yuan Gao, Hao Shi, Chenhui Chu et al.
Speech enabled visual acuity test
Boon Peng Yap, Kok Liang Tan, Zhenghao Li et al.
Speech Formants Integration for Generalized Detection of Synthetic Speech Spoofing Attacks
Kexu Liu, Yuanxin Wang, Shengchen Li et al.
Speech foundation models in healthcare: Effect of layer selection on pathological speech feature prediction
Daniela A. Wiepert, Rene L. Utianski, Joseph R. Duffy et al.
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond
Beomseok Lee, Ioan Calapodescu, Marco Gaido et al.
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et al.