Research Explorer

Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech

Tobias Weise, Philipp Klumpp, Kubilay Can Demir et al.

2024 INTERSPEECH

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling

Hiroshi Sato, Takafumi Moriya, Masato Mimura et al.

2024 INTERSPEECH

Speaker Change Detection with Weighted-sum Knowledge Distillation based on Self-supervised Pre-trained Models

Hang Su, Yuxiang Kong, Lichun Fan et al.

2024 INTERSPEECH

Speaker Conditional Sinc-Extractor for Personal VAD

En-Lun Yu, Kuan-Hsun Ho, Jeih-weih Hung et al.

2024 INTERSPEECH

Speaker Detection by the Individual Listener and the Crowd: Parametric Models Applicable to Bonafide and Deepfake Speech

Tomi H. Kinnunen, Rosa Gonzalez Hautamäki, Xin Wang et al.

2024 INTERSPEECH

Speaker-Independent Acoustic-to-Articulatory Inversion through Multi-Channel Attention Discriminator

Woo-Jin Chung, Hong-Goo Kang

2024 INTERSPEECH

Speaker Personalization for Automatic Speech Recognition using Weight-Decomposed Low-Rank Adaptation

George Joseph, Arun Baby

2024 INTERSPEECH

Speaker-Smoothed kNN Speaker Adaptation for End-to-End ASR

Shaojun Li, Daimeng Wei, Hengchao Shang et al.

2024 INTERSPEECH

Speakers Unembedded: Embedding-free Approach to Long-form Neural Diarization

Xiang Li, Vivek Govindan, Rohit Paturi et al.

2024 INTERSPEECH

Speaking of Health: Leveraging Large Language Models to assess Exercise Motivation and Behavior of Rehabilitation Patients

Suhas BN, Amanda Rebar, Saeed Abdullah

2024 INTERSPEECH

Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation

Miseul Kim, Soo-Whan Chung, Youna Ji et al.

2024 INTERSPEECH

Specializing Self-Supervised Speech Representations for Speaker Segmentation

Séverin Baroudi, Thomas Pellegrini, Hervé Bredin

2024 INTERSPEECH

Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models

Bolaji Yusuf, Murali Karthick Baskar, Andrew Rosenberg et al.

2024 INTERSPEECH

Speech After Gender: A Trans-Feminine Perspective on Next Steps for Speech Science and Technology

Robin Netzorg, Alyssa Cote, Sumi Koshin et al.

2024 INTERSPEECH

Speech and Language Recognition with Low-rank Adaptation of Pretrained Models

Amrutha Prasad, Srikanth Madikeri, Driss Khalil et al.

2024 INTERSPEECH

SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics

Takaaki Saeki, Soumi Maiti, Shinnosuke Takamichi et al.

2024 INTERSPEECH

Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds

Hanbin Bae, Pavel Andreev, Azat Saginbaev et al.

2024 INTERSPEECH

Speech dereverberation constrained on room impulse response characteristics

Louis Bahrman, Mathieu Fontaine, Jonathan Le Roux et al.

2024 INTERSPEECH

Speech emotion recognition with deep learning beamforming on a distant human-robot interaction scenario

Ricardo García, Rodrigo Mahu, Nicolás Grágeda et al.

2024 INTERSPEECH

Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction

Yuan Gao, Hao Shi, Chenhui Chu et al.

2024 INTERSPEECH

Speech enabled visual acuity test

Boon Peng Yap, Kok Liang Tan, Zhenghao Li et al.

2024 INTERSPEECH

Speech Formants Integration for Generalized Detection of Synthetic Speech Spoofing Attacks

Kexu Liu, Yuanxin Wang, Shengchen Li et al.

2024 INTERSPEECH

Speech foundation models in healthcare: Effect of layer selection on pathological speech feature prediction

Daniela A. Wiepert, Rene L. Utianski, Joseph R. Duffy et al.

2024 INTERSPEECH

Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond

Beomseok Lee, Ioan Calapodescu, Marco Gaido et al.

2024 INTERSPEECH

Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions

Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et al.

2024 INTERSPEECH

Papers