Research Explorer

Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting

Yosuke Kashiwagi, Hayato Futami, Emiru Tsunoo et al.

2024 INTERSPEECH

Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting

Muhammad Yeza Baihaqi, Angel Garcia Contreras, Seiya Kawano et al.

2024 INTERSPEECH

Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings

Praveen Srinivasa Varadhan, Ashwin Sankar, Giri Raju et al.

2024 INTERSPEECH

RAST: A Reference-Audio Synchronization Tool for Dubbed Content

David Meyer, Eitan Abecassis, Clara Fernandez-Labrador et al.

2024 INTERSPEECH

RASU: Retrieval Augmented Speech Understanding through Generative Modeling

Hao Yang, Min Zhang, Minghan Wang et al.

2024 INTERSPEECH

RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

Yujie Chen, Jiangyan Yi, Jun Xue et al.

2024 INTERSPEECH

Reading Miscue Detection in Primary School through Automatic Speech Recognition

Lingyun Gao, Cristian Tejedor-Garcia, Helmer Strik et al.

2024 INTERSPEECH

Real-Time Gaze-directed speech enhancement for audio-visual hearing-aids

Arif Reza Anway, Bryony Buck, Mandar Gogate et al.

2024 INTERSPEECH

Real-time scheme for rapid extraction of speaker embeddings in challenging recording conditions

Kai Liu, Ziqing Du, Zhou Huan et al.

2024 INTERSPEECH

Real-time Speech Summarization for Medical Conversations

Khai Le-Duc, Khai-Nguyen Nguyen, Long Vo-Dang et al.

2024 INTERSPEECH

Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation

Alexander Kathan, Martin Bürger, Andreas Triantafyllopoulos et al.

2024 INTERSPEECH

Reduce, Reuse, Recycle: Is Perturbed Data Better than Other Language Augmentation for Low Resource Self-Supervised Speech Models

Asad Ullah, Alessandro Ragano, Andrew Hines

2024 INTERSPEECH

Reducing Speech Distortion and Artifacts for Speech Enhancement by Loss Function

Haixin Guan, Wei Dai, Guangyong Wang et al.

2024 INTERSPEECH

Reference-Free Estimation of the Quality of Clinical Notes Generated from Doctor-Patient Conversations

Mojtaba Kadkhodaie Elyaderani, John Glover, Thomas Schaaf

2024 INTERSPEECH

Refining Self-supervised Learnt Speech Representation using Brain Activations

HengYu Li, Kangdi Mei, Zhaoci Liu et al.

2024 INTERSPEECH

Reinforcement Learning based Data Augmentation for Noise Robust Speech Emotion Recognition

Sumit Ranjan, Rupayan Chakraborty, Sunil Kumar Kopparapu

2024 INTERSPEECH

Reinforcement Learning from Answer Reranking Feedback for Retrieval-Augmented Answer Generation

Minh Nguyen, Toan Quoc Nguyen, Kishan KC et al.

2024 INTERSPEECH

Relational Proxy Loss for Audio-Text based Keyword Spotting

Youngmoon Jung, Seungjin Lee, Joon-Young Yang et al.

2024 INTERSPEECH

Reliable dialogue system for facilitating student-counselor communication

Mahdin Rohmatillah, Bryan Gautama Ngo, Willianto Sulaiman et al.

2024 INTERSPEECH

RepCNN: Micro-sized, Mighty Models for Wakeword Detection

Arnav Kundu, Prateeth Nayak, Priyanka Padmanabhan et al.

2024 INTERSPEECH

RepTor: Re-parameterizable Temporal Convolution for Keyword Spotting via Differentiable Kernel Search

Eunik Park, Daehyun Ahn, Hyungjun Kim

2024 INTERSPEECH

Reshape Dimensions Network for Speaker Recognition

Ivan Yakovlev, Rostislav Makarov, Andrei Balykin et al.

2024 INTERSPEECH

Residual Speaker Representation for One-Shot Voice Conversion

Le Xu, Jiangyan Yi, Tao Wang et al.

2024 INTERSPEECH

Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps

Mattias Nilsson, Riccardo Miccini, Clement Laroche et al.

2024 INTERSPEECH

Retrieval-Augmented Classifier Guidance for Audio Generation

Ho-Young Choi, Won-Gook Choi, Joon-Hyuk Chang

2024 INTERSPEECH

Papers