Papers
8,761 papers found
Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting
Yosuke Kashiwagi, Hayato Futami, Emiru Tsunoo et al.
Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting
Muhammad Yeza Baihaqi, Angel Garcia Contreras, Seiya Kawano et al.
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings
Praveen Srinivasa Varadhan, Ashwin Sankar, Giri Raju et al.
RAST: A Reference-Audio Synchronization Tool for Dubbed Content
David Meyer, Eitan Abecassis, Clara Fernandez-Labrador et al.
RASU: Retrieval Augmented Speech Understanding through Generative Modeling
Hao Yang, Min Zhang, Minghan Wang et al.
RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Yujie Chen, Jiangyan Yi, Jun Xue et al.
Reading Miscue Detection in Primary School through Automatic Speech Recognition
Lingyun Gao, Cristian Tejedor-Garcia, Helmer Strik et al.
Real-Time Gaze-directed speech enhancement for audio-visual hearing-aids
Arif Reza Anway, Bryony Buck, Mandar Gogate et al.
Real-time scheme for rapid extraction of speaker embeddings in challenging recording conditions
Kai Liu, Ziqing Du, Zhou Huan et al.
Real-time Speech Summarization for Medical Conversations
Khai Le-Duc, Khai-Nguyen Nguyen, Long Vo-Dang et al.
Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation
Alexander Kathan, Martin Bürger, Andreas Triantafyllopoulos et al.
Reduce, Reuse, Recycle: Is Perturbed Data Better than Other Language Augmentation for Low Resource Self-Supervised Speech Models
Asad Ullah, Alessandro Ragano, Andrew Hines
Reducing Speech Distortion and Artifacts for Speech Enhancement by Loss Function
Haixin Guan, Wei Dai, Guangyong Wang et al.
Reference-Free Estimation of the Quality of Clinical Notes Generated from Doctor-Patient Conversations
Mojtaba Kadkhodaie Elyaderani, John Glover, Thomas Schaaf
Refining Self-supervised Learnt Speech Representation using Brain Activations
HengYu Li, Kangdi Mei, Zhaoci Liu et al.
Reinforcement Learning based Data Augmentation for Noise Robust Speech Emotion Recognition
Sumit Ranjan, Rupayan Chakraborty, Sunil Kumar Kopparapu
Reinforcement Learning from Answer Reranking Feedback for Retrieval-Augmented Answer Generation
Minh Nguyen, Toan Quoc Nguyen, Kishan KC et al.
Relational Proxy Loss for Audio-Text based Keyword Spotting
Youngmoon Jung, Seungjin Lee, Joon-Young Yang et al.
Reliable dialogue system for facilitating student-counselor communication
Mahdin Rohmatillah, Bryan Gautama Ngo, Willianto Sulaiman et al.
RepCNN: Micro-sized, Mighty Models for Wakeword Detection
Arnav Kundu, Prateeth Nayak, Priyanka Padmanabhan et al.
RepTor: Re-parameterizable Temporal Convolution for Keyword Spotting via Differentiable Kernel Search
Eunik Park, Daehyun Ahn, Hyungjun Kim
Reshape Dimensions Network for Speaker Recognition
Ivan Yakovlev, Rostislav Makarov, Andrei Balykin et al.
Residual Speaker Representation for One-Shot Voice Conversion
Le Xu, Jiangyan Yi, Tao Wang et al.
Resource-Efficient Speech Quality Prediction through Quantization Aware Training and Binary Activation Maps
Mattias Nilsson, Riccardo Miccini, Clement Laroche et al.
Retrieval-Augmented Classifier Guidance for Audio Generation
Ho-Young Choi, Won-Gook Choi, Joon-Hyuk Chang