Research Explorer

HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts

Xinlei Niu, Jing Zhang, Charles Patrick Martin

2024 INTERSPEECH

HypR: A comprehensive study for ASR hypothesis revising with a reference corpus

Yi-Wei Wang, Ke-Han Lu, Kuan-Yu Chen

2024 INTERSPEECH

Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models

Minh Nguyen, Franck Dernoncourt, Seunghyun Yoon et al.

2024 INTERSPEECH

IIITH Ucchar e-Sudharak: an automatic English pronunciation corrector for school-going children with a teacher in the loop

Meenakshi Sirigiraju, Arjun Rajasekar, Abhishikth Meejuri et al.

2024 INTERSPEECH

Impact of the tonal factor on diphthong realizations in Standard Mandarin with Generalized Additive Mixed Models

Chenyu Li, Jalal Al-Tamimi

2024 INTERSPEECH

Improved Factorized Neural Transducer Model For Text-only Domain Adaptation

Junzhe Liu, Jianwei Yu, Xie Chen

2024 INTERSPEECH

Improved Remixing Process for Domain Adaptation-Based Speech Enhancement by Mitigating Data Imbalance in Signal-to-Noise Ratio

Li Li, Shogo Seki

2024 INTERSPEECH

Improvement Speaker Similarity for Zero-Shot Any-to-Any Voice Conversion of Whispered and Regular Speech

Aleksei Gusev, Anastasia Avdeeva

2024 INTERSPEECH

Improving Audio Classification with Low-Sampled Microphone Input: An Empirical Study Using Model Self-Distillation

Dawei Liang, Alice Zhang, David Harwath et al.

2024 INTERSPEECH

Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model

Jinlong Xue, Yayue Deng, Yicheng Han et al.

2024 INTERSPEECH

Improving child speech recognition with augmented child-like speech

Yuanyuan Zhang, Zhengjun Yue, Tanvina Patel et al.

2024 INTERSPEECH

Improving Copy-Synthesis Anti-Spoofing Training Method with Rhythm and Speaker Perturbation

Jingze Lu, Yuxiang Zhang, Zhuo Li et al.

2024 INTERSPEECH

Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions

Jiwon Suh, Injae Na, Woohwan Jung

2024 INTERSPEECH

Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation

Ke Chen, Jiaqi Su, Taylor Berg-Kirkpatrick et al.

2024 INTERSPEECH

Improving Multilingual ASR Robustness to Errors in Language Input

Brady Houston, Omid Sadjadi, Zejiang Hou et al.

2024 INTERSPEECH

Improving Multilingual Text-to-Speech with Mixture-of-Language-Experts and Accent Disentanglement

Jing Wu, Ting Chen, Minchuan Chen et al.

2024 INTERSPEECH

Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation

Ruizhe Huang, Mahsa Yarmohammadi, Sanjeev Khudanpur et al.

2024 INTERSPEECH

Improving Noise Robustness in Self-supervised Pre-trained Model for Speaker Verification

Chan-yeong Lim, Hyun-seo Shin, Ju-ho Kim et al.

2024 INTERSPEECH

Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment

Paarth Neekhara, Shehzeen Hussain, Subhankar Ghosh et al.

2024 INTERSPEECH

Improving Self-supervised Pre-training using Accent-Specific Codebooks

Darshan Prabhu, Abhishek Gupta, Omkar Nitsure et al.

2024 INTERSPEECH

Improving Speech-Based Dysarthria Detection using Multi-task Learning with Gradient Projection

Yan Xiong, Visar Berisha, Julie Liss et al.

2024 INTERSPEECH

Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer

Jizhen Li, Xinmeng Xu, Weiping Tu et al.

2024 INTERSPEECH

Improving Speech Recognition with Prompt-based Contextualized ASR and LLM-based Re-predictor

Nguyen Manh Tien Anh, Thach Ho Sy

2024 INTERSPEECH

Improving Streaming Speech Recognition With Time-Shifted Contextual Attention And Dynamic Right Context Masking

Khanh Le, Duc Chau

2024 INTERSPEECH

Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text

Jinpeng Li, Yu Pu, Qi Sun et al.

2024 INTERSPEECH
