Papers
Improved CNN-Transformer using Broadcasted Residual Learning for Text-Independent Speaker Verification
Jeong-Hwan Choi, Joon-Young Yang, Ye-Rin Jeoung et al.
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing
Heli Qi, Sashi Novitasari, Sakriani Sakti et al.
Improved Modulation-Domain Loss for Neural-Network-based Speech Enhancement
Tyler Vuong, Richard Stern
Improved Relation Networks for End-to-End Speaker Verification and Identification
Ashutosh Chaubey, Sparsh Sinha, Susmita Ghose
Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training
Jiaxu He, Cheng Gong, Longbiao Wang et al.
Improve Speech Enhancement using Perception-High-Related Time-Frequency Loss
Ding Zhao, Zhan Zhang, Bin Yu et al.
Improving ASR Robustness in Noisy Condition Through VAD Integration
Sashi Novitasari, Takashi Fukuda, Gakuto Kurata
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model
Jennifer Fox, Natalie Delworth
Improving Data Driven Inverse Text Normalization using Data Augmentation and Machine Translation
Debjyoti Paul, Yutong Pang, Szu-Jui Chen et al.
Improving Deliberation by Text-Only and Semi-Supervised Training
Ke Hu, Tara Sainath, Yanzhang He et al.
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation
Kuan Po Huang, Yu-Kuan Fu, Yu Zhang et al.
Improving GAN-based vocoder for fast and high-quality speech synthesis
He Mengnan, Tingwei Guo, Zhenxing Lu et al.
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Xiaodong Cui, George Saon, Tohru Nagano et al.
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech
Kaitao Song, Teng Wan, Bixia Wang et al.
Improving Language Identification of Accented Speech
Kunnar Kukk, Tanel Alumäe
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information
Jie Chen, Changhe Song, Deyi Tuo et al.
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Mu Yang, Kevin Hirschi, Stephen Daniel Looney et al.
Improving Phonetic Transcriptions of Children’s Speech by Pronunciation Modelling with Constrained CTC-Decoding
Lars Rumberg, Christopher Gebauer, Hanna Ehlert et al.
Improving Rare Word Recognition with LM-aware MWER Training
Wang Weiran, Tongzhou Chen, Tara Sainath et al.
Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods
Lingxuan Ye, Gaofeng Cheng, Runyan Yang et al.
Improving Speech Emotion Recognition Through Focus and Calibration Attention Mechanisms
Junghun Kim, Yoojin An, Jihie Kim
Improving Speech Emotion Recognition Using Self-Supervised Learning with Domain-Specific Audiovisual Tasks
Lucas Goncalves, Carlos Busso
Improving Speech Enhancement through Fine-Grained Speech Characteristics
Muqiao Yang, Joseph Konan, David Bick et al.
Improving Spoken Language Understanding with Cross-Modal Contrastive Learning
Jingjing Dong, Jiayi Fu, Peng Zhou et al.
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Zehan Li, Haoran Miao, Keqi Deng et al.