Research Explorer

Improved CNN-Transformer using Broadcasted Residual Learning for Text-Independent Speaker Verification

Jeong-Hwan Choi, Joon-Young Yang, Ye-Rin Jeoung et al.

2022 INTERSPEECH

Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing

Heli Qi, Sashi Novitasari, Sakriani Sakti et al.

2022 INTERSPEECH

Improved Modulation-Domain Loss for Neural-Network-based Speech Enhancement

Tyler Vuong, Richard Stern

2022 INTERSPEECH

Improved Relation Networks for End-to-End Speaker Verification and Identification

Ashutosh Chaubey, Sparsh Sinha, Susmita Ghose

2022 INTERSPEECH

Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training

Jiaxu He, Cheng Gong, Longbiao Wang et al.

2022 INTERSPEECH

Improve Speech Enhancement using Perception-High-Related Time-Frequency Loss

Ding Zhao, Zhan Zhang, Bin Yu et al.

2022 INTERSPEECH

Improving ASR Robustness in Noisy Condition Through VAD Integration

Sashi Novitasari, Takashi Fukuda, Gakuto Kurata

2022 INTERSPEECH

Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model

Jennifer Fox, Natalie Delworth

2022 INTERSPEECH

Improving Data Driven Inverse Text Normalization using Data Augmentation and Machine Translation

Debjyoti Paul, Yutong Pang, Szu-Jui Chen et al.

2022 INTERSPEECH

Improving Deliberation by Text-Only and Semi-Supervised Training

Ke Hu, Tara Sainath, Yanzhang He et al.

2022 INTERSPEECH

Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation

Kuan Po Huang, Yu-Kuan Fu, Yu Zhang et al.

2022 INTERSPEECH

Improving GAN-based vocoder for fast and high-quality speech synthesis

He Mengnan, Tingwei Guo, Zhenxing Lu et al.

2022 INTERSPEECH

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

Xiaodong Cui, George Saon, Tohru Nagano et al.

2022 INTERSPEECH

Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech

Kaitao Song, Teng Wan, Bixia Wang et al.

2022 INTERSPEECH

Improving Language Identification of Accented Speech

Kunnar Kukk, Tanel Alumäe

2022 INTERSPEECH

Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information

Jie Chen, Changhe Song, Deyi Tuo et al.

2022 INTERSPEECH

Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment

Mu Yang, Kevin Hirschi, Stephen Daniel Looney et al.

2022 INTERSPEECH

Improving Phonetic Transcriptions of Children’s Speech by Pronunciation Modelling with Constrained CTC-Decoding

Lars Rumberg, Christopher Gebauer, Hanna Ehlert et al.

2022 INTERSPEECH

Improving Rare Word Recognition with LM-aware MWER Training

Wang Weiran, Tongzhou Chen, Tara Sainath et al.

2022 INTERSPEECH

Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods

Lingxuan Ye, Gaofeng Cheng, Runyan Yang et al.

2022 INTERSPEECH

Improving Speech Emotion Recognition Through Focus and Calibration Attention Mechanisms

Junghun Kim, Yoojin An, Jihie Kim

2022 INTERSPEECH

Improving Speech Emotion Recognition Using Self-Supervised Learning with Domain-Specific Audiovisual Tasks

Lucas Goncalves, Carlos Busso

2022 INTERSPEECH

Improving Speech Enhancement through Fine-Grained Speech Characteristics

Muqiao Yang, Joseph Konan, David Bick et al.

2022 INTERSPEECH

Improving Spoken Language Understanding with Cross-Modal Contrastive Learning

Jingjing Dong, Jiayi Fu, Peng Zhou et al.

2022 INTERSPEECH

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies

Zehan Li, Haoran Miao, Keqi Deng et al.

2022 INTERSPEECH

Papers