Papers
Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention
Junjie Li, Zhiyu Zhang, Minchuan Chen et al.
Improving RNN-T ASR Accuracy Using Context Audio
Andreas Schwarz, Ilya Sklyar, Simon Wiesler
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS
Yan Deng, Rui Zhao, Zhong Meng et al.
Improving Streaming Transformer Based ASR Under a Framework of Self-Supervised Learning
Songjun Cao, Yueteng Kang, Yanzhe Fu et al.
Improving the Expressiveness of Neural Vocoding with Non-Affine Normalizing Flows
Adam Gabryś, Yunlong Jiao, Viacheslav Klimkov et al.
Improving Time Delay Neural Network Based Speaker Recognition with Convolutional Block and Feature Aggregation Methods
Yu-Jia Zhang, Yih-Wen Wang, Chia-Ping Chen et al.
Improving Weakly Supervised Sound Event Detection with Self-Supervised Auxiliary Tasks
Soham Deshmukh, Bhiksha Raj, Rita Singh
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech
Zengqiang Shang, Zhihua Huang, Haozhe Zhang et al.
Incorporating Embedding Vectors from a Human Mean-Opinion Score Prediction Model for Monaural Speech Enhancement
Khandokar Md. Nayem, Donald S. Williamson
Incorporating External POS Tagger for Punctuation Restoration
Ning Shi, Wei Wang, Boxin Wang et al.
Influence of the Interviewer on the Automatic Assessment of Alzheimer’s Disease in the Context of the ADReSSo Challenge
P.A. Pérez-Toro, S.P. Bayerl, T. Arias-Vergara et al.
Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw
Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski et al.
Information Sieve: Content Leakage Reduction in End-to-End Prosody Transfer for Expressive Speech Synthesis
Xudong Dai, Cheng Gong, Longbiao Wang et al.
In-Group Advantage in the Perception of Emotions: Evidence from Three Varieties of German
Moritz Jakob, Bettina Braun, Katharina Zahner-Ritter
Inhalations in Speech: Acoustic and Physiological Characteristics
Raphael Werner, Susanne Fuchs, Jürgen Trouvain et al.
Injecting Descriptive Meta-Information into Pre-Trained Language Models with Hypernetworks
Wenying Duan, Xiaoxi He, Zimu Zhou et al.
Inplace Gated Convolutional Recurrent Neural Network for Dual-Channel Speech Enhancement
Jinjiang Liu, Xueliang Zhang
Insights on Neural Representations for End-to-End Speech Recognition
Anna Ollerenshaw, Md. Asif Jalal, Thomas Hain
Integrating Dialog History into End-to-End Spoken Language Understanding Systems
Jatin Ganhotra, Samuel Thomas, Hong-Kwang J. Kuo et al.
Integrating Frequency Translational Invariance in TDNNs and Frequency Positional Information in 2D ResNets to Enhance Speaker Verification
Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck
Intent Detection and Slot Filling for Vietnamese
Mai Hoang Dao, Thinh Hung Truong, Dat Quoc Nguyen
Interactive and Real-Time Acoustic Measurement Tools for Speech Data Acquisition and Presentation: Application of an Extended Member of Time Stretched Pulses
Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara et al.
INTERSPEECH 2021 Acoustic Echo Cancellation Challenge
Ross Cutler, Ando Saabas, Tanel Parnamaa et al.
INTERSPEECH 2021 Deep Noise Suppression Challenge
Chandan K.A. Reddy, Harishchandra Dubey, Kazuhito Koishida et al.