Papers
Empirical Analysis of Generalized Iterative Speech Separation Networks
Yi Luo, Cong Han, Nima Mesgarani
End-to-End Audio-Visual Speech Recognition for Overlapping Speech
Richard Rose, Olivier Siohan, Anshuman Tripathi et al.
End-to-End Cross-Lingual Spoken Language Understanding Model with Multilingual Pretraining
Xianwei Zhang, Liang He
End-to-End Language Diarization for Bilingual Code-Switching Speech
Hexin Liu, Leibny Paola García Perera, Xinyi Zhang et al.
End-to-End Neural Diarization: From Transformer to Conformer
Yi Chieh Liu, Eunjung Han, Chul Lee et al.
End-to-End Open Vocabulary Keyword Search
Bolaji Yusuf, Alican Gok, Batuhan Gundogdu et al.
End-to-End Optimized Multi-Stage Vector Quantization of Spectral Envelopes for Speech and Audio Coding
Mohammad Hassan Vali, Tom Bäckström
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
Tomohiro Tanaka, Ryo Masumura, Mana Ihori et al.
End-to-End Speaker-Attributed ASR with Transformer
Naoyuki Kanda, Guoli Ye, Yashesh Gaur et al.
End-to-End Speaker Segmentation for Overlap-Aware Resegmentation
Hervé Bredin, Antoine Laurent
End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain
Kai Wang, Hao Huang, Ying Hu et al.
End-to-End Speech Translation via Cross-Modal Progressive Training
Rong Ye, Mingxuan Wang, Lei Li
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition
Shuai Zhang, Jiangyan Yi, Zhengkun Tian et al.
End-to-End Spoken Language Understanding for Generalized Voice Assistants
Michael Saxon, Samridhi Choudhary, Joseph P. McKenna et al.
End to End Transformer-Based Contextual Speech Recognition Based on Pointer Network
Binghuai Lin, Liyuan Wang
End-to-End Transformer-Based Open-Vocabulary Keyword Spotting with Location-Guided Local Attention
Bo Wei, Meirong Yang, Tao Zhang et al.
Energy-Friendly Keyword Spotting System Using Add-Based Convolution
Hang Zhou, Wenchao Hu, Yu Ting Yeung et al.
Enhancing Semantic Understanding with Self-Supervised Methods for Abstractive Dialogue Summarization
Hyunjae Lee, Jaewoong Yun, Hyunjin Choi et al.
Enriching Source Style Transfer in Recognition-Synthesis Based Non-Parallel Voice Conversion
Zhichao Wang, Xinyong Zhou, Fengyu Yang et al.
Enrollment-Less Training for Personalized Voice Activity Detection
Naoki Makishima, Mana Ihori, Tomohiro Tanaka et al.
Ensemble-Within-Ensemble Classification for Escalation Prediction from Speech
Oxana Verkholyak, Denis Dresvyanskiy, Anastasia Dvoynikova et al.
Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept
Wei Zhou, Albert Zeyer, André Merboldt et al.
Estimating Articulatory Movements in Speech Production with Transformer Networks
Sathvik Udupa, Anwesha Roy, Abhayjeet Singh et al.
ETLT 2021: Shared Task on Automatic Speech Recognition for Non-Native Children’s Speech
R. Gretter, Marco Matassoni, D. Falavigna et al.