Papers
End-to-End Binaural Speech Synthesis
Wen Chin Huang, Dejan Markovic, Alexander Richard et al.
End-to-End Dependency Parsing of Spoken French
Adrien Pupier, Maximin Coavoux, Benjamin Lecouteux et al.
End-to-end framework for spoof-aware speaker verification
Woohyun Kang, Md Jahangir Alam, Abderrahim Fathan
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
Xuankai Chang, Takashi Maekaku, Yuya Fujita et al.
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training
Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno et al.
End-To-End Label Uncertainty Modeling for Speech-based Arousal Recognition Using Bayesian Neural Networks
Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock et al.
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation
Krishna Subramani, Jean-Marc Valin, Umut Isik et al.
End-to-end Mispronunciation Detection with Simulated Error Distance
Zhan Zhang, Yuehai Wang, Jianyi Yang
End-to-end model for named entity recognition from speech without paired training data
Salima Mdhaffar, Jarod Duret, Titouan Parcollet et al.
End-to-End Multi-Loss Training for Low Delay Packet Loss Concealment
Nan Li, Xiguang Zheng, Chen Zhang et al.
End-to-End multi-talker audio-visual ASR using an active speaker attention module
Richard Rose, Olivier Siohan
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors
Magdalena Rybicka, Jesus Villalba, Najim Dehak et al.
End-to-end speech recognition modeling from de-identified data
Martin Flechl, Shou-Chun Yin, Junho Park et al.
End-to-end Speech-to-Punctuated-Text Recognition
Jumon Nozaki, Tatsuya Kawahara, Kenkichi Ishizuka et al.
End-to-End Spontaneous Speech Recognition Using Disfluency Labeling
Koharu Horii, Meiko Fukuda, Kengo Ohta et al.
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue
Kentaro Mitsui, Tianyu Zhao, Kei Sawada et al.
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation
Sravya Popuri, Peng-Jen Chen, Changhan Wang et al.
Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch
Hanbin Bae, Young-Sun Joo
Enhancing Embeddings for Speech Classification in Noisy Conditions
Mohamed Nabih Ali, Alessio Brutti, Falavigna Daniele
Enhancing Speech Privacy with Slicing
Mohamed Maouche, Brij Mohan Lal Srivastava, Nathalie Vauquier et al.
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis
Yixuan Zhou, Changhe Song, Jingbei Li et al.
Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification
Leying Zhang, Zhengyang Chen, Yanmin Qian
Environment Aware Text-to-Speech Synthesis
Daxin Tan, Guangyan Zhang, Tan Lee
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models
Perry Lam, Huayun Zhang, Nancy Chen et al.
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
Yen-Ju Lu, Xuankai Chang, Chenda Li et al.