Papers
End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders
Ryo Masumura, Hiroshi Sato, Tomohiro Tanaka et al.
End-to-End Convolutional Sequence Learning for ASL Fingerspelling Recognition
Katerina Papadimitriou, Gerasimos Potamianos
End-to-End Losses Based on Speaker Basis Vectors and All-Speaker Hard Negative Mining for Speaker Verification
Hee-Soo Heo, Jee-weon Jung, IL-Ho Yang et al.
End-to-End Monaural Speech Separation with Multi-Scale Dynamic Weighted Gated Dilated Convolutional Pyramid Network
Ziqiang Shi, Huibin Lin, Liu Liu et al.
End-to-End Multi-Channel Speech Enhancement Using Inter-Channel Time-Restricted Attention on Raw Waveform
Hyeonseung Lee, Hyung Yong Kim, Woo Hyun Kang et al.
End-to-End Multilingual Multi-Speaker Speech Recognition
Hiroshi Seki, Takaaki Hori, Shinji Watanabe et al.
End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning
Pavel Denisov, Ngoc Thang Vu
End-to-End Music Source Separation: Is it Possible in the Waveform Domain?
Francesc Lluís, Jordi Pons, Xavier Serra
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi et al.
End-to-End SpeakerBeam for Single Channel Target Speech Recognition
Marc Delcroix, Shinji Watanabe, Tsubasa Ochiai et al.
End-to-End Speaker Identification in Noisy and Reverberant Environments Using Raw Waveform Convolutional Neural Networks
Daniele Salvati, Carlo Drioli, Gian Luca Foresti
End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu, Hao Xiong, Jiajun Zhang et al.
End-to-End Spoken Language Understanding: Bootstrapping in Low Resource Scenarios
Swapnil Bhosale, Imran Sheikh, Sri Harsha Dumpala et al.
End-to-End Text-to-Speech for Low-Resource Languages by Cross-Lingual Transfer Learning
Yuan-Jui Chen, Tao Tu, Cheng-chieh Yeh et al.
Energy Separation-Based Instantaneous Frequency Estimation for Cochlear Cepstral Feature for Replay Spoof Detection
Ankur T. Patil, Rajul Acharya, Pulikonda Aditya Sai et al.
Enforcing Semantic Consistency for Cross Corpus Valence Regression from Speech Using Adversarial Discrepancy Learning
Gao-Yi Chao, Yun-Shao Lin, Chun-Min Chang et al.
Enhanced Spectral Features for Distortion-Independent Acoustic Modeling
Peidong Wang, DeLiang Wang
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation
Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham et al.
Ensemble Models for Spoofing Detection in Automatic Speaker Verification
Bhusan Chettri, Daniel Stoller, Veronica Morfi et al.
Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement
Meng Ge, Longbiao Wang, Nan Li et al.
EpaDB: A Database for Development of Pronunciation Assessment Systems
Jazmín Vidal, Luciana Ferrer, Leonardo Brambilla
ERP Signal Analysis with Temporal Resolution Using a Time Window Bank
Annika Nijveld, L. ten Bosch, Mirjam Ernestus
Evaluating Audiovisual Source Separation in the Context of Video Conferencing
Berkay İnan, Milos Cernak, Helmut Grabner et al.
Evaluating Intention Communication by TTS Using Explicit Definitions of Illocutionary Act Performance
Nobukatsu Hojo, Noboru Miyazaki