Papers
End-To-End Audio Replay Attack Detection Using Deep Convolutional Networks with Attention
Francis Tom, Mohit Jain, Prasenjit Dey
End-to-end Deep Neural Network Age Estimation
Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen et al.
End-to-End Speech Command Recognition with Capsule Network
Jaesung Bae, Dae-Shik Kim
End-to-End Speech Recognition from the Raw Waveform
Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve et al.
End-to-end Speech Recognition Using Lattice-free MMI
Hossein Hadian, Hossein Sameti, Daniel Povey et al.
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang et al.
End-to-end Text-dependent Speaker Verification Using Novel Distance Measures
Subhadeep Dey, Srikanth Madikeri, Petr Motlicek
Engagement Recognition in Spoken Dialogue via Neural Network by Aggregating Different Annotators' Models
Koji Inoue, Divesh Lala, Katsuya Takanashi et al.
Enhancement of Noisy Speech Signal by Non-Local Means Estimation of Variational Mode Functions
Nagapuri Srinivas, Gayadhar Pradhan, Syed Shahnawazuddin
Entity-Aware Language Model as an Unsupervised Reranker
Mohammad Sadegh Rasooli, Sarangarajan Parthasarathy
Epoch Extraction from Pathological Children Speech Using Single Pole Filtering Approach
C M Vikram, S R Mahadeva Prasanna
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement
Li Chai, Jun Du, Chin-Hui Lee
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe, Takaaki Hori, Shigeki Karita et al.
Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source
Sudarsana Reddy Kadiri, Bayya Yegnanarayana
Estimation of Hypernasality Scores from Cleft Lip and Palate Speech
C M Vikram, Ayush Tripathi, Sishir Kalita et al.
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge.
Ignacio Viñals, Pablo Gimeno, Alfonso Ortega et al.
Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley
TV Ananthapadmanabha, Ramakrishnan A G
Evolving Learning for Analysing Mood-Related Infant Vocalisation
Zixing Zhang, Jing Han, Kun Qian et al.
Exemplar-Based Spectral Detail Compensation for Voice Conversion
Yu-Huai Peng, Hsin-Te Hwang, Yichiao Wu et al.
Exemplar-based Speech Waveform Generation
Oliver Watts, Cassia Valentini-Botinhao, Felipe Espic et al.
Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization
Paul Magron, Tuomas Virtanen
Experience-dependent Influence of Music and Language on Lexical Pitch Learning Is Not Additive
Akshay Raj Maggu, Patrick C. M. Wong, Hanjun Liu et al.
Experiments with Training Corpora for Statistical Text-to-speech Systems.
Monika Podsiadło, Victor Ungureanu