Papers
Empirical Analysis of Score Fusion Application to Combined Neural Networks for Open Vocabulary Spoken Term Detection
Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh
Empirical Evaluation of Speaker Adaptation on DNN Based Acoustic Model
Ke Wang, Junbo Zhang, Yujun Wang et al.
Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance
Md Hafizur Rahman, Ivan Himawan, Mitchell McLaren et al.
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition
Sei Ueno, Takafumi Moriya, Masato Mimura et al.
End-To-End Audio Replay Attack Detection Using Deep Convolutional Networks with Attention
Francis Tom, Mohit Jain, Prasenjit Dey
End-to-end Deep Neural Network Age Estimation
Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen et al.
End-to-End Speech Command Recognition with Capsule Network
Jaesung Bae, Dae-Shik Kim
End-to-End Speech Recognition from the Raw Waveform
Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve et al.
End-to-end Speech Recognition Using Lattice-free MMI
Hossein Hadian, Hossein Sameti, Daniel Povey et al.
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang et al.
End-to-end Text-dependent Speaker Verification Using Novel Distance Measures
Subhadeep Dey, Srikanth Madikeri, Petr Motlicek
Engagement Recognition in Spoken Dialogue via Neural Network by Aggregating Different Annotators' Models
Koji Inoue, Divesh Lala, Katsuya Takanashi et al.
Enhancement of Noisy Speech Signal by Non-Local Means Estimation of Variational Mode Functions
Nagapuri Srinivas, Gayadhar Pradhan, Syed Shahnawazuddin
Entity-Aware Language Model as an Unsupervised Reranker
Mohammad Sadegh Rasooli, Sarangarajan Parthasarathy
Epoch Extraction from Pathological Children Speech Using Single Pole Filtering Approach
C M Vikram, S R Mahadeva Prasanna
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement
Li Chai, Jun Du, Chin-Hui Lee
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe, Takaaki Hori, Shigeki Karita et al.
Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source
Sudarsana Reddy Kadiri, Bayya Yegnanarayana
Estimation of Hypernasality Scores from Cleft Lip and Palate Speech
C M Vikram, Ayush Tripathi, Sishir Kalita et al.
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge.
Ignacio Viñals, Pablo Gimeno, Alfonso Ortega et al.
Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley
TV Ananthapadmanabha, Ramakrishnan A G
Evolving Learning for Analysing Mood-Related Infant Vocalisation
Zixing Zhang, Jing Han, Kun Qian et al.