Papers
God as Interlocutor — Real or Imaginary? Prosodic Markers of Dialogue Speech and Expected Efficacy in Spoken Prayer
Oliver Niebuhr, Uffe Schjoedt
GPU-Based WFST Decoding with Extra Large Language Model
Daisuke Fukunaga, Yoshiki Tanaka, Yuichi Kageyama
“Gra[f]e!” Word-Final Devoicing of Obstruents in Standard French: An Acoustic Study Based on Large Corpora
Adèle Jatteau, Ioana Vasilescu, Lori Lamel et al.
Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion
Shaojin Ding, Ricardo Gutierrez-Osuna
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
Naoyuki Kanda, Christoph Boeddeker, Jens Heitkaemper et al.
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata, Kartik Audhkhasi
Harmonic Beamformers for Non-Intrusive Speech Intelligibility Prediction
Charlotte Sørensen, Jesper B. Boldt, Mads G. Christensen
Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection
Ke-Xin He, Yu-Han Shen, Wei-Qiang Zhang
High Quality, Lightweight and Adaptable TTS Using LPCNet
Zvi Kons, Slava Shechtman, Alex Sorin et al.
How to Annotate 100 Hours in 45 Minutes
Per Fallgren, Zofia Malisz, Jens Edlund
Hush-Hush Speak: Speech Reconstruction Using Silent Videos
Shashwat Uttam, Yaman Kumar, Dhruva Sahrawat et al.
Hypernasality Severity Detection Using Constant Q Cepstral Coefficients
Akhilesh Kumar Dubey, S.R. Mahadeva Prasanna, S. Dandapat
HyST: A Hybrid Approach for Flexible and Accurate Dialogue State Tracking
Rahul Goel, Shachi Paul, Dilek Hakkani-Tür
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences
Kong Aik Lee, Ville Hautamäki, Tomi H. Kinnunen et al.
IA-NET: Acceleration and Compression of Speech Enhancement Using Integer-Adder Deep Neural Network
Yu-Chen Lin, Yi-Te Hsu, Szu-Wei Fu et al.
Identifying Distinctive Acoustic and Spectral Features in Parkinson’s Disease
Yermiyahu Hauptman, Ruth Aloni-Lavi, Itshak Lapidot et al.
Identifying Input Features for Development of Real-Time Translation of Neural Signals to Text
Janaki Sheth, Ariel Tankus, Michelle Tran et al.
Identifying Mood Episodes Using Dialogue Features from Clinical Interviews
Zakaria Aldeneh, Mimansa Jaiswal, Michael Picheny et al.
Identifying Personality Traits Using Overlap Dynamics in Multiparty Dialogue
Mingzhi Yu, Emer Gilmartin, Diane Litman
Identifying Therapist and Client Personae for Therapeutic Alliance Estimation
Victor R. Martinez, Nikolaos Flemotomos, Victor Ardulov et al.
IIIT-H Spoofing Countermeasures for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2019
K.N.R.K. Raju Alluri, Anil Kumar Vuppala
Impact of ASR Performance on Spoken Grammatical Error Detection
Y. Lu, Mark J.F. Gales, Kate M. Knill et al.
Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders
Atsunori Ogawa, Marc Delcroix, Shigeki Karita et al.