Papers
End-to-End Speech-to-Dialog-Act Recognition
Viet-Trung Dang, Tianyu Zhao, Sei Ueno et al.
End-to-End Spoken Language Understanding Without Full Transcripts
Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas et al.
End-to-End Task-Oriented Dialog System Through Template Slot Value Generation
Teakgyu Hong, Oh-Woog Kwon, Young-Kil Kim
End-to-End Text-to-Speech Synthesis with Unaligned Multiple Language Units Based on Attention
Masashi Aso, Shinnosuke Takamichi, Hiroshi Saruwatari
Enhancing Formant Information in Spectrographic Display of Speech
B. Yegnanarayana, Anand Joseph, Vishala Pannala
Enhancing Intelligibility of Dysarthric Speech Using Gated Convolutional-Based Voice Conversion System
Chen-Yu Chen, Wei-Zhong Zheng, Syu-Siang Wang et al.
Enhancing Monotonicity for Robust Autoregressive Transformer TTS
Xiangyu Liang, Zhiyong Wu, Runnan Li et al.
Enhancing Monotonic Multihead Attention for Streaming ASR
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara
Enhancing Sequence-to-Sequence Text-to-Speech with Morphology
Jason Taylor, Korin Richmond
Enhancing Speech Intelligibility in Text-To-Speech Synthesis Using Speaking Style Conversion
Dipjyoti Paul, Muhammed P.V. Shifas, Yannis Pantazis et al.
Enhancing the Interaural Time Difference of Bilateral Cochlear Implants with the Temporal Limits Encoder
Yangyang Wan, Huali Zhou, Qinglin Meng et al.
Enhancing Transferability of Black-Box Adversarial Attacks via Lifelong Learning for Speech Emotion Recognition Models
Zhao Ren, Jing Han, Nicholas Cummins et al.
Ensemble Approaches for Uncertainty in Spoken Language Assessment
Xixin Wu, Kate M. Knill, Mark J.F. Gales et al.
Ensemble of Students Taught by Probabilistic Teachers to Improve Speech Emotion Recognition
Kusha Sridhar, Carlos Busso
Ensembling End-to-End Deep Models for Computational Paralinguistics Tasks: ComParE 2020 Mask and Breathing Sub-Challenges
Maxim Markitantov, Denis Dresvyanskiy, Danila Mamontov et al.
Entity Linking for Short Text Using Structured Knowledge Graph via Multi-Grained Text Matching
Binxuan Huang, Han Wang, Tong Wang et al.
Environmental Sound Classification with Parallel Temporal-Spectral Attention
Helin Wang, Yuexian Zou, Dading Chong et al.
Environment Sound Classification Using Multiple Feature Channels and Attention Based Deep Convolutional Neural Network
Jivitesh Sharma, Ole-Christoffer Granmo, Morten Goodwin
Er-Suffixation in Southwestern Mandarin: An EMA and Ultrasound Study
Jing Huang, Feng-fan Hsieh, Yueh-chin Chang
Evaluating and Optimizing Prosodic Alignment for Automatic Dubbing
Marcello Federico, Yogesh Virkar, Robert Enyedi et al.
Evaluating Automatically Generated Phoneme Captions for Images
Justin van der Hout, Zoltán D’Haese, Mark Hasegawa-Johnson et al.
Evaluating the Reliability of Acoustic Speech Embeddings
Robin Algayres, Mohamed Salah Zaiem, Benoît Sagot et al.
Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification
Xiaoyang Qu, Jianzong Wang, Jing Xiao
Evolved Speech-Transformer: Applying Neural Architecture Search to End-to-End Automatic Speech Recognition
Jihwan Kim, Jisung Wang, Sangki Kim et al.
Exploiting Conic Affinity Measures to Design Speech Enhancement Systems Operating in Unseen Noise Conditions
Pavlos Papadopoulos, Shrikanth Narayanan