Papers
Duplex Conversation in Outbound Agent System
Chunxiang Jin, Minghui Yang, Zujie Wen
Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder
Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency
Yangyang Shi, Varun Nagaraja, Chunyang Wu et al.
Dynamic Multi-Scale Convolution for Dialect Identification
Tianlong Kong, Shouyi Yin, Dawei Zhang et al.
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition
Jicheng Zhang, Yizhou Peng, Van Tung Pham et al.
Earnings-21: A Practical Benchmark for ASR in the Wild
Miguel Del Rio, Natalie Delworth, Ryan Westerman et al.
EasyCall Corpus: A Dysarthric Speech Dataset
Rosanna Turrisi, Arianna Braccia, Marco Emanuele et al.
ECAPA-TDNN Embeddings for Speaker Diarization
Nauman Dawalatabad, Mirco Ravanelli, François Grondin et al.
Effective Phase Encoding for End-To-End Speaker Verification
Junyi Peng, Xiaoyang Qu, Rongzhi Gu et al.
Effect of Carrier Bandwidth on Understanding Mandarin Sentences in Simulated Electric-Acoustic Hearing
Feng Wang, Jing Chen, Fei Chen
Effects of Aging and Age-Related Hearing Loss on Talker Discrimination
Min Xu, Jing Shao, Lan Wang
Effects of Feature Scaling and Fusion on Sign Language Translation
Tejaswini Ananthanarayana, Lipisha Chaudhary, Ifeoma Nwogu
Effects of Time Pressure and Spontaneity on Phonotactic Innovations in German Dialogues
Petra Wagner, Sina Zarrieß, Joana Cholin
Effects of Voice Type and Task on L2 Learners’ Awareness of Pronunciation Errors
Alif Silpachai, Ivana Rehman, Taylor Anne Barriuso et al.
Efficient and Stable Adversarial Learning Using Unpaired Data for Unsupervised Multichannel Speech Separation
Yu Nakagome, Masahito Togami, Tetsuji Ogawa et al.
Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition
Xiong Wang, Sining Sun, Lei Xie et al.
EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder
Zhengchen Liu, Chenfeng Miao, Qingying Zhu et al.
Efficient Weight Factorization for Multilingual Speech Recognition
Ngoc-Quan Pham, Tuan-Nam Nguyen, Sebastian Stüker et al.
Emitting Word Timings with HMM-Free End-to-End System in Automatic Speech Recognition
Xianzhao Chen, Hao Ni, Yi He et al.
EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III
Omid Ghahabi, Volker Fischer
Emotional Prosody Control for Speech Generation
Sarath Sivaprasad, Saiteja Kosgi, Vineet Gandhi
Emotion Carrier Recognition from Personal Narratives
Aniruddha Tammewar, Alessandra Cervone, Giuseppe Riccardi
Emotion Recognition from Speech Using wav2vec 2.0 Embeddings
Leonardo Pepino, Pablo Riera, Luciana Ferrer
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
Chenye Cui, Yi Ren, Jinglin Liu et al.