Papers
229 papers found
EasyASR: A Distributed Machine Learning Platform for End-to-end Automatic Speech Recognition
Chengyu Wang, Mengli Cheng, Xu Hu et al.
Constructing Korean Learners’ L2 Speech Corpus of Seven Languages for Automatic Pronunciation Assessment
Seunghee Han, Sunhee Kim, Minhwa Chung
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition
Rabindra Nath Nandi, Mehadi Menon, Tareq Muntasir et al.
A De Novo Divide-and-Merge Paradigm for Acoustic Model Optimization in Automatic Speech Recognition
Conghui Tan, Di Jiang, Jinhua Peng et al.
DNN-Based Automatic Speech Recognition as a Model for Human Phoneme Perception
Mats Exter, Bernd T. Meyer
Multichannel Spatial Clustering for Robust Far-Field Automatic Speech Recognition in Mismatched Conditions
Michael I. Mandel, Jon Barker
Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition
Titouan Parcollet, Ying Zhang, Mohamed Morchid et al.
Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition
Niko Moritz, Takaaki Hori, Jonathan Le Roux
Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition
Kohei Matsuura, Masato Mimura, Shinsuke Sakai et al.
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition
Ryo Masumura, Naoki Makishima, Mana Ihori et al.
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition
Egor Lakomkin, Jahn Heymann, Ilya Sklyar et al.
Insertion-Based Modeling for End-to-End Automatic Speech Recognition
Yuya Fujita, Shinji Watanabe, Motoi Omachi et al.
NAS-SCAE: Searching Compact Attention-based Encoders For End-to-end Automatic Speech Recognition
Yukun Liu, Ta Li, Pengyuan Zhang et al.
Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition
Martin Sustek, Samik Sadhu, Hynek Hermansky
Addressing Cold Start Problem for End-to-end Automatic Speech Scoring
Jungbae Park, Seungtaek Choi
A Neural Time Alignment Module for End-to-End Automatic Speech Recognition
Dongcheng Jiang, Chao Zhang, Philip C. Woodland
Uncertainty Estimation for Connectionist Temporal Classification Based Automatic Speech Recognition
Lars Rumberg, Christopher Gebauer, Hanna Ehlert et al.
On Disfluency and Non-lexical Sound Labeling for End-to-end Automatic Speech Recognition
Peter Mihajlik, Yan Meng, Mate S Kadar et al.
Benchmarking IsiXhosa Automatic Speech Recognition and Machine Translation for Digital Health Provision
Abby Blocker, Francois Meyer, Ahmed Biyabani et al.
AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies
José-M. Acosta-Triana, David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos
An Automatic Soundtracking System for Text-to-Speech Audiobooks
Zikai Chen, Lin Wu, Junjie Pan et al.
Automatic Speech Recognition and Query By Example for Creole Languages Documentation
Cécile Macaire, Didier Schwab, Benjamin Lecouteux et al.
A Unified Approach to Multilingual Automatic Speech Recognition with Improved Language Identification for Indic Languages
Nikhil Jakhar, Sudhanshu Srivastava, Arun Baby