Papers
O-1: Self-training with Oracle and 1-best Hypothesis
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et al.
Obstructive Sleep Apnea Detection using Pre-trained Speech Representations
Kaibo Zhang, Lili Cao, Yiming Ding et al.
Obstructive sleep apnea screening with breathing sounds and respiratory effort: a multimodal deep learning approach
Hector E. Romero, Ning Ma, Guy J. Brown et al.
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation
Gene-Ping Yang, Yue Gu, Qingming Tang et al.
On-Device Speaker Anonymization of Acoustic Embeddings for ASR based on Flexible Location Gradient Reversal Layer
Md Asif Jalal, Pablo Peso Parada, Jisi Zhang et al.
One-Step Knowledge Distillation and Fine-Tuning in Using Large Pre-Trained Self-Supervised Learning Models for Speaker Verification
Jungwoo Heo, Chan-yeong Lim, Ju-ho Kim et al.
Online Continual Learning in Keyword Spotting for Low-Resource Devices via Pooling High-Order Temporal Statistics
Umberto Michieli, Pablo Peso Parada, Mete Ozay
Online Punctuation Restoration using ELECTRA Model for streaming ASR Systems
Martin Poláček, Petr Červa, Jindřich Žďánský et al.
On Monotonic Aggregation for Open-domain QA
Sang-eun Han, Yeonseok Jeong, Seung-won Hwang et al.
On the Benefits of Self-supervised Learned Speech Representations for Predicting Human Phonetic Misperceptions
Santiago Cuervo, Ricard Marxer
On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition
Lokesh Bansal, S. Pavankumar Dubagunta, Malolan Chetlur et al.
On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition
Mengzhe Geng, Xurong Xie, Rongfeng Su et al.
On the (In)Efficiency of Acoustic Feature Extractors for Self-Supervised Speech Representation Learning
Titouan Parcollet, Shucong Zhang, Rogier van Dalen et al.
On the N-gram Approximation of Pre-trained Language Models
Aravind Krishnan, Jesujoba O. Alabi, Dietrich Klakow
On the Robustness of Arabic Speech Dialect Identification
Peter Sullivan, AbdelRahim Elmadany, Muhammad Abdul-Mageed
On the robustness of wav2vec 2.0 based speaker recognition systems
Sergey Novoselov, Galina Lavrentyeva, Anastasia Avdeeva et al.
On the Use of High Frequency Information for Voice Pathology Classification
David Martínez, Dayana Ribas, Eduardo Lleida
Ontology-aware Learning and Evaluation for Audio Tagging
Haohe Liu, Qiuqiang Kong, Xubo Liu et al.
On Training a Neural Residual Acoustic Echo Suppressor for Improved ASR
Sankaran Panchapagesan, Turaj Zakizadeh Shabestary, Arun Narayanan
Opening or Closing? An Electroglottographic Analysis of Voiceless Coda Consonants in Australian English
Louise Ratko, Joshua Penney, Felicity Cox
Optimal control of speech with context-dependent articulatory targets
Benjamin Elie, Juraj Šimko, Alice Turk
Ordered and Binary Speaker Embedding
Jiaying Wang, Xianglong Wang, Namin Wang et al.
Orthography-based Pronunciation Scoring for Better CAPT Feedback
Caitlin Richter, Ragnar Pálsson, Luke O'Brien et al.
OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition
Li Fu, Siqi Li, Qingtao Li et al.
Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning
Yucong Zhang, Suo Hongbin, Yulong Wan et al.