Papers
An Initialization Scheme for Meeting Separation with Spatial Mixture Models
Christoph Boeddeker, Tobias Cord-Landwehr, Thilo von Neumann et al.
An investigation of regression-based prediction of the femininity or masculinity in speech of transgender people
Leon Liebig, Christoph Wagner, Alexander Mainka et al.
An objective test tool for pitch extractors' response attributes
Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara et al.
A Novel Phoneme-based Modeling for Text-independent Speaker Identification
Xin Wang, Chuan Xie, Qiang Wu et al.
An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion
Zijiang Yang, Xin Jing, Andreas Triantafyllopoulos et al.
An overview of discourse clicks in Central Swedish
Margaret Zellers
Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck
Youngsik Eom, Yeonghyeon Lee, Ji Sub Um et al.
Ant Multilingual Recognition System for OLR 2021 Challenge
Anqi Lyu, Zhiming Wang, Huijia Zhu
A Passive Similarity based CNN Filter Pruning for Efficient Acoustic Scene Classification
Arshdeep Singh, Mark D. Plumbley
A polyphone BERT for Polyphone Disambiguation in Mandarin Chinese
Song Zhang, Ken Zheng, Xiaoxu Zhu et al.
Application for Real-time Personalized Speaker Extraction
Damien Ronssin, Milos Cernak
Applying Syntax–Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis
Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura
Are disentangled representations all you need to build speaker anonymization systems?
Pierre Champion, Anthony Larcher, Denis Jouvet
Are reported accuracies in the clinical speech machine learning literature overoptimistic?
Visar Berisha, Chelsea Krantsevich, Gabriela Stegmann et al.
Articulatory Synthesis for Data Augmentation in Phoneme Recognition
Paul Konstantin Krug, Peter Birkholz, Branislav Gerazov et al.
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization
Fadi Biadsy, Youzheng Chen, Xia Zhang et al.
A Sparsity-promoting Dictionary Model for Variational Autoencoders
Mostafa Sadeghi, Paul Magron
A speech enhancement method for long-range speech acquisition task
Yanzhang Geng, Heng Wang, Tao Zhang et al.
ASR2K: Speech Recognition for Around 2000 Languages without Audio
Xinjian Li, Florian Metze, David R. Mortensen et al.
ASR Error Correction with Constrained Decoding on Operation Prediction
Jingyuan Yang, Rongjun Li, Wei Peng
ASR Error Detection via Audio-Transcript entailment
Nimshi Venkat Meripo, Sandeep Konam
ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks
Valentin Pelloin, Franck Dary, Nicolas Hervé et al.
ASR-Robust Natural Language Understanding on ASR-GLUE dataset
Lingyun Feng, Jianwei Yu, Yan Wang et al.
A Step Towards Preserving Speakers’ Identity While Detecting Depression Via Speaker Disentanglement
Vijay Ravi, Jinhan Wang, Jonathan Flint et al.
A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems
Marcely Zanon Boito, Laurent Besacier, Natalia Tomashenko et al.