Papers
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages
Yosuke Higuchi, Naohiro Tawara, Tetsunori Kobayashi et al.
Speaker Augmentation and Bandwidth Extension for Deep Speaker Embedding
Hitoshi Yamamoto, Kong Aik Lee, Koji Okabe et al.
Speaker-Aware Deep Denoising Autoencoder with Embedded Speaker Identity for Speech Enhancement
Fu-Kai Chuang, Syu-Siang Wang, Jeih-weih Hung et al.
Speaker-Corrupted Embeddings for Online Speaker Diarization
Omid Ghahabi, Volker Fischer
Speaker Diarization Using Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings
Alan McCree, Gregory Sell, Daniel Garcia-Romero
Speaker Diarization with Deep Speaker Embeddings for DIHARD Challenge II
Sergey Novoselov, Aleksei Gusev, Artem Ivanov et al.
Speaker Diarization with Lexical Information
Tae Jin Park, Kyu J. Han, Jing Huang et al.
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning
Long Wu, Hangting Chen, Li Wang et al.
Speaker Recognition Benchmark Using the CHiME-5 Corpus
Daniel Garcia-Romero, David Snyder, Shinji Watanabe et al.
SPEAK YOUR MIND! Towards Imagined Speech Recognition with Hierarchical Deep Learning
Pramit Saha, Muhammad Abdul-Mageed, Sidney Fels
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park, William Chan, Yu Zhang et al.
Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric
Ryandhimas E. Zezario, Szu-Wei Fu, Xugang Lu et al.
Spectral Subspace Analysis for Automatic Assessment of Pathological Speech Intelligibility
Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard
Speech Audio Super-Resolution for Speech Recognition
Xinyu Li, Venkata Chebiyyam, Katrin Kirchhoff
Speech Augmentation via Speaker-Specific Noise in Unseen Environment
Ya’nan Guo, Ziping Zhao, Yide Ma et al.
Speech Based Emotion Prediction: Can a Linear Model Work?
Anda Ouyang, Ting Dang, Vidhyasaharan Sethu et al.
Speech-Based Web Navigation for Limited Mobility Users
Vasiliy Radostev, Serge Berger, Justin Tabrizi et al.
Speech Denoising with Deep Feature Losses
François G. Germain, Qifeng Chen, Vladlen Koltun
Speech Driven Backchannel Generation Using Deep Q-Network for Enhancing Engagement in Human-Robot Interaction
Nusrah Hussain, Engin Erzin, T. Metin Sezgin et al.
Speech Emotion Recognition Based on Multi-Label Emotion Existence Model
Atsushi Ando, Ryo Masumura, Hosana Kamiyama et al.
Speech Emotion Recognition in Dyadic Dialogues with Attentive Interaction Modeling
Jinming Zhao, Shizhe Chen, Jingjun Liang et al.
Speech Emotion Recognition with a Reject Option
Kusha Sridhar, Carlos Busso
Speech Enhancement for Noise-Robust Speech Synthesis Using Wasserstein GAN
Nagaraj Adiga, Yannis Pantazis, Vassilis Tsiaras et al.
Speech Enhancement Using Forked Generative Adversarial Networks with Spectral Subtraction
Ju Lin, Sufeng Niu, Zice Wei et al.