Papers - Conftrace

SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification

Helin Wang, Yuexian Zou, Wenwu Wang

2021 INTERSPEECH

SpecMix : A Mixed Sample Data Augmentation Method for Training with Time-Frequency Domain Features

Gwantae Kim, David K. Han, Hanseok Ko

2021 INTERSPEECH

SpecRec: An Alternative Solution for Improving End-to-End Speech-to-Text Translation via Spectrogram Reconstruction

Junkun Chen, Mingbo Ma, Renjie Zheng et al.

2021 INTERSPEECH

Spectral and Latent Speech Representation Distortion for TTS Evaluation

Thananchai Kongthaworn, Burin Naowarat, Ekapol Chuangsuwanich

2021 INTERSPEECH

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition

Mengzhe Geng, Shansong Liu, Jianwei Yu et al.

2021 INTERSPEECH

Speech2Video: Cross-Modal Distillation for Speech to Video Generation

Shijing Si, Jianzong Wang, Xiaoyang Qu et al.

2021 INTERSPEECH

Speech Acoustic Modelling Using Raw Source and Filter Components

Erfan Loweimi, Zoran Cvetkovic, Peter Bell et al.

2021 INTERSPEECH

Speech Activity Detection Based on Multilingual Speech Recognition System

Seyyed Saeed Sarfjoo, Srikanth Madikeri, Petr Motlicek

2021 INTERSPEECH

SpeechAdjuster: A Tool for Investigating Listener Preferences and Speech Intelligibility

Olympia Simantiraki, Martin Cooke

2021 INTERSPEECH

Speech Based Depression Severity Level Classification Using a Multi-Stage Dilated CNN-LSTM Model

Nadee Seneviratne, Carol Espy-Wilson

2021 INTERSPEECH

Speech Decomposition Based on a Hybrid Speech Model and Optimal Segmentation

Alfredo Esquivel Jaramillo, Jesper Kjær Nielsen, Mads Græsbøll Christensen

2021 INTERSPEECH

Speech Denoising with Auditory Models

Mark R. Saddler, Andrew Francl, Jenelle Feather et al.

2021 INTERSPEECH

Speech Denoising Without Clean Training Data: A Noise2Noise Approach

Madhav Mahesh Kashyap, Anuj Tambwekar, Krishnamoorthy Manohara et al.

2021 INTERSPEECH

Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-Encoders

Jinzi Qi, Hugo Van hamme

2021 INTERSPEECH

Speech Emotion Recognition Based on Attention Weight Correction Using Word-Level Confidence Measure

Jennifer Santoso, Takeshi Yamada, Shoji Makino et al.

2021 INTERSPEECH

Speech Emotion Recognition via Multi-Level Cross-Modal Distillation

Ruichen Li, Jinming Zhao, Qin Jin

2021 INTERSPEECH

Speech Emotion Recognition with Multi-Task Learning

Xingyu Cai, Jiahong Yuan, Renjie Zheng et al.

2021 INTERSPEECH

Speech Enhancement with Topology-Enhanced Generative Adversarial Networks (GANs)

Xudong Zhang, Liang Zhao, Feng Gu

2021 INTERSPEECH

Speech Enhancement with Weakly Labelled Data from AudioSet

Qiuqiang Kong, Haohe Liu, Xingjian Du et al.

2021 INTERSPEECH

Speech Intelligibility of Dysarthric Speech: Human Scores and Acoustic-Phonetic Features

Wei Xue, Roeland van Hout, Fleur Boogmans et al.

2021 INTERSPEECH

SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts

Zhao You, Shulin Feng, Dan Su et al.

2021 INTERSPEECH

speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment

Junbo Zhang, Zhiwen Zhang, Yongqing Wang et al.

2021 INTERSPEECH

Speech Perception and Loanword Adaptations: The Case of Copy-Vowel Epenthesis

Adriana Guevara-Rukoz, Shi Yu, Sharon Peperkamp

2021 INTERSPEECH

Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021

Takashi Maekaku, Xuankai Chang, Yuya Fujita et al.

2021 INTERSPEECH

Speech Resynthesis from Discrete Disentangled Self-Supervised Representations

Adam Polyak, Yossi Adi, Jade Copet et al.

2021 INTERSPEECH