Papers - Conftrace

Doing Something we Never could with Spoken Language Technologies-from early days to the era of deep learning

Lin-shan Lee

2020 INTERSPEECH

Domain Adaptation for Enhancing Speech-Based Depression Detection in Natural Environmental Conditions Using Dilated CNNs

Zhaocheng Huang, Julien Epps, Dale Joachim et al.

2020 INTERSPEECH

Domain Adaptation Using Class Similarity for Robust Speech Recognition

Han Zhu, Jiangjiang Zhao, Yuling Ren et al.

2020 INTERSPEECH

Domain Adversarial Neural Networks for Dysarthric Speech Recognition

Dominika Woszczyk, Stavros Petridis, David Millard

2020 INTERSPEECH

Domain Aware Training for Far-Field Small-Footprint Keyword Spotting

Haiwei Wu, Yan Jia, Yuanfei Nie et al.

2020 INTERSPEECH

Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning

Jiawen Kang, Ruiqi Liu, Lantian Li et al.

2020 INTERSPEECH

Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition

Zhihao Du, Jiqing Han, Xueliang Zhang

2020 INTERSPEECH

Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection

Hongji Wang, Heinrich Dinkel, Shuai Wang et al.

2020 INTERSPEECH

Dual Attention in Time and Frequency Domain for Voice Activity Detection

Joohyung Lee, Youngmoon Jung, Hoirin Kim

2020 INTERSPEECH

Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation

Jingjing Chen, Qirong Mao, Dong Liu

2020 INTERSPEECH

Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression

Nils L. Westhausen, Bernd T. Meyer

2020 INTERSPEECH

Dual Stage Learning Based Dynamic Time-Frequency Mask Generation for Audio Event Classification

Donghyeon Kim, Jaihyun Park, David K. Han et al.

2020 INTERSPEECH

DurIAN: Duration Informed Attention Network for Speech Synthesis

Chengzhu Yu, Heng Lu, Na Hu et al.

2020 INTERSPEECH

DurIAN-SC: Duration Informed Attention Network Based Singing Voice Conversion System

Liqiang Zhang, Chengzhu Yu, Heng Lu et al.

2020 INTERSPEECH

Dynamic Margin Softmax Loss for Speaker Verification

Dao Zhou, Longbiao Wang, Kong Aik Lee et al.

2020 INTERSPEECH

Dynamic Prosody Generation for Speech Synthesis Using Linguistics-Driven Acoustic Embedding Selection

Shubhi Tyagi, Marco Nicolis, Jonas Rohnke et al.

2020 INTERSPEECH

Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis

Ruibo Fu, Jianhua Tao, Zhengqi Wen et al.

2020 INTERSPEECH

Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis

Ruibo Fu, Jianhua Tao, Zhengqi Wen et al.

2020 INTERSPEECH

Dysarthria Detection and Severity Assessment Using Rhythm-Based Metrics

Abner Hernandez, Eun Jung Yeo, Sunhee Kim et al.

2020 INTERSPEECH

Dysarthric Speech Recognition Based on Deep Metric Learning

Yuki Takashima, Ryoichi Takashima, Tetsuya Takiguchi et al.

2020 INTERSPEECH

Early Stage LM Integration Using Local and Global Log-Linear Combination

Wilfried Michel, Ralf Schlüter, Hermann Ney

2020 INTERSPEECH

ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification

Brecht Desplanques, Jenthe Thienpondt, Kris Demuynck

2020 INTERSPEECH

EEG-Based Short-Time Auditory Attention Detection Using Multi-Task Deep Learning

Zhuo Zhang, Gaoyan Zhang, Jianwu Dang et al.

2020 INTERSPEECH

Effect of Adding Positional Information on Convolutional Neural Networks for End-to-End Speech Recognition

Jinhwan Park, Wonyong Sung

2020 INTERSPEECH

Effect of Microphone Position Measurement Error on RIR and its Impact on Speech Intelligibility and Quality

Aditya Raikar, Karan Nathwani, Ashish Panda et al.

2020 INTERSPEECH