Papers - Conftrace

Pruned RNN-T for fast, memory-eﬀicient ASR training

Fangjun Kuang, Liyong Guo, Wei Kang et al.

2022 INTERSPEECH

Pseudo Label Is Better Than Human Label

Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo et al.

2022 INTERSPEECH

Pushing the limits of raw waveform speaker recognition

Jee-weon Jung, Youjin Kim, Hee-Soo Heo et al.

2022 INTERSPEECH

QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer

Jinmiao Huang, Waseem Gharbieh, Qianhui Wan et al.

2022 INTERSPEECH

QDPN - Quasi-dual-path Network for single-channel Speech Separation

Joel Rixen, Matthias Renz

2022 INTERSPEECH

Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition

Thibault Bañeras Roux, Mickael Rouvier, Jane Wottawa et al.

2022 INTERSPEECH

Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

Running Zhao, Jiangtao Yu, Tingle Li et al.

2022 INTERSPEECH

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection

Dongchao Yang, Helin Wang, Zhongjie Ye et al.

2022 INTERSPEECH

Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting

Yang Xiao, Nana Hou, Eng Siong Chng

2022 INTERSPEECH

RCT: Random consistency training for semi-supervised sound event detection

Nian Shao, Erfan Loweimi, Xiaofei Li

2022 INTERSPEECH

Real-Time Monitoring of Silences in Contact Center Conversations

Digvijay Ingle, Ayush Kumar, Krishnachaitanya Gogineni et al.

2022 INTERSPEECH

Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model

Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery et al.

2022 INTERSPEECH

Recent improvements of ASR models in the face of adversarial attacks

Raphael Olivier, Bhiksha Raj

2022 INTERSPEECH

Recording and timing vocal responses in online experimentation

Katrina Kechun Li, Julia Schwarz, Jasper Hong Sim et al.

2022 INTERSPEECH

Recurrent multi-head attention fusion network for combining audio and text for speech emotion recognition

Chung-Soo Ahn, Chamara Kasun, Sunil Sivadas et al.

2022 INTERSPEECH

Reducing Domain mismatch in Self-supervised speech pre-training

Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran et al.

2022 INTERSPEECH

Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation

Viet Anh Trinh, Pegah Ghahremani, Brian King et al.

2022 INTERSPEECH

reducing multilingual context confusion for end-to-end code-switching automatic speech recognition

Shuai Zhang, Jiangyan Yi, Zhengkun Tian et al.

2022 INTERSPEECH

Reducing Offensive Replies in Open Domain Dialogue Systems

Naokazu Uchida, Takeshi Homma, Makoto Iwayama et al.

2022 INTERSPEECH

Reducing uncertainty at the score-to-LR stage in likelihood ratio-based forensic voice comparison using automatic speaker recognition systems

Bruce Xiao Wang, Vincent Hughes

2022 INTERSPEECH

RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses

Shengyuan Xu, Wenxiao Zhao, Jing Guo

2022 INTERSPEECH

Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction

Julitta Bartolewska, Stanisław Kacprzak, Konrad Kowalczyk

2022 INTERSPEECH

RefTextLAS: Reference Text Biased Listen, Attend, and Spell Model For Accurate Reading Evaluation

Phani Sankar Nidadavolu, Na Xu, Nick Jutila et al.

2022 INTERSPEECH

Regularizing Transformer-based Acoustic Models by Penalizing Attention Weights

Munhak Lee, Joon-Hyuk Chang, Sang-Eon Lee et al.

2022 INTERSPEECH

Relating the fundamental frequency of speech with EEG using a dilated convolutional network

Corentin Puffay, Jana Van Canneyt, Jonas Vanthornhout et al.

2022 INTERSPEECH