Research Explorer

Expressive, Variable, and Controllable Duration Modelling in TTS

Syed Ammar Abbas, Thomas Merritt, Alexis Moinet et al.

2022 INTERSPEECH

Extended U-Net for Speaker Verification in Noisy Environments

Ju-Ho Kim, Jungwoo Heo, Hye-jin Shim et al.

2022 INTERSPEECH

Extending Compositional Attention Networks for Social Reasoning in Videos

Christina Sartzetaki, Georgios Paraskevopoulos, Alexandros Potamianos

2022 INTERSPEECH

Extending GCC-PHAT using Shift Equivariant Neural Networks

Axel Berg, Mark O'Connor, Kalle Åström et al.

2022 INTERSPEECH

Extending RNN-T-based speech recognition systems with emotion and language classification

Zvi Kons, Hagai Aronowitz, Edmilson Morais et al.

2022 INTERSPEECH

External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge

Guolong Zhong, Hongyu Song, Ruoyu Wang et al.

2022 INTERSPEECH

Extract and Abstract with BART for Clinical Notes from Doctor-Patient Conversations

Jing Su, Longxiang Zhang, Hamid Reza Hassanzadeh et al.

2022 INTERSPEECH

Extracting Targeted Training Data from ASR Models, and How to Mitigate It

Ehsan Amid, Om Dipakbhai Thakkar, Arun Narayanan et al.

2022 INTERSPEECH

Factors affecting the percept of Yanny v. Laurel (or mixed): Insights from a large-scale study on Swiss German listeners

Adrian Leemann, Péter Jeszenszky, Carina Steiner et al.

2022 INTERSPEECH

Fast Grad-TTS: Towards Efficient Diffusion-Based Speech Generation on CPU

Ivan Vovk, Tasnima Sadekova, Vladimir Gogoryan et al.

2022 INTERSPEECH

Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation

Manthan Thakker, Sefik Emre Eskimez, Takuya Yoshioka et al.

2022 INTERSPEECH

FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition

Szu-Jui Chen, Jiamin Xie, John H.L. Hansen

2022 INTERSPEECH

Federated Domain Adaptation for ASR with Full Self-Supervision

Junteng Jia, Jay Mahadeokar, Weiyi Zheng et al.

2022 INTERSPEECH

Federated Pruning: Improving Neural Network Efficiency with Federated Learning

Rongmei Lin, Yonghui Xiao, Tien-Ju Yang et al.

2022 INTERSPEECH

Federated Self-supervised Speech Representations: Are We There Yet?

Yan Gao, Javier Fernandez-Marques, Titouan Parcollet et al.

2022 INTERSPEECH

FedNST: Federated Noisy Student Training for Automatic Speech Recognition

Haaris Mehmood, Agnieszka Dobrowolska, Karthikeyan Saravanan et al.

2022 INTERSPEECH

Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding

Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang et al.

2022 INTERSPEECH

FFC-SE: Fast Fourier Convolution for Speech Enhancement

Ivan Shchekotov, Pavel K. Andreev, Oleg Ivanov et al.

2022 INTERSPEECH

FFM: A Frame Filtering Mechanism To Accelerate Inference Speed For Conformer In Speech Recognition

Zongfeng Quan, Nick J.C. Wang, Wei Chu et al.

2022 INTERSPEECH

Filler Word Detection and Classification: A Dataset and Benchmark

Ge Zhu, Juan-Pablo Caceres, Justin Salamon

2022 INTERSPEECH

FiLM Conditioning with Enhanced Feature to the Transformer-based End-to-End Noisy Speech Recognition

Da-Hee Yang, Joon-Hyuk Chang

2022 INTERSPEECH

Fine-grained Noise Control for Multispeaker Speech Synthesis

Karolos Nikitaras, Georgios Vamvoukakis, Nikolaos Ellinas et al.

2022 INTERSPEECH

Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition

Siqing Qin, Longbiao Wang, Sheng Li et al.

2022 INTERSPEECH

FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Models

Yeonghyeon Lee, Kangwook Jang, Jahyun Goo et al.

2022 INTERSPEECH

FlowCPCVC: A Contrastive Predictive Coding Supervised Flow Framework for Any-to-Any Voice Conversion

Jiahong Huang, Wen Xu, Yule Li et al.

2022 INTERSPEECH
