Papers
Expressive, Variable, and Controllable Duration Modelling in TTS
Syed Ammar Abbas, Thomas Merritt, Alexis Moinet et al.
Extended U-Net for Speaker Verification in Noisy Environments
Ju-Ho Kim, Jungwoo Heo, Hye-jin Shim et al.
Extending Compositional Attention Networks for Social Reasoning in Videos
Christina Sartzetaki, Georgios Paraskevopoulos, Alexandros Potamianos
Extending GCC-PHAT using Shift Equivariant Neural Networks
Axel Berg, Mark O'Connor, Kalle Åström et al.
Extending RNN-T-based speech recognition systems with emotion and language classification
Zvi Kons, Hagai Aronowitz, Edmilson Morais et al.
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge
Guolong Zhong, Hongyu Song, Ruoyu Wang et al.
Extract and Abstract with BART for Clinical Notes from Doctor-Patient Conversations
Jing Su, Longxiang Zhang, Hamid Reza Hassanzadeh et al.
Extracting Targeted Training Data from ASR Models, and How to Mitigate It
Ehsan Amid, Om Dipakbhai Thakkar, Arun Narayanan et al.
Factors affecting the percept of Yanny v. Laurel (or mixed): Insights from a large-scale study on Swiss German listeners
Adrian Leemann, Péter Jeszenszky, Carina Steiner et al.
Fast Grad-TTS: Towards Efficient Diffusion-Based Speech Generation on CPU
Ivan Vovk, Tasnima Sadekova, Vladimir Gogoryan et al.
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation
Manthan Thakker, Sefik Emre Eskimez, Takuya Yoshioka et al.
FeaRLESS: Feature Refinement Loss for Ensembling Self-Supervised Learning Features in Robust End-to-end Speech Recognition
Szu-Jui Chen, Jiamin Xie, John H.L. Hansen
Federated Domain Adaptation for ASR with Full Self-Supervision
Junteng Jia, Jay Mahadeokar, Weiyi Zheng et al.
Federated Pruning: Improving Neural Network Efficiency with Federated Learning
Rongmei Lin, Yonghui Xiao, Tien-Ju Yang et al.
Federated Self-supervised Speech Representations: Are We There Yet?
Yan Gao, Javier Fernandez-Marques, Titouan Parcollet et al.
FedNST: Federated Noisy Student Training for Automatic Speech Recognition
Haaris Mehmood, Agnieszka Dobrowolska, Karthikeyan Saravanan et al.
Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding
Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang et al.
FFC-SE: Fast Fourier Convolution for Speech Enhancement
Ivan Shchekotov, Pavel K. Andreev, Oleg Ivanov et al.
FFM: A Frame Filtering Mechanism To Accelerate Inference Speed For Conformer In Speech Recognition
Zongfeng Quan, Nick J.C. Wang, Wei Chu et al.
Filler Word Detection and Classification: A Dataset and Benchmark
Ge Zhu, Juan-Pablo Caceres, Justin Salamon
FiLM Conditioning with Enhanced Feature to the Transformer-based End-to-End Noisy Speech Recognition
Da-Hee Yang, Joon-Hyuk Chang
Fine-grained Noise Control for Multispeaker Speech Synthesis
Karolos Nikitaras, Georgios Vamvoukakis, Nikolaos Ellinas et al.
Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition
Siqing Qin, Longbiao Wang, Sheng Li et al.
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Models
Yeonghyeon Lee, Kangwook Jang, Jahyun Goo et al.
FlowCPCVC: A Contrastive Predictive Coding Supervised Flow Framework for Any-to-Any Voice Conversion
Jiahong Huang, Wen Xu, Yule Li et al.