Papers
Few-Shot Audio Classification with Attentional Graph Neural Networks
Shilei Zhang, Yong Qin, Kewei Sun et al.
Fine-Grained Robust Prosody Transfer for Single-Speaker Neural Text-To-Speech
Viacheslav Klimkov, Srikanth Ronanki, Jonas Rohnke et al.
Follow-Up Question Generation Using Neural Tensor Network-Based Domain Ontology Population in an Interview Coaching System
Ming-Hsiang Su, Chung-Hsien Wu, Yi Chang
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams
Guanlong Zhao, Shaojin Ding, Ricardo Gutierrez-Osuna
Foreign-Language Knowledge Enhances Artificial-Language Segmentation
Annie Tremblay, Mirjam Broersma
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition
Kartik Audhkhasi, George Saon, Zoltán Tüske et al.
Formant Pattern and Spectral Shape Ambiguity of Vowel Sounds, and Related Phenomena of Vowel Acoustics — Exemplary Evidence
Dieter Maurer, Heidy Suter, Christian d’Hereuse et al.
Forward-Backward Decoding for Regularizing End-to-End TTS
Yibin Zheng, Xi Wang, Lei He et al.
Framewise Supervised Training Towards End-to-End Speech Recognition Models: First Results
Mohan Li, Yuanjiang Cao, Weicong Zhou et al.
Framework for Conducting Tasks Requiring Human Assessment
Martin Grůber, Adam Chýlek, Jindřich Matoušek
Fréchet Audio Distance: A Reference-Free Metric for Evaluating Music Enhancement Algorithms
Kevin Kilgour, Mauricio Zuluaga, Dominik Roblek et al.
Frication as a Vowel Feature? — Evidence from the Rui’an Wu Chinese Dialect
Fang Hu, Youjue He
Front-End Feature Compensation and Denoising for Noise Robust Speech Emotion Recognition
Rupayan Chakraborty, Ashish Panda, Meghna Pandharipande et al.
Fully-Convolutional Network for Pitch Estimation of Speech Signals
Luc Ardaillon, Axel Roebel
Fundamental Frequency Accommodation in Multi-Party Human-Robot Game Interactions: The Effect of Winning or Losing
Omnia Ibrahim, Gabriel Skantze, Sabine Stoll et al.
Fusion Strategy for Prosodic and Lexical Representations of Word Importance
Sushant Kafle, Cecilia Ovesdotter Alm, Matt Huenerfauth
Fusion Techniques for Utterance-Level Emotion Recognition Combining Speech and Transcripts
Jilt Sebastian, Piero Pierucci
GECKO — A Tool for Effective Annotation of Human Conversations
Golan Levy, Raquel Sitman, Ido Amir et al.
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram
Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi et al.
Gender De-Biasing in Speech Emotion Recognition
Cristina Gorrostieta, Reza Lotfian, Kye Taylor et al.
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech
Li-Wei Chen, Hung-Yi Lee, Yu Tsao
Generative Noise Modeling and Channel Simulation for Robust Speech Recognition in Unseen Conditions
Meet Soni, Sonal Joshi, Ashish Panda
GFM-Voc: A Real-Time Voice Quality Modification System
Olivier Perrotin, Ian McLoughlin
Glottal Closure Instants Detection from Speech Signal by Deep Features Extracted from Raw Speech and Linear Prediction Residual
Gurunath Reddy M., K. Sreenivasa Rao, Partha Pratim Das