Papers
Feature Exploration for Almost Zero-Resource ASR-Free Keyword Spotting Using a Multilingual Bottleneck Extractor and Correspondence Autoencoders
Raghav Menon, Herman Kamper, Ewald van der Westhuizen et al.
Feature Representation of Pathophysiology of Parkinsonian Dysarthria
Alice Rueda, J.C. Vásquez-Correa, Cristian David Rios-Urrego et al.
Feature Space Visualization with Spatial Similarity Maps for Pathological Speech Data
Philipp Klumpp, J.C. Vásquez-Correa, Tino Haderlein et al.
Few-Shot Audio Classification with Attentional Graph Neural Networks
Shilei Zhang, Yong Qin, Kewei Sun et al.
Fine-Grained Robust Prosody Transfer for Single-Speaker Neural Text-To-Speech
Viacheslav Klimkov, Srikanth Ronanki, Jonas Rohnke et al.
Follow-Up Question Generation Using Neural Tensor Network-Based Domain Ontology Population in an Interview Coaching System
Ming-Hsiang Su, Chung-Hsien Wu, Yi Chang
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams
Guanlong Zhao, Shaojin Ding, Ricardo Gutierrez-Osuna
Foreign-Language Knowledge Enhances Artificial-Language Segmentation
Annie Tremblay, Mirjam Broersma
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition
Kartik Audhkhasi, George Saon, Zoltán Tüske et al.
Formant Pattern and Spectral Shape Ambiguity of Vowel Sounds, and Related Phenomena of Vowel Acoustics — Exemplary Evidence
Dieter Maurer, Heidy Suter, Christian d’Heureuse et al.
Forward-Backward Decoding for Regularizing End-to-End TTS
Yibin Zheng, Xi Wang, Lei He et al.
Framewise Supervised Training Towards End-to-End Speech Recognition Models: First Results
Mohan Li, Yuanjiang Cao, Weicong Zhou et al.
Framework for Conducting Tasks Requiring Human Assessment
Martin Grůber, Adam Chýlek, Jindřich Matoušek
Fréchet Audio Distance: A Reference-Free Metric for Evaluating Music Enhancement Algorithms
Kevin Kilgour, Mauricio Zuluaga, Dominik Roblek et al.
Frication as a Vowel Feature? — Evidence from the Rui’an Wu Chinese Dialect
Fang Hu, Youjue He
Front-End Feature Compensation and Denoising for Noise Robust Speech Emotion Recognition
Rupayan Chakraborty, Ashish Panda, Meghna Pandharipande et al.
Fully-Convolutional Network for Pitch Estimation of Speech Signals
Luc Ardaillon, Axel Roebel
Fundamental Frequency Accommodation in Multi-Party Human-Robot Game Interactions: The Effect of Winning or Losing
Omnia Ibrahim, Gabriel Skantze, Sabine Stoll et al.
Fusion Strategy for Prosodic and Lexical Representations of Word Importance
Sushant Kafle, Cecilia Ovesdotter Alm, Matt Huenerfauth
Fusion Techniques for Utterance-Level Emotion Recognition Combining Speech and Transcripts
Jilt Sebastian, Piero Pierucci
GECKO — A Tool for Effective Annotation of Human Conversations
Golan Levy, Raquel Sitman, Ido Amir et al.
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram
Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi et al.
Gender De-Biasing in Speech Emotion Recognition
Cristina Gorrostieta, Reza Lotfian, Kye Taylor et al.
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech
Li-Wei Chen, Hung-Yi Lee, Yu Tsao