Papers
FeatherWave: An Efficient High-Fidelity Neural Vocoder with Multi-Band Linear Prediction
Qiao Tian, Zewang Zhang, Heng Lu et al.
FinChat: Corpus and Evaluation Setup for Finnish Chat Conversations on Everyday Topics
Katri Leino, Juho Leinonen, Mittul Singh et al.
Finding Intelligible Consonant-Vowel Sounds Using High-Quality Articulatory Synthesis
Daniel R. van Niekerk, Anqi Xu, Branislav Gerazov et al.
Finnish ASR with Deep Transformer Models
Abhilash Jain, Aku Rouhe, Stig-Arne Grönroos et al.
Focal Loss for Punctuation Prediction
Jiangyan Yi, Jianhua Tao, Zhengkun Tian et al.
Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism
Wang Dai, Jinsong Zhang, Yingming Gao et al.
Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning
Hao Li, DeLiang Wang, Xueliang Zhang et al.
Frame-Wise Online Unsupervised Adaptation of DNN-HMM Acoustic Model from Perspective of Robust Adaptive Filtering
Ryu Takeda, Kazunori Komatani
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint
Zexin Cai, Chuxiong Zhang, Ming Li
FT Speech: Danish Parliament Speech Corpus
Andreas Kirkedal, Marija Stepanović, Barbara Plank
Fundamental Frequency Model for Postfiltering at Low Bitrates in a Transform-Domain Speech and Audio Codec
Sneha Das, Tom Bäckström, Guillaume Fuchs
Fusion Architectures for Word-Based Audiovisual Speech Recognition
Michael Wand, Jürgen Schmidhuber
FusionRNN: Shared Neural Parameters for Multi-Channel Distant Speech Recognition
Titouan Parcollet, Xinchi Qiu, Nicholas D. Lane
Gaming Corpus for Studying Social Screams
Hiroki Mori, Yuki Kikuchi
GAN-Based Data Generation for Speech Emotion Recognition
Sefik Emre Eskimez, Dimitrios Dimitriadis, Robert Gmyr et al.
Gated Multi-Head Attention Pooling for Weakly Labelled Audio Tagging
Sixin Hong, Yuexian Zou, Wenwu Wang
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-Channel Speech Separation with Deep Embedding Representations
Cunhang Fan, Jianhua Tao, Bin Liu et al.
GAZEV: GAN-Based Zero-Shot Voice Conversion Over Non-Parallel Speech Corpus
Zining Zhang, Bingsheng He, Zhenjie Zhang
Generative Adversarial Network Based Acoustic Echo Cancellation
Yi Zhang, Chengyun Deng, Shiqian Ma et al.
Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition
Kohei Matsuura, Masato Mimura, Shinsuke Sakai et al.
Generic Indic Text-to-Speech Synthesisers with Rapid Adaptation in an End-to-End Framework
Anusha Prakash, Hema A. Murthy
GEV Beamforming Supported by DOA-Based Masks Generated on Pairs of Microphones
François Grondin, Jean-Samuel Lauzon, Jonathan Vincent et al.
Glottal Closure Instants Detection from EGG Signal by Classification Approach
Gurunath Reddy M., K. Sreenivasa Rao, Partha Pratim Das