Papers
From adaptive score normalization to adaptive data normalization for speaker verification systems
Sandro Cumani, Salvatore Sarni
From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion
Jingyao Wu, Ting Dang, Vidhyasaharan Sethu et al.
FTA-net: A Frequency and Time Attention Network for Speech Depression Detection
Qifei Li, Dong Wang, Yiming Ren et al.
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Zhifu Gao, Zerui Li, Jiaming Wang et al.
FusedF0: Improving DNN-based F0 Estimation by Fusion of Summary-Correlograms and Raw Waveform Representations of Speech Signals
Eray Eren, Lee Ngee Tan, Abeer Alwan
Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations
Wenbin Wang, Yang Song, Sanjay Jha
General-purpose Adversarial Training for Enhanced Automatic Speech Recognition Model Generalization
Dohee Kim, Daeyeol Shim, Joon-Hyuk Chang
Generating high-resolution 3D real-time MRI of the vocal tract
Martin Strauch, Antoine Serrurier
Generating Multilingual Gender-Ambiguous Text-to-Speech Voices
Konstantinos Markopoulos, Georgia Maniati, Georgios Vamvoukakis et al.
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Yahuan Cong, Haoyu Zhang, Haopeng Lin et al.
Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Wenzhe Liu, Yupeng Shi, Jun Chen et al.
GhostRNN: Reducing State Redundancy in RNN with Cheap Operations
Hang Zhou, Xiaoxu Zheng, Yunhe Wang et al.
GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering
Xuxin Cheng, Zhihong Zhu, Ziyu Yao et al.
GigaST: A 10,000-hour Pseudo Speech Translation Corpus
Rong Ye, Chengqi Zhao, Tom Ko et al.
Glottal source analysis of voice deficits in basal ganglia dysfunction: evidence from de novo Parkinson's disease and Huntington's disease
Michal Novotný, Tereza Tykalová, Michal Šimek et al.
GL-SSD: Global and Local Speech Style Disentanglement by vector quantization for robust sentence boundary detection in speech stream
Kuncai Zhang, Wei Zhou, Pengcheng Zhu et al.
GPU-accelerated Guided Source Separation for Meeting Transcription
Desh Raj, Daniel Povey, Sanjeev Khudanpur
GRAVO: Learning to Generate Relevant Audio from Visual Features with Noisy Online Videos
Youngdo Ahn, Chengyi Wang, Yu Wu et al.
Group GMM-ResNet for Detection of Synthetic Speech Attacks
Zhenchun Lei, Yan Wen, Yingen Yang et al.
HABLA: A Dataset of Latin American Spanish Accents for Voice Anti-spoofing
Pablo Andrés Tamayo Flórez, Rubén Manrique, Bernardo Pereira Nunes
HAD-ANC: A Hybrid System Comprising an Adaptive Filter and Deep Neural Networks for Active Noise Control
JungPhil Park, Jeong-Hwan Choi, Yungyeo Kim et al.
Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
Vinicius Ribeiro, Yiteng Huang, Yuan Shangguan et al.
Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model
Xiaohuai Le, Tong Lei, Li Chen et al.
HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Doyeon Kim, Soo-Whan Chung, Hyewon Han et al.
Head movements in two- and four-person interactive conversational tasks in noisy and moderately reverberant conditions
Alan Archer-Boyd, Rainer Martin