From adaptive score normalization to adaptive data normalization for speaker verification systems

Sandro Cumani, Salvatore Sarni

2023 INTERSPEECH

From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion

Jingyao Wu, Ting Dang, Vidhyasaharan Sethu et al.

2023 INTERSPEECH

FTA-net: A Frequency and Time Attention Network for Speech Depression Detection

Qifei Li, Dong Wang, Yiming Ren et al.

2023 INTERSPEECH

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

Zhifu Gao, Zerui Li, Jiaming Wang et al.

2023 INTERSPEECH

FusedF0: Improving DNN-based F0 Estimation by Fusion of Summary-Correlograms and Raw Waveform Representations of Speech Signals

Eray Eren, Lee Ngee Tan, Abeer Alwan

2023 INTERSPEECH

Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations

Wenbin Wang, Yang Song, Sanjay Jha

2023 INTERSPEECH

General-purpose Adversarial Training for Enhanced Automatic Speech Recognition Model Generalization

Dohee Kim, Daeyeol Shim, Joon-Hyuk Chang

2023 INTERSPEECH

Generating high-resolution 3D real-time MRI of the vocal tract

Martin Strauch, Antoine Serrurier

2023 INTERSPEECH

Generating Multilingual Gender-Ambiguous Text-to-Speech Voices

Konstantinos Markopoulos, Georgia Maniati, Georgios Vamvoukakis et al.

2023 INTERSPEECH

GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech

Yahuan Cong, Haoyu Zhang, Haopeng Lin et al.

2023 INTERSPEECH

Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction

Wenzhe Liu, Yupeng Shi, Jun Chen et al.

2023 INTERSPEECH

GhostRNN: Reducing State Redundancy in RNN with Cheap Operations

Hang Zhou, Xiaoxu Zheng, Yunhe Wang et al.

2023 INTERSPEECH

GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering

Xuxin Cheng, Zhihong Zhu, Ziyu Yao et al.

2023 INTERSPEECH

GigaST: A 10,000-hour Pseudo Speech Translation Corpus

Rong Ye, Chengqi Zhao, Tom Ko et al.

2023 INTERSPEECH

Glottal source analysis of voice deficits in basal ganglia dysfunction: evidence from de novo Parkinson's disease and Huntington's disease

Michal Novotný, Tereza Tykalová, Michal Šimek et al.

2023 INTERSPEECH

GL-SSD: Global and Local Speech Style Disentanglement by vector quantization for robust sentence boundary detection in speech stream

Kuncai Zhang, Wei Zhou, Pengcheng Zhu et al.

2023 INTERSPEECH

GPU-accelerated Guided Source Separation for Meeting Transcription

Desh Raj, Daniel Povey, Sanjeev Khudanpur

2023 INTERSPEECH

GRAVO: Learning to Generate Relevant Audio from Visual Features with Noisy Online Videos

Youngdo Ahn, Chengyi Wang, Yu Wu et al.

2023 INTERSPEECH

Group GMM-ResNet for Detection of Synthetic Speech Attacks

Zhenchun Lei, Yan Wen, Yingen Yang et al.

2023 INTERSPEECH

HABLA: A Dataset of Latin American Spanish Accents for Voice Anti-spoofing

Pablo Andrés Tamayo Flórez, Rubén Manrique, Bernardo Pereira Nunes

2023 INTERSPEECH

HAD-ANC: A Hybrid System Comprising an Adaptive Filter and Deep Neural Networks for Active Noise Control

JungPhil Park, Jeong-Hwan Choi, Yungyeo Kim et al.

2023 INTERSPEECH

Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches

Vinicius Ribeiro, Yiteng Huang, Yuan Shangguan et al.

2023 INTERSPEECH

Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model

Xiaohuai Le, Tong Lei, Li Chen et al.

2023 INTERSPEECH

HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders

Doyeon Kim, Soo-Whan Chung, Hyewon Han et al.

2023 INTERSPEECH

Head movements in two- and four-person interactive conversational tasks in noisy and moderately reverberant conditions

Alan Archer-Boyd, Rainer Martin

2023 INTERSPEECH
