Research Explorer

Privacy PORCUPINE: Anonymization of Speaker Attributes Using Occurrence Normalization for Space-Filling Vector Quantization

Mohammad Hassan Vali, Tom Bäckström

2024 INTERSPEECH

Probing the Feasibility of Multilingual Speaker Anonymization

Sarina Meyer, Florian Lux, Ngoc Thang Vu

2024 INTERSPEECH

Production of fricative consonants in French-speaking children with cochlear implants and typical hearing: acoustic and phonological analyses.

Sophie Fagniart, Brigitte Charlier, Véronique Delvaux et al.

2024 INTERSPEECH

Production of phrases by mechanical models of the human vocal tract

Takayuki Arai, Ryohei Suzuki, Chandler Earp et al.

2024 INTERSPEECH

Prompting Large Language Models with Audio for General-Purpose Speech Summarization

Wonjune Kang, Deb Roy

2024 INTERSPEECH

Prompting Large Language Models with Mispronunciation Detection and Diagnosis Abilities

Minglin Wu, Jing Xu, Xixin Wu et al.

2024 INTERSPEECH

Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding

Mohan Li, Simon Keizer, Rama Doddipatla

2024 INTERSPEECH

Prompt Link Multimodal Fusion in Multimodal Sentiment Analysis

Kang Zhu, Cunhang Fan, Jianhua Tao et al.

2024 INTERSPEECH

Prompt Tuning for Audio Deepfake Detection: Computationally Efficient Test-time Domain Adaptation with Limited Target Dataset

Hideyuki Oiso, Yuto Matsunaga, Kazuya Kakizaki et al.

2024 INTERSPEECH

Prompt Tuning for Speech Recognition on Unknown Spoken Name Entities

Xizi Wei, Stephen McGregor

2024 INTERSPEECH

Prosodic marking of syntactic boundaries in Khoekhoe

Kira Tulchynska, Sylvanus Job, Alena Witzlack-Makarevich et al.

2024 INTERSPEECH

Prosody-Driven Privacy-Preserving Dementia Detection

Dominika Woszczyk, Ranya Aloufi, Soteris Demetriou

2024 INTERSPEECH

Prosody of speech production in latent post-stroke aphasia

Cong Zhang, Tong Li, Gayle DeDe et al.

2024 INTERSPEECH

PRVAE-VC2: Non-Parallel Voice Conversion by Distillation of Speech Representations

Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko et al.

2024 INTERSPEECH

QGAN: Low Footprint Quaternion Neural Vocoder for Speech Synthesis

Aryan Chaudhary, Vinayak Abrol

2024 INTERSPEECH

QHM-GAN: Neural Vocoder based on Quasi-Harmonic Modeling

Shaowen Chen, Tomoki Toda

2024 INTERSPEECH

Qifusion-Net: Layer-adapted Stream/Non-stream Model for End-to-End Multi-Accent Speech Recognition

Jinming Chen, Jingyi Fang, Yuanzhong Zheng et al.

2024 INTERSPEECH

QMixCAT: Unsupervised Speech Enhancement Using Quality-guided Signal Mixing and Competitive Alternating Model Training

Shilin Wang, Haixin Guan, Yanhua Long

2024 INTERSPEECH

Quantification of stylistic differences in human- and ASR-produced transcripts of African American English

Annika Heuser, Tyler Kendall, Miguel del Rio et al.

2024 INTERSPEECH

Quantifying the effect of speech pathology on automatic and human speaker verification

Bence Mark Halpern, Thomas Tienkamp, Wen-Chin Huang et al.

2024 INTERSPEECH

Quantifying the Role of Textual Predictability in Automatic Speech Recognition

Sean Robertson, Gerald Penn, Ewan Dunbar

2024 INTERSPEECH

Quantifying Unintended Memorization in BEST-RQ ASR Encoders

Virat Shejwalkar, Om Thakkar, Arun Narayanan

2024 INTERSPEECH

Quantity-sensitivity affects recall performance of word stress

Constantijn Kaland, Maria Lialiou

2024 INTERSPEECH

Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning

Zhenyu Wang, Shuyu Kong, Li Wan et al.

2024 INTERSPEECH

RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention

Mingshuai Liu, Zhuangqi Chen, Xiaopeng Yan et al.

2024 INTERSPEECH

Papers