Research Explorer

Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition

Haoqin Sun, Shiwan Zhao, Xiangyu Kong et al.

2024 INTERSPEECH

It’s Time to Take Action: Acoustic Modeling of Motor Verbs to Detect Parkinson’s Disease

Daniel Escobar-Grisales, Cristian David Ríos-Urrego, Ilja Baumann et al.

2024 INTERSPEECH

JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis

Hyunjae Cho, Junhyeok Lee, Wonbin Jung

2024 INTERSPEECH

Joint Learning of Context and Feedback Embeddings in Spoken Dialogue

Livia Qian, Gabriel Skantze

2024 INTERSPEECH

Joint prediction of subjective listening effort and speech intelligibility based on end-to-end learning

Dirk Eike Hoffner, Jana Roßbach, Bernd T. Meyer

2024 INTERSPEECH

Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition

Guinan Li, Jiajun Deng, Youjun Chen et al.

2024 INTERSPEECH

Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control

Alexander Blatt, Aravind Krishnan, Dietrich Klakow

2024 INTERSPEECH

Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices.

Atli Sigurgeirsson, Eddie L. Ungless

2024 INTERSPEECH

Keep, Delete, or Substitute: Frame Selection Strategy for Noise-Robust Speech Emotion Recognition

Seong-Gyun Leem, Daniel Fulford, Jukka-Pekka Onnela et al.

2024 INTERSPEECH

Key Acoustic Cues for the Realization of Metrical Prominence in Tone Languages: A Cross-Dialect Study

Yiying Hu, Hui Feng

2024 INTERSPEECH

Key-Element-Informed sLLM Tuning for Document Summarization

Sangwon Ryu, Heejin Do, Yunsu Kim et al.

2024 INTERSPEECH

Keyword-Guided Adaptation of Automatic Speech Recognition

Aviv Shamsian, Aviv Navon, Neta Glazer et al.

2024 INTERSPEECH

K-means and hierarchical clustering of f0 contours

Constantijn Kaland, Jeremy Steffman, Jennifer Cole

2024 INTERSPEECH

Knowledge boosting during low-latency inference

Vidya Srinivas, Malek Itani, Tuochao Chen et al.

2024 INTERSPEECH

Knowledge Distillation for Tiny Speech Enhancement with Latent Feature Augmentation

Behnam Gholami, Mostafa El-Khamy, KeeBong Song

2024 INTERSPEECH

Knowledge Distillation from Self-Supervised Representation Learning Model with Discrete Speech Units for Any-to-Any Streaming Voice Conversion

Hiroki Kanagawa, Yusuke Ijima

2024 INTERSPEECH

Knowledge-Preserving Pluggable Modules for Multilingual Speech Translation Tasks

Nan Chen, Yonghe Wang, Feilong Bao

2024 INTERSPEECH

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation

Wenhao Guan, Kaidi Wang, Wangjin Zhou et al.

2024 INTERSPEECH

LAHAJA: A Robust Multi-accent Benchmark for Evaluating Hindi ASR Systems

Tahir Javed, Janki Nawale, Sakshi Joshi et al.

2024 INTERSPEECH

Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition

Hao Yen, Pin-Jui Ku, Sabato Marco Siniscalchi et al.

2024 INTERSPEECH

Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder

Yuejiao Wang, Xianmin Gong, Lingwei Meng et al.

2024 INTERSPEECH

Large Language Models for Dysfluency Detection in Stuttered Speech

Dominik Wagner, Sebastian P. Bayerl, Ilja Baumann et al.

2024 INTERSPEECH

LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks

Amit Meghanani, Thomas Hain

2024 INTERSPEECH

LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance

Shihao Chen, Yu Gu, Jie Zhang et al.

2024 INTERSPEECH

Learnable Layer Selection and Model Fusion for Speech Self-Supervised Learning Models

Sheng-Chieh Chiu, Chia-Hua Wu, Jih-Kang Hsieh et al.

2024 INTERSPEECH

Papers