Research Explorer

DB-PMAE: Dual-Branch Prototypical Masked AutoEncoder with locality for domain robust speaker verification

Wei-lin Xie, Yu-Xuan Xi, Yan Song et al.

2024 INTERSPEECH

Deciphering Assamese Vowel Harmony with Featural InfoWaveGAN

Sneha Ray Barman, Shakuntala Mahanta, Neeraj Kumar Sharma

2024 INTERSPEECH

Decoder-only Architecture for Streaming End-to-end Speech Recognition

Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi et al.

2024 INTERSPEECH

Decoding Human Language Acquisition: EEG Evidence for Predictive Probabilistic Statistics in Word Segmentation

Bin Zhao, Mingxuan Huang, Chenlu Ma et al.

2024 INTERSPEECH

Deep Echo Path Modeling for Acoustic Echo Cancellation

Fei Zhao, Chenggang Zhang, Shulin He et al.

2024 INTERSPEECH

Deep Prosodic Features in Tandem with Perceptual Judgments of Word Reduction for Tone Recognition in Conversed Speech

Xiang-Li Lu, Yi-Fen Liu

2024 INTERSPEECH

DeFTAN-AA: Array Geometry Agnostic Multichannel Speech Enhancement

Dongheon Lee, Jung-Woo Choi

2024 INTERSPEECH

Depression Enhances Internal Inconsistency between Spoken and Semantic Emotion: Evidence from the Analysis of Emotion Expression in Conversation

Xinyi Wu, Changqing Xu, Nan Li et al.

2024 INTERSPEECH

Design of Feedback Active Noise Cancellation Filter Using Nested Recurrent Neural Networks

Alireza Bayestehtashk, Amit Kumar, Mike Wurtz

2024 INTERSPEECH

DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu et al.

2024 INTERSPEECH

Detecting Empathy in Speech

Run Chen, Haozhe Chen, Anushka Kulkarni et al.

2024 INTERSPEECH

Detecting the terminality of speech-turn boundary for spoken interactions in French TV and Radio content

Rémi Uro, Marie Tahon, David Doukhan et al.

2024 INTERSPEECH

Detection of background agents speech in contact centers

Abhishek Kumar, Srikanth Konjeti, Jithendra Vepa

2024 INTERSPEECH

Detection of Cognitive Impairment And Alzheimer's Disease Using a Speech- and Language-Based Protocol

Tanya Talkar, Sherman Charles, Chelsea Krantsevich et al.

2024 INTERSPEECH

Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder

Jihyun Mun, Sunhee Kim, Minhwa Chung

2024 INTERSPEECH

Developing Multi-Disorder Voice Protocols: A team science approach involving clinical expertise, bioethics, standards, and DEI.

Anaïs Rameau, Satrajit Ghosh, Alexandros Sigaras et al.

2024 INTERSPEECH

Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features

Shaoxiang Dang, Tetsuya Matsumoto, Yoshinori Takeuchi et al.

2024 INTERSPEECH

DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing

Kuang Yuan, Shuo Han, Swarun Kumar et al.

2024 INTERSPEECH

DGPN: A Dual Graph Prototypical Network for Few-Shot Speech Spoofing Algorithm Recognition

Zirui Ge, Xinzhou Xu, Haiyan Guo et al.

2024 INTERSPEECH

DGSRN: Noise-Robust Speech Recognition Method with Dual-Path Gated Spectral Refinement Network

Wenjun Wang, Shangbin Mo, Ling Dong et al.

2024 INTERSPEECH

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Quan Wang, Yiling Huang, Guanlong Zhao et al.

2024 INTERSPEECH

DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval

Yifei Xin, Xuxin Cheng, Zhihong Zhu et al.

2024 INTERSPEECH

Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis

Chin-Yun Yu, György Fazekas

2024 INTERSPEECH

Diffusion Gaussian Mixture Audio Denoise

Pu Wang, Junhui Li, Jialu Li et al.

2024 INTERSPEECH

Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation

Nameer Hirschkind, Xiao Yu, Mahesh Kumar Nandwana et al.

2024 INTERSPEECH

Papers