Papers
8,761 papers found
Diff-E: Diffusion-based Learning for Decoding Imagined Speech EEG
Soowon Kim, Young-Eun Lee, Seo-Hyun Lee et al.
Differentially Private Adapters for Parameter Efficient Acoustic Modeling
Chun-Wei Ho, Chao-Han Huck Yang, Sabato Marco Siniscalchi
Differential Privacy enabled Dementia Classification: An Exploration of the Privacy-Accuracy Trade-off in Speech Signal Data
Suhas BN, Sarah Rajtmajer, Saeed Abdullah
Differentiating acoustic and physiological features in speech for hypoxia detection
Benjamin O'Brien, Adrien Gresse, Jean-Baptise Billaud et al.
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation
Ha-Yeong Choi, Sang-Hoon Lee, Seong-Whan Lee
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
Ryosuke Sawata, Naoki Murata, Yuhta Takida et al.
DiffSLU: Knowledge Distillation Based Diffusion Model for Cross-Lingual Spoken Language Understanding
Tianjun Mao, Chenghong Zhang
Diffusion-based accent modelling in speech synthesis
Kamil Deja, Georgi Tinchev, Marta Czarnowska et al.
Directional Speech Recognition for Speaker Disambiguation and Cross-talk Suppression
Ju Lin, Niko Moritz, Ruiming Xie et al.
(Dis)agreement and Preference Structure are Reflected in Matching Along Distinct Acoustic-prosodic Features
Anneliese Kelterer, Margaret Zellers, Barbara Schuppler
Discovering COVID-19 Coughing and Breathing Patterns from Unlabeled Data Using Contrastive Learning with Varying Pre-Training Domains
Jinjin Cai, Sudip Vhaduri, Xiao Luo
Discovering Phonetic Feature Event Patterns in Transformer Embeddings
Patrick Cormac English, John D. Kelleher, Julie Carson-Berndsen
Discrimination of the Different Intents Carried by the Same Text Through Integrating Multimodal Information
Zhongjie Li, Gaoyan Zhang, Longbiao Wang et al.
Disentangled Representation Learning for Multilingual Speaker Recognition
Kihyun Nam, Youkyum Kim, Jaesung Huh et al.
Disentangling the Contribution of Non-native Speech in Automated Pronunciation Assessment
Shuju Shi, Kaiqi Fu, Yiwei Gu et al.
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction
Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya
Distant Speech Emotion Recognition in an Indoor Human-robot Interaction Scenario
Nicolás Grágeda, Eduardo Alvarado, Rodrigo Mahu et al.
Distillation Strategies for Discriminative Speech Recognition Rescoring
Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu et al.
Distilling knowledge from Gaussian process teacher to neural network student
Jeremy H. M. Wong, Huayun Zhang, Nancy F. Chen
DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
Haoyu Wang, Siyuan Wang, Wei-Qiang Zhang et al.
Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model
Xiang Li, Songxiang Liu, Max W. Y. Lam et al.
DNN-based Parameter Estimation for MVDR Beamforming and Post-filtering
Minseung Kim, Sein Cheong, Jong Won Shin
Domain Adaptation for Speech Enhancement in a Large Domain Gap
Lior Frenkel, Jacob Goldberger, Shlomo E. Chazan