Papers
8,761 papers found
DB-PMAE: Dual-Branch Prototypical Masked AutoEncoder with locality for domain robust speaker verification
Wei-lin Xie, Yu-Xuan Xi, Yan Song et al.
Deciphering Assamese Vowel Harmony with Featural InfoWaveGAN
Sneha Ray Barman, Shakuntala Mahanta, Neeraj Kumar Sharma
Decoder-only Architecture for Streaming End-to-end Speech Recognition
Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi et al.
Decoding Human Language Acquisition: EEG Evidence for Predictive Probabilistic Statistics in Word Segmentation
Bin Zhao, Mingxuan Huang, Chenlu Ma et al.
Deep Echo Path Modeling for Acoustic Echo Cancellation
Fei Zhao, Chenggang Zhang, Shulin He et al.
DeFTAN-AA: Array Geometry Agnostic Multichannel Speech Enhancement
Dongheon Lee, Jung-Woo Choi
Depression Enhances Internal Inconsistency between Spoken and Semantic Emotion: Evidence from the Analysis of Emotion Expression in Conversation
Xinyi Wu, Changqing Xu, Nan Li et al.
Design of Feedback Active Noise Cancellation Filter Using Nested Recurrent Neural Networks
Alireza Bayestehtashk, Amit Kumar, Mike Wurtz
DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment
Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu et al.
Detecting Empathy in Speech
Run Chen, Haozhe Chen, Anushka Kulkarni et al.
Detecting the terminality of speech-turn boundary for spoken interactions in French TV and Radio content
Rémi Uro, Marie Tahon, David Doukhan et al.
Detection of background agents speech in contact centers
Abhishek Kumar, Srikanth Konjeti, Jithendra Vepa
Detection of Cognitive Impairment And Alzheimer's Disease Using a Speech- and Language-Based Protocol
Tanya Talkar, Sherman Charles, Chelsea Krantsevich et al.
Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder
Jihyun Mun, Sunhee Kim, Minhwa Chung
Developing Multi-Disorder Voice Protocols: A team science approach involving clinical expertise, bioethics, standards, and DEI.
Anaïs Rameau, Satrajit Ghosh, Alexandros Sigaras et al.
Developing vocal system impaired patient-aimed voice quality assessment approach using ASR representation-included multiple features
Shaoxiang Dang, Tetsuya Matsumoto, Yoshinori Takeuchi et al.
DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing
Kuang Yuan, Shuo Han, Swarun Kumar et al.
DGPN: A Dual Graph Prototypical Network for Few-Shot Speech Spoofing Algorithm Recognition
Zirui Ge, Xinzhou Xu, Haiyan Guo et al.
DGSRN: Noise-Robust Speech Recognition Method with Dual-Path Gated Spectral Refinement Network
Wenjun Wang, Shangbin Mo, Ling Dong et al.
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
Quan Wang, Yiling Huang, Guanlong Zhao et al.
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval
Yifei Xin, Xuxin Cheng, Zhihong Zhu et al.
Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis
Chin-Yun Yu, György Fazekas
Diffusion Gaussian Mixture Audio Denoise
Pu Wang, Junhui Li, Jialu Li et al.
Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation
Nameer Hirschkind, Xiao Yu, Mahesh Kumar Nandwana et al.