Papers
8,761 papers found
Accentor: An Explicit Lexical Stress Model for TTS Systems
Diana Geneva, Georgi Shopov, Kostadin Garov et al.
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
Xian Shi, Haoneng Luo, Zhifu Gao et al.
Accurate and Structured Pruning for Efficient Automatic Speech Recognition
Huiqiang Jiang, Li Lyna Zhang, Yuang Li et al.
A Compact End-to-End Model with Local and Global Context for Spoken Language Identification
Fei Jia, Nithin Rao Koluguri, Jagadeesh Balam et al.
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
Yifan Peng, Kwangyoun Kim, Felix Wu et al.
A Compressed Synthetic Speech Detection Method with Compression Feature Embedding
Jinghong Zhang, Xiaowei Yi, Xianfeng Zhao
A conformer-based classifier for variable-length utterance processing in anti-spoofing
Eros Rosello, Alejandro Gomez-Alanis, Angel M. Gomez et al.
A Context-Constrained Sentence Modeling for Deception Detection in Real Interrogation
Ya-Tse Wu, Yuan-Ting Chang, Shao-Hao Lu et al.
Acoustic characteristics of depression in older adults' speech: the role of covariates
Carmen Mijnders, Esther Janse, Paul Naarding et al.
Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /ɹ/ in Child Speech Sound Disorders
Nina R Benway, Yashish M Siriwardena, Jonathan L Preston et al.
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling
Ramon Sanabria, Ondřej Klejch, Hao Tang et al.
Active Learning for Abnormal Lung Sound Data Curation and Detection in Asthma
Shabnam Ghaffarzadegan, Luca Bondi, Ho-Hsiang Wu et al.
AdaMS: Deep Metric Learning with Adaptive Margin and Adaptive Scale for Acoustic Word Discrimination
Myunghun Jung, Hoirin Kim
Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Guy Yariv, Itai Gat, Lior Wolf et al.
Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks
László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya et al.
Adaptation of Whisper models to child speech recognition
Rishabh Jain, Andrei Barcovschi, Mariam Yiwere et al.
Adaptation to predictive prosodic cues in non-native standard dialect
Sabine Gosselke Berthelsen
Adapter-Based Extension of Multi-Speaker Text-To-Speech Model for New Speakers
Cheng-Ping Hsieh, Subhankar Ghosh, Boris Ginsburg
Adapter Incremental Continual Learning of Efficient Audio Spectrogram Transformers
Nithish Muthuchamy Selvaraj, Xiaobao Guo, Adams Kong et al.
ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Ambuj Mehrish, Abhinav Ramesh Kashyap, Li Yingting et al.
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition
Dianwen Ng, Chong Zhang, Ruixi Zhang et al.
Adapting a ConvNeXt Model to Audio Classification on AudioSet
Thomas Pellegrini, Ismail Khalfaoui-Hassani, Etienne Labbé et al.
Adapting an Unadaptable ASR System
Rao Ma, Mengjie Qian, Mark J. F. Gales et al.
Adapting Language-Audio Models as Few-Shot Audio Learners
Jinhua Liang, Xubo Liu, Haohe Liu et al.