Papers
8,761 papers found
Comparing first spectral moment of Australian English /s/ between straight and gay voices using three analysis window sizes
Tünde Szalay, John Holik, Duy Duong Nguyen et al.
Comparing Hand-Crafted Features to Spectrograms for Autism Severity Estimation
Marina Eni, Ilan Dinstein, Yaniv Zigel
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Guangyan Zhang, Thomas Merritt, Sam Ribeiro et al.
Comparing Self-Supervised Pre-Training and Semi-Supervised Training for Speech Recognition in Languages with Weak Language Models
Léa-Marie Lam-Yee-Mui, Lucas Ondel Yang, Ondřej Klejch
Comparison of acoustic measures of dysphonia in Parkinson's disease and Huntington's disease: Effect of sex and speaking task
Michal Šimek, Tomáš Kouba, Michal Novotný et al.
Comparison of GIF- and SSL-based Features in Pathological-voice Detection
Akira Sasou, Yang Chen
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages
Andrew Rouditchenko, Sameer Khurana, Samuel Thomas et al.
Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think
Tina Raissi, Christoph Lüscher, Moritz Gunz et al.
Complex Image Generation SwinTransformer Network for Audio Denoising
Youshan Zhang, Jialu Li
Complex-valued neural networks for voice anti-spoofing
Nicolas M. Müller, Philip Sperl, Konstantin Böttinger
Composing Spoken Hints for Follow-on Question Suggestion in Voice Assistants
Pedro Faustini, Besnik Fetahu, Giuseppe Castellucci et al.
Compositional Generalization in Spoken Language Understanding
Avik Ray, Yilin Shen, Hongxia Jin
Compressed MoE ASR Model Based on Knowledge Distillation and Quantization
Yuping Yuan, Zhao You, Shulin Feng et al.
Computational modeling of auditory brainstem responses derived from modified speech
Tzu-Han Zoe Cheng, Paul Calamia
Computation and Memory Efficient Noise Adaptation of Wav2Vec2.0 for Noisy Speech Emotion Recognition with Skip Connection Adapters
Seong-Gyun Leem, Daniel Fulford, Jukka-Pekka Onnela et al.
Confidence-based Ensembles of End-to-End Speech Recognition Models
Igor Gitman, Vitaly Lavrukhin, Aleksandr Laptev et al.
Conformer-based Language Embedding with Self-Knowledge Distillation for Spoken Language Identification
Feng Wang, Lingyan Huang, Tao Li et al.
Conmer: Streaming Conformer Without Self-attention for Interactive Voice Assistants
Martin Radfar, Paulina Lyskawa, Brandon Trujillo et al.
Consonant-emphasis Method Incorporating Robust Consonant-section Detection to Improve Intelligibility of Bone-conducted speech
Yasufumi Uezu, Sicheng Wang, Teruki Toya et al.
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Yujia Xiao, Shaofei Zhang, Xi Wang et al.
Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
Kaixun Huang, Ao Zhang, Zhanheng Yang et al.
Contrastive Disentangled Learning for Memory-Augmented Transformer
Jen-Tzung Chien, Shang-En Li
Contrastive Learning Based ASR Robust Knowledge Selection For Spoken Dialogue System
Zhiyuan Zhu, Yusheng Liao, Yu Wang et al.
Contrastive Learning based Deep Latent Masking for Music Source Separation
Jihyun Kim, Hong-Goo Kang
Controllable Generation of Artificial Speaker Embeddings through Discovery of Principal Directions
Florian Lux, Pascal Tilli, Sarina Meyer et al.