Papers
Towards Reference Speech Characterization for Health Applications
Catarina Botelho, Alberto Abad, Tanja Schultz et al.
Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio
Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain
Towards Robust FastSpeech 2 by Modelling Residual Multimodality
Fabian Kögel, Bac Nguyen, Fabien Cardinaux
Towards robust paralinguistic assessment for real-world mobile health (mHealth) monitoring: an initial study of reverberation effects on speech
Judith Dineley, Ewan Carr, Faith Matcham et al.
Towards Single Integrated Spoofing-aware Speaker Verification Embeddings
Sung Hwan Mun, Hye-jin Shim, Hemlata Tak et al.
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis
Weiqin Li, Shun Lei, Qiaochu Huang et al.
Towards Supporting an Early Diagnosis of Multiple Sclerosis using Vocal Features
Monica Gonzalez-Machorro, Pascal Hecker, Uwe D. Reichel et al.
Towards Two-point Neuron-inspired Energy-efficient Multimodal Open Master Hearing Aid
Mohsin Raza, Adewale Adetomi, Khubaib Ahmed et al.
Towards Ultrasound Tongue Image prediction from EEG during speech production
Tamás Gábor Csapó, Frigyes Viktor Arthur, Péter Nagy et al.
Tracking Must Go On : Dialogue State Tracking with Verified Self-Training
Jihyun Lee, Chaebin Lee, Yunsu Kim et al.
Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model
Mana Ihori, Hiroshi Sato, Tomohiro Tanaka et al.
Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection
Yizhou Tan, Haojun Ai, Shengchen Li et al.
Transfer Learning for Personality Perception via Speech Emotion Recognition
Yuanchao Li, Peter Bell, Catherine Lai
Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization
Kohei Matsuura, Takanori Ashihara, Takafumi Moriya et al.
Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis
Tanuka Bhattacharjee, Anjali Jayakumar, Yamini Belur et al.
Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech
Jan Lehečka, Jan Švec, Josef V. Psutka et al.
Transforming the Embeddings: A Lightweight Technique for Speech Emotion Recognition Tasks
Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma
Transvelar Nasal Coupling Contributing to Speaker Characteristics in Non-nasal Vowels
Ziyu Zhu, Yujie Chi, Zhao Zhang et al.
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition
Hongfei Xue, Qijie Shao, Peikun Chen et al.
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang et al.
Tri-level Joint Natural Language Understanding for Multi-turn Conversational Datasets
Henry Weld, Sijia Hu, Siqu Long et al.
Turbo your multi-modal classification with contrastive learning
Zhiyu Zhang, Da Liu, Shengqiang Liu et al.
Two Stage Contextual Word Filtering for Context Bias in Unified Streaming and Non-streaming Transducer
Zhanheng Yang, Sining Sun, Xiong Wang et al.
Two-stage Finetuning of Wav2vec 2.0 for Speech Emotion Recognition with ASR and Gender Pretraining
Yuan Gao, Chenhui Chu, Tatsuya Kawahara
Two-Stage Voice Anonymization for Enhanced Privacy
Francesco Nespoli, Daniel Barreda, Jöerg Bitzer et al.