Research Explorer

Towards Reference Speech Characterization for Health Applications

Catarina Botelho, Alberto Abad, Tanja Schultz et al.

2023 INTERSPEECH

Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio

Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain

2023 INTERSPEECH

Towards Robust FastSpeech 2 by Modelling Residual Multimodality

Fabian Kögel, Bac Nguyen, Fabien Cardinaux

2023 INTERSPEECH

Towards robust paralinguistic assessment for real-world mobile health (mHealth) monitoring: an initial study of reverberation effects on speech

Judith Dineley, Ewan Carr, Faith Matcham et al.

2023 INTERSPEECH

Towards Single Integrated Spoofing-aware Speaker Verification Embeddings

Sung Hwan Mun, Hye-jin Shim, Hemlata Tak et al.

2023 INTERSPEECH

Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis

Weiqin Li, Shun Lei, Qiaochu Huang et al.

2023 INTERSPEECH

Towards Supporting an Early Diagnosis of Multiple Sclerosis using Vocal Features

Monica Gonzalez-Machorro, Pascal Hecker, Uwe D. Reichel et al.

2023 INTERSPEECH

Towards Two-point Neuron-inspired Energy-efficient Multimodal Open Master Hearing Aid

Mohsin Raza, Adewale Adetomi, Khubaib Ahmed et al.

2023 INTERSPEECH

Towards Ultrasound Tongue Image prediction from EEG during speech production

Tamás Gábor Csapó, Frigyes Viktor Arthur, Péter Nagy et al.

2023 INTERSPEECH

Tracking Must Go On : Dialogue State Tracking with Verified Self-Training

Jihyun Lee, Chaebin Lee, Yunsu Kim et al.

2023 INTERSPEECH

Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model

Mana Ihori, Hiroshi Sato, Tomohiro Tanaka et al.

2023 INTERSPEECH

Transductive Feature Space Regularization for Few-shot Bioacoustic Event Detection

Yizhou Tan, Haojun Ai, Shengchen Li et al.

2023 INTERSPEECH

Transfer Learning for Personality Perception via Speech Emotion Recognition

Yuanchao Li, Peter Bell, Catherine Lai

2023 INTERSPEECH

Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization

Kohei Matsuura, Takanori Ashihara, Takafumi Moriya et al.

2023 INTERSPEECH

Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis

Tanuka Bhattacharjee, Anjali Jayakumar, Yamini Belur et al.

2023 INTERSPEECH

Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech

Jan Lehečka, Jan Švec, Josef V. Psutka et al.

2023 INTERSPEECH

Transforming the Embeddings: A Lightweight Technique for Speech Emotion Recognition Tasks

Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma

2023 INTERSPEECH

Transvelar Nasal Coupling Contributing to Speaker Characteristics in Non-nasal Vowels

Ziyu Zhu, Yujie Chi, Zhao Zhang et al.

2023 INTERSPEECH

TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition

Hongfei Xue, Qijie Shao, Peikun Chen et al.

2023 INTERSPEECH

TridentSE: Guiding Speech Enhancement with 32 Global Tokens

Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang et al.

2023 INTERSPEECH

Tri-level Joint Natural Language Understanding for Multi-turn Conversational Datasets

Henry Weld, Sijia Hu, Siqu Long et al.

2023 INTERSPEECH

Turbo your multi-modal classification with contrastive learning

Zhiyu Zhang, Da Liu, Shengqiang Liu et al.

2023 INTERSPEECH

Two Stage Contextual Word Filtering for Context Bias in Unified Streaming and Non-streaming Transducer

Zhanheng Yang, Sining Sun, Xiong Wang et al.

2023 INTERSPEECH

Two-stage Finetuning of Wav2vec 2.0 for Speech Emotion Recognition with ASR and Gender Pretraining

Yuan Gao, Chenhui Chu, Tatsuya Kawahara

2023 INTERSPEECH

Two-Stage Voice Anonymization for Enhanced Privacy

Francesco Nespoli, Daniel Barreda, Jöerg Bitzer et al.

2023 INTERSPEECH

Papers