Research Explorer

Accentor: An Explicit Lexical Stress Model for TTS Systems

Diana Geneva, Georgi Shopov, Kostadin Garov et al.

2023 INTERSPEECH

Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System

Xian Shi, Haoneng Luo, Zhifu Gao et al.

2023 INTERSPEECH

Accurate and Structured Pruning for Efficient Automatic Speech Recognition

Huiqiang Jiang, Li Lyna Zhang, Yuang Li et al.

2023 INTERSPEECH

A Compact End-to-End Model with Local and Global Context for Spoken Language Identification

Fei Jia, Nithin Rao Koluguri, Jagadeesh Balam et al.

2023 INTERSPEECH

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

Yifan Peng, Kwangyoun Kim, Felix Wu et al.

2023 INTERSPEECH

A Compressed Synthetic Speech Detection Method with Compression Feature Embedding

Jinghong Zhang, Xiaowei Yi, Xianfeng Zhao

2023 INTERSPEECH

A conformer-based classifier for variable-length utterance processing in anti-spoofing

Eros Rosello, Alejandro Gomez-Alanis, Angel M. Gomez et al.

2023 INTERSPEECH

A Context-Constrained Sentence Modeling for Deception Detection in Real Interrogation

Ya-Tse Wu, Yuan-Ting Chang, Shao-Hao Lu et al.

2023 INTERSPEECH

Acoustic characteristics of depression in older adults' speech: the role of covariates

Carmen Mijnders, Esther Janse, Paul Naarding et al.

2023 INTERSPEECH

Acoustic cues to stress perception in Spanish – a mismatch negativity study

Karolina Broś

2023 INTERSPEECH

Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /ɹ/ in Child Speech Sound Disorders

Nina R Benway, Yashish M Siriwardena, Jonathan L Preston et al.

2023 INTERSPEECH

Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling

Ramon Sanabria, Ondřej Klejch, Hao Tang et al.

2023 INTERSPEECH

Active Learning for Abnormal Lung Sound Data Curation and Detection in Asthma

Shabnam Ghaffarzadegan, Luca Bondi, Ho-Hsiang Wu et al.

2023 INTERSPEECH

AdaMS: Deep Metric Learning with Adaptive Margin and Adaptive Scale for Acoustic Word Discrimination

Myunghun Jung, Hoirin Kim

2023 INTERSPEECH

Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

Guy Yariv, Itai Gat, Lior Wolf et al.

2023 INTERSPEECH

Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks

László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya et al.

2023 INTERSPEECH

Adaptation of Whisper models to child speech recognition

Rishabh Jain, Andrei Barcovschi, Mariam Yiwere et al.

2023 INTERSPEECH

Adaptation to predictive prosodic cues in non-native standard dialect

Sabine Gosselke Berthelsen

2023 INTERSPEECH

Adapter-Based Extension of Multi-Speaker Text-To-Speech Model for New Speakers

Cheng-Ping Hsieh, Subhankar Ghosh, Boris Ginsburg

2023 INTERSPEECH

Adapter Incremental Continual Learning of Efficient Audio Spectrogram Transformers

Nithish Muthuchamy Selvaraj, Xiaobao Guo, Adams Kong et al.

2023 INTERSPEECH

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation

Ambuj Mehrish, Abhinav Ramesh Kashyap, Li Yingting et al.

2023 INTERSPEECH

Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition

Dianwen Ng, Chong Zhang, Ruixi Zhang et al.

2023 INTERSPEECH

Adapting a ConvNeXt Model to Audio Classification on AudioSet

Thomas Pellegrini, Ismail Khalfaoui-Hassani, Etienne Labbé et al.

2023 INTERSPEECH

Adapting an Unadaptable ASR System

Rao Ma, Mengjie Qian, Mark J. F. Gales et al.

2023 INTERSPEECH

Adapting Language-Audio Models as Few-Shot Audio Learners

Jinhua Liang, Xubo Liu, Haohe Liu et al.

2023 INTERSPEECH

Papers