Papers - Conftrace

Transducer-based language embedding for spoken language identification

Peng Shen, Xugang Lu, Hisashi Kawai

2022 INTERSPEECH

Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping

Jenthe Thienpondt, Kris Demuynck

2022 INTERSPEECH

Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus

Minchan Kim, Myeonghun Jeong, Byoung Jin Choi et al.

2022 INTERSPEECH

Transfer Learning from Multi-Lingual Speech Translation Benefits Low-Resource Speech Recognition

Geoffroy Vanderreydt, François REMY, Kris Demuynck

2022 INTERSPEECH

Transformer-Based Automatic Speech Recognition with Auxiliary Input of Source Language Text Toward Transcribing Simultaneous Interpretation

Shuta Taniguchi, Tsuneo Kato, Akihiro Tamura et al.

2022 INTERSPEECH

Transformer-based quality assessment model for generalized user-generated multimedia audio content

Deebha Mumtaz, Ajit Jena, Vinit Jakhetiya et al.

2022 INTERSPEECH

Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Muti-Person Video

Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

2022 INTERSPEECH

Transformer Networks for Non-Intrusive Speech Quality Prediction

M K Jayesh, Mukesh Sharma, Praneeth Vonteddu et al.

2022 INTERSPEECH

Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis

Raul Fernandez, David Haws, Guy Lorberbom et al.

2022 INTERSPEECH

Transport-Oriented Feature Aggregation for Speaker Embedding Learning

Yusheng Tian, Jingyu Li, Tan Lee

2022 INTERSPEECH

Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition

Guangzhi Sun, Chao Zhang, Phil Woodland

2022 INTERSPEECH

TRILLsson: Distilled Universal Paralinguistic Speech Representations

Joel Shor, Subhashini Venugopalan

2022 INTERSPEECH

TriniTTS: Pitch-controllable End-to-end TTS without External Aligner

Yooncheol Ju, Ilhwan Kim, Hongsun Yang et al.

2022 INTERSPEECH

TRUNet: Transformer-Recurrent-U Network for Multi-channel Reverberant Sound Source Separation

Ali Aroudi, Stefan Uhlich, Marc Ferras Font

2022 INTERSPEECH

TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder

Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon et al.

2022 INTERSPEECH

Turn-Taking Prediction for Natural Conversational Speech

Shuo-Yiin Chang, Bo Li, Tara Sainath et al.

2022 INTERSPEECH

Two Methods for Spoofing-Aware Speaker Verification: Multi-Layer Perceptron Score Fusion Model and Integrated Embedding Projector

Jungwoo Heo, Ju-Ho Kim, Hyun-seo Shin

2022 INTERSPEECH

Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems

Mingyu Cui, Jiajun Deng, Shoukang Hu et al.

2022 INTERSPEECH

Two-Pass Low Latency End-to-End Spoken Language Understanding

Siddhant Arora, Siddharth Dalmia, Xuankai Chang et al.

2022 INTERSPEECH

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers

Ali Siahkoohi, Michael Chinen, Tom Denton et al.

2022 INTERSPEECH

Uncertainty Calibration for Deep Audio Classifiers

Tong Ye, Shijing Si, Jianzong Wang et al.

2022 INTERSPEECH

UNet-DenseNet for Robust Far-Field Speaker Verification

Zhenke Gao, Manwai Mak, Weiwei Lin

2022 INTERSPEECH

Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation

Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda

2022 INTERSPEECH

Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)

Ariadna Sanchez, Alessio Falai, Ziyao Zhang et al.

2022 INTERSPEECH

Unifying Cosine and PLDA Back-ends for Speaker Verification

Zhiyuan Peng, Xuanji He, Ke Ding et al.

2022 INTERSPEECH