Papers - Conftrace

Learning Speech Models from Multi-Modal Data

Karen Livescu

2021 INTERSPEECH

Learning Speech Structure to Improve Time-Frequency Masks

Suliang Bu, Yunxin Zhao, Shaojun Wang et al.

2021 INTERSPEECH

Learning to Rank Microphones for Distant Speech Recognition

Samuele Cornell, Alessio Brutti, Marco Matassoni et al.

2021 INTERSPEECH

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech

Solène Evain, Ha Nguyen, Hang Le et al.

2021 INTERSPEECH

Leveraging ASR N-Best in Deep Entity Retrieval

Haoyu Wang, John Chen, Majid Laali et al.

2021 INTERSPEECH

Leveraging Non-Target Language Resources to Improve ASR Performance in a Target Language

Jayadev Billa

2021 INTERSPEECH

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition

Guodong Ma, Pengfei Hu, Jian Kang et al.

2021 INTERSPEECH

Leveraging Pre-Trained Language Model for Speech Sentiment Analysis

Suwon Shon, Pablo Brusco, Jing Pan et al.

2021 INTERSPEECH

Leveraging Real-Time MRI for Illuminating Linguistic Velum Action

Miran Oh, Dani Byrd, Shrikanth S. Narayanan

2021 INTERSPEECH

Leveraging Speaker Attribute Information Using Multi Task Learning for Speaker Verification and Diarization

Chau Luu, Peter Bell, Steve Renals

2021 INTERSPEECH

Leveraging the Uniformity Framework to Examine Crosslinguistic Similarity for Long-Lag Stops in Spontaneous Cantonese-English Bilingual Speech

Khia A. Johnson

2021 INTERSPEECH

Lexical Density Analysis of Word Productions in Japanese English Using Acoustic Word Embeddings

Shintaro Ando, Nobuaki Minematsu, Daisuke Saito

2021 INTERSPEECH

Lexical Entrainment and Intra-Speaker Variability in Cooperative Dialogues

Alla Menshikova, Daniil Kocharov, Tatiana Kachkovskaia

2021 INTERSPEECH

Lexical Modeling of ASR Errors for Robust Speech Translation

Giuseppe Martucci, Mauro Cettolo, Matteo Negri et al.

2021 INTERSPEECH

Librispeech Transducer Model with Internal Language Model Prior Correction

Albert Zeyer, André Merboldt, Wilfried Michel et al.

2021 INTERSPEECH

Lightweight Causal Transformer with Local Self-Attention for Real-Time Speech Enhancement

Koen Oostermeijer, Qing Wang, Jun Du

2021 INTERSPEECH

Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training

Kun Zhou, Berrak Sisman, Haizhou Li

2021 INTERSPEECH

LinearSpeech: Parallel Text-to-Speech with Linear Complexity

Haozhe Zhang, Zhihua Huang, Zengqiang Shang et al.

2021 INTERSPEECH

LiRA: Learning Visual Speech Representations from Audio Through Self-Supervision

Pingchuan Ma, Rodrigo Mira, Stavros Petridis et al.

2021 INTERSPEECH

Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End

Swayambhu Nath Ray, Minhua Wu, Anirudh Raju et al.

2021 INTERSPEECH

LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks

Huu-Kim Nguyen, Kihyuk Jeong, Seyun Um et al.

2021 INTERSPEECH

Live Subtitling for BigBlueButton with Open-Source Software

Robert Geislinger, Benjamin Milde, Timo Baumann et al.

2021 INTERSPEECH

Live TV Subtitling Through Respeaking

Aleš Pražák, Zdeněk Loose, Josef V. Psutka et al.

2021 INTERSPEECH

Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems

Victoria Mingote, Antonio Miguel, Alfonso Ortega et al.

2021 INTERSPEECH

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

W. Ronny Huang, Tara N. Sainath, Cal Peyser et al.

2021 INTERSPEECH