
Learning neural audio features without supervision

Sarthak Yadav, Neil Zeghidour

2022 INTERSPEECH

Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers

Liumeng Xue, Shan Yang, Na Hu et al.

2022 INTERSPEECH

Learning to rank with BERT-based confidence models in ASR rescoring

Ting-Wei Wu, I-Fan Chen, Ankur Gandhe

2022 INTERSPEECH

Learning Under Label Noise for Robust Spoken Language Understanding systems

Anoop Kumar, Pankaj Kumar Sharma, Aravind Illa et al.

2022 INTERSPEECH

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR

Kun Wei, Yike Zhang, Sining Sun et al.

2022 INTERSPEECH

Leveraging Prosody for Punctuation Prediction of Spontaneous Speech

Yeonjin Cho, Sara Ng, Trang Tran et al.

2022 INTERSPEECH

Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation

Qianqian Dong, Fengpeng Yue, Tom Ko et al.

2022 INTERSPEECH

Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation

Xiaofei Wang, Dongmei Wang, Naoyuki Kanda et al.

2022 INTERSPEECH

Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism

Kak Soky, Sheng Li, Masato Mimura et al.

2022 INTERSPEECH

Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer

Shrutina Agarwal, Naoya Takahashi, Sriram Ganapathy

2022 INTERSPEECH

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation

Ye Jia, Yifan Ding, Ankur Bapna et al.

2022 INTERSPEECH

Lexical stress in Spanish word segmentation

Alvaro Martin Iturralde Zurita, Meghan Clayards

2022 INTERSPEECH

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Rui Wang, Qibing Bai, Junyi Ao et al.

2022 INTERSPEECH

Lightweight Full-band and Sub-band Fusion Network for Real Time Speech Enhancement

Zhuangqi Chen, Pingjian Zhang

2022 INTERSPEECH

Light-Weight Speaker Verification with Global Context Information

Miseul Kim, Zhenyu Piao, Seyun Um et al.

2022 INTERSPEECH

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

Qijie Shao, Jinghao Yan, Jian Kang et al.

2022 INTERSPEECH

Linguistically Informed Post-processing for ASR Error correction in Sanskrit

Rishabh Kumar, Devaraja Adiga, Rishav Ranjan et al.

2022 INTERSPEECH

Linguistic versus biological factors governing acoustic voice variation

Yoonjeong Lee, Jody Kreiman

2022 INTERSPEECH

Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition

Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee

2022 INTERSPEECH

Listening with Googlears: Low-Latency Neural Multiframe Beamforming and Equalization for Hearing Aids

Samuel Yang, Scott Wisdom, Chet Gnegy et al.

2022 INTERSPEECH

Listen only to me! How well can target speech extraction handle false alarms?

Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai et al.

2022 INTERSPEECH

Local Context-aware Self-attention for Continuous Sign Language Recognition

Ronglai Zuo, Brian Mak

2022 INTERSPEECH

Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features

Maximilian Karl Scharf, Sabine Hochmuth, Lena L.N. Wong et al.

2022 INTERSPEECH

Low-bit Shift Network for End-to-End Spoken Language Understanding

Anderson R. Avila, Khalil Bibi, Rui Heng Yang et al.

2022 INTERSPEECH

Low-complex and Highly-performed Binary Residual Neural Network for Small-footprint Keyword Spotting

Xiao Wang, Song Cheng, Jun Li et al.

2022 INTERSPEECH
