Research Explorer

Low-bit Shift Network for End-to-End Spoken Language Understanding

Anderson R. Avila, Khalil Bibi, Rui Heng Yang et al.

2022 INTERSPEECH

Low-complex and Highly-performed Binary Residual Neural Network for Small-footprint Keyword Spotting

Xiao Wang, Song Cheng, Jun Li et al.

2022 INTERSPEECH

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation

Giulia Comini, Goeric Huybrechts, Manuel Sam Ribeiro et al.

2022 INTERSPEECH

Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers

Chiori Hori, Takaaki Hori, Jonathan Le Roux

2022 INTERSPEECH

Low-Level Physiological Implications of End-to-End Learning for Speech Recognition

Louise Coppieters de Gibson, Philip N. Garner

2022 INTERSPEECH

Low-resource Accent Classification in Geographically-proximate Settings: A Forensic and Sociophonetics Perspective

Qingcheng Zeng, Dading Chong, Peilin Zhou et al.

2022 INTERSPEECH

Low Resource Comparison of Attention-based and Hybrid ASR Exploiting wav2vec 2.0

Aku Rouhe, Anja Virkkunen, Juho Leinonen et al.

2022 INTERSPEECH

Low-resource Low-footprint Wake-word Detection using Knowledge Distillation

Arindam Ghosh, Mark Fuhs, Deblin Bagchi et al.

2022 INTERSPEECH

M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation

Jinming Zhao, Hao Yang, Gholamreza Haffari et al.

2022 INTERSPEECH

MAE-AST: Masked Autoencoding Audio Spectrogram Transformer

Alan Baade, Puyuan Peng, David Harwath

2022 INTERSPEECH

MAESTRO: Matched Speech Text Representations through Modality Matching

Zhehuai Chen, Yu Zhang, Andrew Rosenberg et al.

2022 INTERSPEECH

Mandarin Lombard Grid: a Lombard-grid-like corpus of Standard Chinese

Yuhong Yang, Xufeng Chen, Qingmu Liu et al.

2022 INTERSPEECH

Mandarin nasal place assimilation revisited: an acoustic study

Mingqiong Luo

2022 INTERSPEECH

Mandarin Tone Sandhi Realization: Evidence from Large Speech Corpora

Zuoyu Tian, Xiao Dong, Feier Gao et al.

2022 INTERSPEECH

MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

Ryandhimas Edo Zezario, Fei Chen, Chiou-Shann Fuh et al.

2022 INTERSPEECH

mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot Filling

Seong-Hwan Heo, WonKee Lee, Jong-Hyeok Lee

2022 INTERSPEECH

Membership Inference Attacks Against Self-supervised Speech Models

Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee

2022 INTERSPEECH

Memory-Efficient Multi-Step Speech Enhancement with Neural ODE

Jen-Hung Huang, Chung-Hsien Wu

2022 INTERSPEECH

Memory-Efficient Training of RNN-Transducer with Sampled Softmax

Jaesong Lee, Lukas Lee, Shinji Watanabe

2022 INTERSPEECH

Meta Auxiliary Learning for Low-resource Spoken Language Understanding

Yingying Gao, Junlan Feng, Chao Deng et al.

2022 INTERSPEECH

Method for improving the word intelligibility of presented speech using bone-conduction headphones

Teruki Toya, Wenyu Zhu, Maori Kobayashi et al.

2022 INTERSPEECH

MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification

Yang Zhang, Zhiqiang Lv, Haibin Wu et al.

2022 INTERSPEECH

Microphone Array Channel Combination Algorithms for Overlapped Speech Detection

Theo Mariotte, Anthony Larcher, Silvio Montrésor et al.

2022 INTERSPEECH

MIM-DG: Mutual information minimization-based domain generalization for speaker verification

Woohyun Kang, Md Jahangir Alam, Abderrahim Fathan

2022 INTERSPEECH

MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources

Haoran Yin, Meng Ge, Yanjie Fu et al.

2022 INTERSPEECH

Papers