Research Explorer

A Method of Audio-Visual Person Verification by Mining Connections between Time Series

Peiwen Sun, Shanshan Zhang, Zishan Liu et al.

2023 INTERSPEECH

A Metric-Driven Approach to Conformer Layer Pruning for Efficient ASR Inference

Dhanush Bekal, Karthik Gopalakrishnan, Karel Mundnich et al.

2023 INTERSPEECH

A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization

Edward Fish, Umberto Michieli, Mete Ozay

2023 INTERSPEECH

A More Accurate Internal Language Model Score Estimation for the Hybrid Autoregressive Transducer

Kyungmin Lee, Haeri Kim, Sichen Jin et al.

2023 INTERSPEECH

A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models

Pin-Jui Ku, Chao-Han Huck Yang, Sabato Siniscalchi et al.

2023 INTERSPEECH

A Multimodal Investigation of Speech, Text, Cognitive and Facial Video Features for Characterizing Depression With and Without Medication

Michael Neumann, Hardik Kothare, Doug Habberstad et al.

2023 INTERSPEECH

A multimodal prototypical approach for unsupervised sound classification

Saksham Singh Kushwaha, Magdalena Fuentes

2023 INTERSPEECH

A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting

Haotian Wang, Jun Du, Hengshun Zhou et al.

2023 INTERSPEECH

A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation

Xipin Wei, Junhui Chen, Zirui Zheng et al.

2023 INTERSPEECH

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

Tanmay Khandelwal, Rohan Kumar Das

2023 INTERSPEECH

An Acoustic Analysis of Fricative Variation in Three Accents of English

Roland Adams, Calbert Graham

2023 INTERSPEECH

Analysis and automatic prediction of exertion from speech: Contrasting objective and subjective measures collected while running

Andreas Triantafyllopoulos, Alexander Gebhard, Alexander Kathan et al.

2023 INTERSPEECH

Analysis of Acoustic information in End-to-End Spoken Language Translation

Gerard Sant, Carlos Escolano

2023 INTERSPEECH

Analysis of Mean Opinion Scores in Subjective Evaluation of Synthetic Speech Based on Tail Probabilities

Yusuke Yasuda, Tomoki Toda

2023 INTERSPEECH

An Analysis of Glottal Features of Chronic Kidney Disease Speech and Its Application to CKD Detection

Jihyun Mun, Sunhee Kim, Myeong Ju Kim et al.

2023 INTERSPEECH

An Analysis of Goodness of Pronunciation for Child Speech

Xinwei Cao, Zijian Fan, Torbjørn Svendsen et al.

2023 INTERSPEECH

An ASR-enabled Reading Tutor: Investigating Feedback to Optimize Interaction for Learning to Read

Yu Bai, Ferdy Hubers, Catia Cucchiarini et al.

2023 INTERSPEECH

An Automatic Multimodal Approach to Analyze Linguistic and Acoustic Cues on Parkinson's Disease Patients

Daniel Escobar-Grisales, Tomás Arias-Vergara, Cristian David Ríos-Urrego et al.

2023 INTERSPEECH

An Autoregressive Conversational Dynamics Model for Dialogue Systems

Matthew McNeill, Rivka Levitan

2023 INTERSPEECH

An Efficient and Noise-Robust Audiovisual Encoder for Audiovisual Speech Recognition

Zhengyang Li, Chenwei Liang, Timo Lohrenz et al.

2023 INTERSPEECH

An Efficient Approach for the Automated Segmentation and Transcription of the People's Speech Corpus

Astik Biswas, Abdelmoumene Boumadane, Stephane Peillon et al.

2023 INTERSPEECH

An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention

Junyu Wang

2023 INTERSPEECH

An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification

Yafeng Chen, Siqi Zheng, Hui Wang et al.

2023 INTERSPEECH

An Equitable Framework for Automatically Assessing Children's Oral Narrative Language Abilities

Alexander Johnson, Hariram Veeramani, Natarajan Balaji Shankar et al.

2023 INTERSPEECH

A neural architecture for selective attention to speech features

Nika Jurov, William Idsardi, Naomi H. Feldman

2023 INTERSPEECH
