
Novel-view Acoustic Synthesis From 3D Reconstructed Rooms

Byeongjoo Ahn, Karren Yang, Brian Hamilton et al.

2024 INTERSPEECH

NumberLie: a game-based experiment to understand the acoustics of deception and truthfulness

Alessandro De Luca, Andrew Clark, Volker Dellwo

2024 INTERSPEECH

OCEAN-AI: open multimodal framework for personality traits assessment and HR-processes automatization

Elena Ryumina, Dmitry Ryumin, Alexey Karpov

2024 INTERSPEECH

On Calibration of Speech Classification Models: Insights from Energy-Based Model Investigations

Yaqian Hao, Chenguang Hu, Yingying Gao et al.

2024 INTERSPEECH

Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment

Christoph Boeddeker, Tobias Cord-Landwehr, Reinhold Haeb-Umbach

2024 INTERSPEECH

On Comparing Time- and Frequency-Domain Rhythm Measures in Classifying Assamese Dialects

Joyshree Chakraborty, Leena Dihingia, Priyankoo Sarmah et al.

2024 INTERSPEECH

On Disfluency and Non-lexical Sound Labeling for End-to-end Automatic Speech Recognition

Peter Mihajlik, Yan Meng, Mate S Kadar et al.

2024 INTERSPEECH

One-class learning with adaptive centroid shift for audio deepfake detection

Hyun Myung Kim, Kangwook Jang, Hoirin Kim

2024 INTERSPEECH

One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model

Zhaoqing Li, Haoning Xu, Tianzi Wang et al.

2024 INTERSPEECH

On Improving Error Resilience of Neural End-to-End Speech Coders

Kishan Gupta, Nicola Pia, Srikanth Korse et al.

2024 INTERSPEECH

Online Knowledge Distillation of Decoder-Only Large Language Models for Efficient Speech Recognition

Jeehye Lee, Hyeji Seo

2024 INTERSPEECH

Online Subloop Search via Uncertainty Quantization for Efficient Test-Time Adaptation

Jae-Hong Lee, Sang-Eon Lee, Dong-Hyun Kim et al.

2024 INTERSPEECH

On the calibration of powerset speaker diarization models

Alexis Plaquet, Hervé Bredin

2024 INTERSPEECH

On the Effectiveness of Acoustic BPE in Decoder-Only TTS

Bohan Li, Feiyu Shen, Yiwei Guo et al.

2024 INTERSPEECH

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Jinchuan Tian, Yifan Peng, William Chen et al.

2024 INTERSPEECH

On the Encoding of Gender in Transformer-based ASR Representations

Aravind Krishnan, Badr M. Abdullah, Dietrich Klakow

2024 INTERSPEECH

On the impact of several regularization techniques on label noise robustness of self-supervised speaker verification systems

Abderrahim Fathan, Xiaolin Zhu, Jahangir Alam

2024 INTERSPEECH

On The Performance of EMA-synchronized Speech and Stand-alone Speech in Acoustic-to-articulatory Inversion

Qiang Fang

2024 INTERSPEECH

On the relationship between speech production and vocabulary size in 3-5 year olds

Alexis DeMaere, Nicole van Rootselaar, Fangfang Li et al.

2024 INTERSPEECH

On the social bias of speech self-supervised models

Yi-Cheng Lin, Tzu-Quan Lin, Hsi-Che Lin et al.

2024 INTERSPEECH

On the Success and Limitations of Auxiliary Network Based Word-Level End-to-End Neural Speaker Diarization

Yiling Huang, Weiran Wang, Guanlong Zhao et al.

2024 INTERSPEECH

On the Usefulness of Speaker Embeddings for Speaker Retrieval in the Wild: A Comparative Study of x-vector and ECAPA-TDNN Models

Erfan Loweimi, Mengjie Qian, Kate Knill et al.

2024 INTERSPEECH

On the Use of Plausible Arguments in Explainable Conversational AI

Martina Di Bratto, Maria Di Maro, Antonio Origlia

2024 INTERSPEECH

Optical Flow Guided Tongue Trajectory Generation for Diffusion-based Acoustic to Articulatory Inversion

Yudong Yang, Rongfeng Su, Rukiye Ruzi et al.

2024 INTERSPEECH

Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies

Chung-Wen Wu, Berlin Chen

2024 INTERSPEECH
