Research Explorer

Sequence-to-Sequence Multi-Modal Speech In-Painting

Mahsa Kadkhodaei Elyaderani, Shahram Shirani

2023 INTERSPEECH

Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features

Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku

2023 INTERSPEECH

SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization

Changhun Kim, Joonhyung Park, Hajin Shim et al.

2023 INTERSPEECH

Short-term Extrapolation of Speech Signals Using Recursive Neural Networks in the STFT Domain

Maurice Oberhag, Daniel Neudek, Rainer Martin et al.

2023 INTERSPEECH

Show & Tell: Voice Activity Projection and Turn-taking

Erik Ekstedt, Gabriel Skantze

2023 INTERSPEECH

Silent Speech Recognition with Articulator Positions Estimated from Tongue Ultrasound and Lip Video

Rachel Beeson, Korin Richmond

2023 INTERSPEECH

Tzu-Han Zoe Cheng, Kuan-Lin Chen, Juliane Schubert et al.

2023 INTERSPEECH

SlothSpeech: Denial-of-service Attack Against Speech Recognition Models

Mirazul Haque, Rutvij Shah, Simin Chen et al.

2023 INTERSPEECH

Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness

Dianwen Ng, Yang Xiao, Jia Qi Yip et al.

2023 INTERSPEECH

Sociodemographic and Attitudinal Effects on Dialect Speakers’ Articulation of the Standard Language: Evidence from German-Speaking Switzerland

Carina Steiner, Dieter Studer-Joho, Corinne Lanthemann et al.

2023 INTERSPEECH

Some Voices are Too Common: Building Fair Speech Recognition Systems Using the CommonVoice Dataset

Lucas Maison, Yannick Estève

2023 INTERSPEECH

So-to-Speak: An Exploratory Platform for Investigating the Interplay between Style and Prosody in TTS

Éva Székely, Siyang Wang, Joakim Gustafson

2023 INTERSPEECH

SOT: Self-supervised Learning-Assisted Optimal Transport for Unsupervised Adaptive Speech Emotion Recognition

Ruiteng Zhang, Jianguo Wei, Xugang Lu et al.

2023 INTERSPEECH

Sp1NY: A Quick and Flexible Speech Visualisation Tool in Python

Sébastien Le Maguer, Mark Anderson, Naomi Harte

2023 INTERSPEECH

Spanish Phone Confusion Analysis for EMG-Based Silent Speech Interfaces

Inge Salomons, Eder del Blanco, Eva Navas et al.

2023 INTERSPEECH

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition

Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma et al.

2023 INTERSPEECH

Spatialization Quality Metric for Binaural Speech

Pranay Manocha, Israel Dejene Gebru, Anurag Kumar et al.

2023 INTERSPEECH

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning

Miguel Sarabia, Elena Menyaylenko, Alessandro Toso et al.

2023 INTERSPEECH

Speaker-Aware Anti-spoofing

Xuechen Liu, Md Sahidullah, Kong Aik Lee et al.

2023 INTERSPEECH

Speaker-aware Cross-modal Fusion Architecture for Conversational Emotion Recognition

Huan Zhao, Bo Li, Zixing Zhang

2023 INTERSPEECH

Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach

Midia Yousefi, Naoyuki Kanda, Dongmei Wang et al.

2023 INTERSPEECH

Speaker Embeddings as Individuality Proxy for Voice Stress Detection

Zihan Wu, Neil Scheidwasser-Clow, Karl El Hajal et al.

2023 INTERSPEECH

Speaker Extraction with Detection of Presence and Absence of Target Speakers

Ke Zhang, Marvin Borsdorf, Zexu Pan et al.

2023 INTERSPEECH

Speaker-independent neural formant synthesis

Pablo Pérez Zarazaga, Zofia Malisz, Gustav Eje Henter et al.

2023 INTERSPEECH

Speaker-independent Speech Inversion for Estimation of Nasalance

Yashish M Siriwardena, Carol Espy-Wilson, Suzanne Boyce et al.

2023 INTERSPEECH

Papers