Research Explorer

Deep Speech Synthesis from Articulatory Representations

Peter Wu, Shinji Watanabe, Louis Goldstein et al.

2022 INTERSPEECH

Deep Transductive Transfer Regression Network for Cross-Corpus Speech Emotion Recognition

Yan Zhao, Jincen Wang, Ru Ye et al.

2022 INTERSPEECH

Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models

Takanori Ashihara, Takafumi Moriya, Kohei Matsuura et al.

2022 INTERSPEECH

Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser

Sonal Joshi, Saurabh Kataria, Yiwen Shao et al.

2022 INTERSPEECH

Deformable CNN and Imbalance-Aware Feature Learning for Singing Technique Classification

Yuya Yamamoto, Juhan Nam, Hiroko Terasawa

2022 INTERSPEECH

DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition

Jiamin Xie, John H.L. Hansen

2022 INTERSPEECH

DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion

Ruibin Yuan, Yuxuan Wu, Jacob Li et al.

2022 INTERSPEECH

Deliberation Model for On-Device Spoken Language Understanding

Duc Le, Akshat Shrivastava, Paden D. Tomasello et al.

2022 INTERSPEECH

DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders

Yanqing Liu, Ruiqing Xue, Lei He et al.

2022 INTERSPEECH

Densely-connected Convolutional Recurrent Network for Fundamental Frequency Estimation in Noisy Speech

Yixuan Zhang, Heming Wang, DeLiang Wang

2022 INTERSPEECH

DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition

Zixun Guo, Chen Chen, Eng Siong Chng

2022 INTERSPEECH

Design Guidelines for Inclusive Speaker Verification Evaluation Datasets

Wiebke Toussaint, Lauriane Gorce, Aaron Yi Ding

2022 INTERSPEECH

Detecting Dysfluencies in Stuttering Therapy Using wav2vec 2.0

Sebastian Peter Bayerl, Dominik Wagner, Elmar Noeth et al.

2022 INTERSPEECH

Detecting Heart Failure Through Voice Analysis using Self-Supervised Mode-Based Memory Fusion

Darshana Priyasad, Andi Partovi, Sridha Sridharan et al.

2022 INTERSPEECH

Detecting Unintended Memorization in Language-Model-Fused ASR

W. Ronny Huang, Steve Chien, Om Dipakbhai Thakkar et al.

2022 INTERSPEECH

Detection of Learners' Listening Breakdown with Oral Dictation and Its Use to Model Listening Skill Improvement Exclusively Through Shadowing

Takuya Kunihara, Chuanbo Zhu, Daisuke Saito et al.

2022 INTERSPEECH

DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utterances

Sreyan Ghosh, Samden Lepcha, S Sakshi et al.

2022 INTERSPEECH

Development of allophonic realization until adolescence: A production study of the affricate-fricative variation of /z/ among Japanese children

Sanae Matsui, Kyoji Iwamoto, Reiko Mazuka

2022 INTERSPEECH

Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models

Vineet Garg, Ognjen Rudovic, Pranay Dighe et al.

2022 INTERSPEECH

DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design

Bei Liu, Zhengyang Chen, Shuai Wang et al.

2022 INTERSPEECH

Dialogue Acts Aided Important Utterance Detection Based on Multiparty and Multimodal Information

Fumio Nihei, Ryo Ishii, Yukiko Nakano et al.

2022 INTERSPEECH

Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition

Hai-tao Xu, Jie Zhang, Li-rong Dai

2022 INTERSPEECH

Diffusion Generative Vocoder for Fullband Speech Synthesis Based on Weak Third-order SDE Solver

Hideyuki Tachibana, Muneyoshi Inahara, Mocho Go et al.

2022 INTERSPEECH

Directed speech separation for automatic speech recognition of long form conversational speech

Rohit Paturi, Sundararajan Srinivasan, Katrin Kirchhoff et al.

2022 INTERSPEECH

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments

Yicheng Du, Aditya Arie Nugraha, Kouhei Sekiguchi et al.

2022 INTERSPEECH

Papers