Research Explorer

Gradual Improvements Observed in Learners' Perception and Production of L2 Sounds Through Continuing Shadowing Practices on a Daily Basis

Takuya Kunihara, Chuanbo Zhu, Nobuaki Minematsu et al.

2022 INTERSPEECH

Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi

Anish Bhanushali, Grant Bridgman, Deekshitha G et al.

2022 INTERSPEECH

Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification

Long Chen, Yixiong Meng, Venkatesh Ravichandran et al.

2022 INTERSPEECH

Hear No Evil: Towards Adversarial Robustness of Automatic Speech Recognition via Multi-Task Learning

Nilaksh Das, Polo Chau

2022 INTERSPEECH

Hesitations in Urdu/Hindi: Distribution and Properties of Fillers & Silences

Farhat Jabeen, Simon Betz

2022 INTERSPEECH

Heterogeneous Target Speech Separation

Efthymios Tzinis, Gordon Wichern, Aswin Shanmugam Subramanian et al.

2022 INTERSPEECH

Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech

Jaesung Bae, Jinhyeok Yang, Taejun Bak et al.

2022 INTERSPEECH

Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session

Dehua Tao, Tan Lee, Harold Chui et al.

2022 INTERSPEECH

Hierarchical Tagger with Multi-task Learning for Cross-domain Slot Filling

Xiao Wei, Yuke Si, Shiquan Wang et al.

2022 INTERSPEECH

High level feature fusion in forensic voice comparison

Michael Carne, Yuko Kinoshita, Shunichi Ishihara

2022 INTERSPEECH

Homophone Disambiguation Profits from Durational Information

Barbara Schuppler, Emil Berger, Xenia Kogler et al.

2022 INTERSPEECH

How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR

Kazuma Iwamoto, Tsubasa Ochiai, Marc Delcroix et al.

2022 INTERSPEECH

How do our eyebrows respond to masks and whispering? The case of Persians

Nasim Mahdinazhad Sardhaei, Marzena Zygis, Hamid Sharifzadeh

2022 INTERSPEECH

How to Listen? Rethinking Visual Sound Localization

Ho-Hsiang Wu, Magdalena Fuentes, Prem Seetharaman et al.

2022 INTERSPEECH

Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS

Kenta Udagawa, Yuki Saito, Hiroshi Saruwatari

2022 INTERSPEECH

Humanizing bionic voice: interactive demonstration of aesthetic design and control factors influencing the devices assembly and waveshape engineering

Konrad Zieliński, Marek Grzelec, Martin Hagmüller

2022 INTERSPEECH

Human Sound Classification based on Feature Fusion Method with Air and Bone Conducted Signal

Liang Xu, Jing Wang, Lizhong Wang et al.

2022 INTERSPEECH

Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load

Gasser Elbanna, Alice Biryukov, Neil Scheidwasser-Clow et al.

2022 INTERSPEECH

HYU Submission for the SASV Challenge 2022: Reforming Speaker Embeddings with Spoofing-Aware Conditioning

Jeong-Hwan Choi, Joon-Young Yang, Ye-Rin Jeoung et al.

2022 INTERSPEECH

iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning

Kun Chen, Jun Wang, Feng Deng et al.

2022 INTERSPEECH

iDeepMMSE: An improved deep learning approach to MMSE speech and noise power spectrum estimation for speech enhancement

Minseung Kim, Hyungchan Song, Sein Cheong et al.

2022 INTERSPEECH

Idiosyncratic lingual articulation of American English /æ/ and /ɑ/ using network analysis

Carolina Lins Machado, Volker Dellwo, Lei He

2022 INTERSPEECH

Impact of Acoustic Event Tagging on Scene Classification in a Multi-Task Learning Framework

Rahil Parikh, Harshavardhan Sundar, Ming Sun et al.

2022 INTERSPEECH

Impact of Background Noise and Contribution of Visual Information in Emotion Identification by Native Mandarin Speakers

Minyue Zhang, Hongwei Ding

2022 INTERSPEECH

Impairment Representation Learning for Speech Quality Assessment

Lianwu Chen, Xinlei Ren, Xu Zhang et al.

2022 INTERSPEECH

Papers