Papers - Conftrace

Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts

Trang Tran, Mari Ostendorf

2021 INTERSPEECH

Assessment of von Mises-Bernoulli Deep Neural Network in Sound Source Localization

Katsutoshi Itoyama, Yoshiya Morimoto, Shungo Masaki et al.

2021 INTERSPEECH

AST: Audio Spectrogram Transformer

Yuan Gong, Yu-An Chung, James Glass

2021 INTERSPEECH

A Study into Pre-Training Strategies for Spoken Language Understanding on Dysarthric Speech

Pu Wang, Bagher BabaAli, Hugo Van hamme

2021 INTERSPEECH

A Study on Fine-Tuning wav2vec2.0 Model for the Task of Mispronunciation Detection and Diagnosis

Linkai Peng, Kaiqi Fu, Binghuai Lin et al.

2021 INTERSPEECH

A Systematic Review and Analysis of Multilingual Data Strategies in Text-to-Speech for Low-Resource Languages

Phat Do, Matt Coler, Jelske Dijkstra et al.

2021 INTERSPEECH

A Thousand Words are Worth More Than One Recording: Word-Embedding Based Speaker Change Detection

Or Haim Anidjar, Itshak Lapidot, Chen Hajaj et al.

2021 INTERSPEECH

Attention-Based Convolutional Neural Network for ASV Spoofing Detection

Hefei Ling, Leichao Huang, Junrui Huang et al.

2021 INTERSPEECH

Attention-Based Cross-Modal Fusion for Audio-Visual Voice Activity Detection in Musical Video Streams

Yuanbo Hou, Zhesong Yu, Xia Liang et al.

2021 INTERSPEECH

Attention-Based Keyword Localisation in Speech Using Visual Grounding

Kayode Olaleye, Herman Kamper

2021 INTERSPEECH

A Two-Stage Approach to Speech Bandwidth Extension

Ju Lin, Yun Wang, Kaustubh Kalgaonkar et al.

2021 INTERSPEECH

Audio Retrieval with Natural Language Queries

Andreea-Maria Oncescu, A. Sophia Koepke, João F. Henriques et al.

2021 INTERSPEECH

Audio Segmentation Based Conversational Silence Detection for Contact Center Calls

Krishnachaitanya Gogineni, Tarun Reddy Yadama, Jithendra Vepa

2021 INTERSPEECH

Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments

Hengshun Zhou, Jun Du, Hang Chen et al.

2021 INTERSPEECH

Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party

Yifei Wu, Chenda Li, Song Yang et al.

2021 INTERSPEECH

Audio-Visual Recognition of Emotional Engagement of People with Dementia

Lars Steinert, Felix Putze, Dennis Küster et al.

2021 INTERSPEECH

Audio-Visual Speech Emotion Recognition by Disentangling Emotion and Identity Attributes

Koichiro Ito, Takuya Fujioka, Qinghua Sun et al.

2021 INTERSPEECH

Audiovisual Transfer Learning for Audio Tagging and Sound Event Detection

Wim Boes, Hugo Van hamme

2021 INTERSPEECH

Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained Models

Haitao Lin, Lu Xiang, Yu Zhou et al.

2021 INTERSPEECH

A Universal Multi-Speaker Multi-Style Text-to-Speech via Disentangled Representation Learning Based on Rényi Divergence Minimization

Dipjyoti Paul, Sankar Mukherjee, Yannis Pantazis et al.

2021 INTERSPEECH

AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children’s Speech

Beena Ahmed, Kirrie J. Ballard, Denis Burnham et al.

2021 INTERSPEECH

Auto-KWS 2021 Challenge: Task, Datasets, and Baselines

Jingsong Wang, Yuxuan He, Chunyu Zhao et al.

2021 INTERSPEECH

Automated Detection of Voice Disorder in the Saarbrücken Voice Database: Effects of Pathology Subset and Audio Materials

Mark Huckvale, Catinca Buciuleac

2021 INTERSPEECH

Automatically Detecting Errors and Disfluencies in Read Speech to Predict Cognitive Impairment in People with Parkinson’s Disease

Amrit Romana, John Bandon, Matthew Perez et al.

2021 INTERSPEECH

Automatic Analysis of the Emotional Content of Speech in Daylong Child-Centered Recordings from a Neonatal Intensive Care Unit

Einari Vaaras, Sari Ahlqvist-Björkroth, Konstantinos Drossos et al.

2021 INTERSPEECH