Papers - Conftrace

PodcastMix: A dataset for separating music and speech in podcasts

Nicolás Schmidt, Jordi Pons, Marius Miron

2022 INTERSPEECH

PoeticTTS - Controllable Poetry Reading for Literary Studies

Julia Koch, Florian Lux, Nadja Schauffler et al.

2022 INTERSPEECH

Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection

Hira Dhamyal, Bhiksha Raj, Rita Singh

2022 INTERSPEECH

Practical Over-the-air Perceptual AcousticWatermarking

Ameya Agaskar

2022 INTERSPEECH

Predicting Emotional Intensity in Political Debates via Non-verbal Signals

Jeewoo Yoon, Jinyoung Han, Erik Bucy et al.

2022 INTERSPEECH

Predicting label distribution improves non-intrusive speech quality estimation

Abu Zaher Md Faridee, Hannes Gamper

2022 INTERSPEECH

Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks

Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts et al.

2022 INTERSPEECH

Predicting Speech Intelligibility using the Spike Acativity Mutual Information Index

Franklin Alvarez Cardinale, Waldo Nogueira

2022 INTERSPEECH

Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis

Wataru Nakata, Tomoki Koriyama, Shinnosuke Takamichi et al.

2022 INTERSPEECH

Prediction of L2 speech proficiency based on multi-level linguistic features

Verdiana De Fino, Lionel Fontan, Julien Pinquier et al.

2022 INTERSPEECH

Pre-trained Speech Representations as Feature Extractors for Speech Quality Assessment in Online Conferencing Applications

Bastiaan Tamm, Helena Balabin, Rik Vandenberghe et al.

2022 INTERSPEECH

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data

Junyi Ao, Ziqiang Zhang, Long Zhou et al.

2022 INTERSPEECH

Preventing sensitive-word recognition using self-supervised learning to preserve user-privacy for automatic speech recognition

Yuchen Liu, Apu Kapadia, Donald Williamson

2022 INTERSPEECH

PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification

Siqi Zheng, Hongbin Suo, Qian Chen

2022 INTERSPEECH

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings

Niko Brummer, Albert Swart, Ladislav Mosner et al.

2022 INTERSPEECH

Probing phoneme, language and speaker information in unsupervised speech representations

Maureen de Seyssel, Marvin Lavechin, Yossi Adi et al.

2022 INTERSPEECH

Probing speech emotion recognition transformers for linguistic knowledge

Andreas Triantafyllopoulos, Johannes Wagner, Hagen Wierstorf et al.

2022 INTERSPEECH

Production characteristics of obstruents in WaveNet and older TTS systems

Ayushi Pandey, Sébastien Le Maguer, Julie Carson-Berndsen et al.

2022 INTERSPEECH

Production federated keyword spotting via distillation, filtering, and joint federated-centralized training

Andrew Hard, Kurt Partridge, Neng Chen et al.

2022 INTERSPEECH

Production Strategies of Vocal Attitudes

Léane Salais, Pablo Arias, Clément Le Moine et al.

2022 INTERSPEECH

Prompt-based Re-ranking Language Model for ASR

Mengxi Nie, Ming Yan, Caixia Gong

2022 INTERSPEECH

Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations

Chang Liu, Zhen-Hua Ling, Ling-Hui Chen

2022 INTERSPEECH

Prosodic alignment for off-screen automatic dubbing

Yogesh Virkar, Marcello Federico, Robert Enyedi et al.

2022 INTERSPEECH

Prosodic Information in Dialect Identification of a Tonal Language: The case of Ao

Moakala Tzudir, Priyankoo Sarmah, S R Mahadeva Prasanna

2022 INTERSPEECH

Prototypical speaker-interference loss for target voice separation using non-parallel audio samples

Seongkyu Mun, Dhananjaya Gowda, Jihwan Lee et al.

2022 INTERSPEECH