Multimodal Learning

13,057 papers

Papers

Mixture of Orthogonal Sequences Made from Extended Time-Stretched Pulses Enables Measurement of Involuntary Voice Fundamental Frequency Response to Pitch Perturbation INTERSPEECH 2021

Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition INTERSPEECH 2021

Modeling the Effect of Military Oxygen Masks on Speech Characteristics INTERSPEECH 2021

Learning Speech Models from Multi-Modal Data INTERSPEECH 2021

Towards the Prediction of the Vocal Tract Shape from the Sequence of Phonemes to be Articulated INTERSPEECH 2021

Temporal Context in Speech Emotion Recognition INTERSPEECH 2021

Acted vs. Improvised: Domain Adaptation for Elicitation Approaches in Audio-Visual Emotion Recognition INTERSPEECH 2021

Applying TDNN Architectures for Analyzing Duration Dependencies on Speech Emotion Recognition INTERSPEECH 2021

Teacher-Student MixIT for Unsupervised and Semi-Supervised Speech Separation INTERSPEECH 2021

AvaTr: One-Shot Speaker Extraction with Transformers INTERSPEECH 2021

Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech INTERSPEECH 2021

Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder INTERSPEECH 2021

Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset INTERSPEECH 2021

Towards Automatic Speech to Sign Language Generation INTERSPEECH 2021

Exploring Emotional Prototypes in a High Dimensional TTS Latent Space INTERSPEECH 2021

A Preliminary Study on Discourse Prosody Encoding in L1 and L2 English Spontaneous Narratives INTERSPEECH 2021

A Cross-Dialectal Comparison of Apical Vowels in Beijing Mandarin, Northeastern Mandarin and Southwestern Mandarin: An EMA and Ultrasound Study INTERSPEECH 2021

Speech Perception and Loanword Adaptations: The Case of Copy-Vowel Epenthesis INTERSPEECH 2021

Context and Co-Text Influence on the Accuracy Production of Italian L2 Non-Native Sounds INTERSPEECH 2021

Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction INTERSPEECH 2021

Unsupervised Learning of Disentangled Speech Content and Style Representation INTERSPEECH 2021

Fake Audio Detection in Resource-Constrained Settings Using Microfeatures INTERSPEECH 2021

Coughing-Based Recognition of Covid-19 with Spatial Attentive ConvLSTM Recurrent Neural Networks INTERSPEECH 2021

The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech INTERSPEECH 2021

Speech Emotion Recognition via Multi-Level Cross-Modal Distillation INTERSPEECH 2021