Carlos Busso
33 papers · 2016–2026 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (16) π Academic Marathon (8)
π
Interdisciplinary Bridge
π
Academic Marathon
(8)
π§
Keyword Pioneer
π
Conference Loyalist
(32)
π¬
Deep Specialist
(11)
π
Keyword Champion
(7)
ποΈ
Keyword Collector
(134)
π
Conference Pioneer
π
Trend Setter
π
Century Club
(32)
π₯
Unstoppable
(9)
β‘
Prolific Year
(5)
Conferences
INTERSPEECH (32)
AAAI (1)
Top co-authors
Keywords
speech emotion recognition
(16)
deep neural network
(4)
preference learning
(4)
long short-term memory
(3)
emotion recognition
(3)
multimodal learning
(3)
teacher-student learning
(2)
bidirectional lstm
(2)
voice activity detection
(2)
noise robustness
(2)
semi-supervised learning
(2)
recurrent neural network
(2)
self-supervised learning
(2)
emotion classification
(2)
representation learning
(2)
multi-label classification
(2)
voice conversion
(2)
knowledge distillation
(2)
domain adaptation
(2)
speech enhancement
(2)
Papers
RankList β a Listwise Preference Learning Framework for Predicting Subjective Preferences
AAAI 2026
A Layer-Anchoring Strategy for Enhancing Cross-Lingual Speech Emotion Recognition
INTERSPEECH 2024
Bridging Emotions Across Languages: Low Rank Adaptation for Multilingual Speech Emotion Recognition
INTERSPEECH 2024
Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
INTERSPEECH 2024
Keep, Delete, or Substitute: Frame Selection Strategy for Noise-Robust Speech Emotion Recognition
INTERSPEECH 2024
Speech emotion recognition with deep learning beamforming on a distant human-robot interaction scenario
INTERSPEECH 2024
WHiSER: White House Tapes Speech Emotion Recognition Corpus
INTERSPEECH 2024
Unsupervised Domain Adaptation for Speech Emotion Recognition using K-Nearest Neighbors Voice Conversion
INTERSPEECH 2024
Computation and Memory Efficient Noise Adaptation of Wav2Vec2.0 for Noisy Speech Emotion Recognition with Skip Connection Adapters
INTERSPEECH 2023
Distant Speech Emotion Recognition in an Indoor Human-robot Interaction Scenario
INTERSPEECH 2023
The Importance of Calibration: Rethinking Confidence and Performance of Speech Multi-label Emotion Classifiers
INTERSPEECH 2023
Preference Learning Labels by Anchoring on Consecutive Annotations
INTERSPEECH 2023
Exploiting Co-occurrence Frequency of Emotions in Perceptual Evaluations To Train A Speech Emotion Classifier
INTERSPEECH 2022
Improving Speech Emotion Recognition Using Self-Supervised Learning with Domain-Specific Audiovisual Tasks
INTERSPEECH 2022
Separation of Emotional and Reconstruction Embeddings on Ladder Network to Improve Speech Emotion Recognition Robustness in Noisy Conditions
INTERSPEECH 2021
Voice Activity Detection with Teacher-Student Domain Emulation
INTERSPEECH 2021
An Efficient Temporal Modeling Approach for Speech Emotion Recognition by Mapping Varied Duration Sentences into Fixed Number of Chunks
INTERSPEECH 2020
Ensemble of Students Taught by Probabilistic Teachers to Improve Speech Emotion Recognition
INTERSPEECH 2020
The MSP-Conversation Corpus
INTERSPEECH 2020
Speech Emotion Recognition with a Reject Option
INTERSPEECH 2019
Preference-Learning with Qualitative Agreement for Sentence Level Emotional Annotations
INTERSPEECH 2018
Ladder Networks for Emotion Recognition: Using Unsupervised Auxiliary Tasks to Improve Predictions of Emotional Attributes
INTERSPEECH 2018
Audiovisual Speech Activity Detection with Advanced Long Short-Term Memory
INTERSPEECH 2018
Predicting Categorical Emotions by Jointly Learning Primary and Secondary Emotions through Multitask Learning
INTERSPEECH 2018
Role of Regularization in the Prediction of Valence from Speech
INTERSPEECH 2018
Jointly Predicting Arousal, Valence and Dominance with Multi-Task Learning
INTERSPEECH 2017
Bimodal Recurrent Neural Network for Audiovisual Voice Activity Detection
INTERSPEECH 2017
A Stepwise Analysis of Aggregated Crowdsourced Labels Describing Multimodal Emotional Behaviors
INTERSPEECH 2017
A Portable Automatic PA-TA-KA Syllable Detection System to Derive Biomarkers for Neurological Disorders
INTERSPEECH 2016
Defining Emotionally Salient Regions Using Qualitative Agreement Method
INTERSPEECH 2016
Improving Boundary Estimation in Audiovisual Speech Activity Detection Using Bayesian Information Criterion
INTERSPEECH 2016
Retrieving Categorical Emotions Using a Probabilistic Framework to Define Preference Learning Samples
INTERSPEECH 2016
Head Motion Generation with Synthetic Speech: A Data Driven Approach
INTERSPEECH 2016