Björn Schuller
69 papers · 2016–2026 · 9 conferences · across top CS/AI conferences
Achievements
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🗺️ Taxonomy Completionist (19)
🌈 Renaissance Researcher (5)
🌍 Conference Polyglot (9)
🌟 Keyword Trendsetter Combo (9)
🏠 Conference Loyalist (56)
👥 Mega-Team (22)
🔬 Deep Specialist (14)
🤝 Dynamic Duo (13)
🏆 Keyword Champion (2)
⚡ Prolific Year (16)
❓ The Questioner (6)
🚀 Conference Pioneer
🗃️ Keyword Collector (79)
💎 Century Club (68)
📈 Trend Setter
Conferences
INTERSPEECH (56)
ACL (3)
IJCAI (3)
JMLR (2)
AAAI (1)
EACL (1)
ECCV (1)
ICCV (1)
IJCNLP (1)
Research topics
Keywords
emotion recognition (11)
speech analysis (11)
speech emotion recognition (8)
support vector machine (8)
recurrent neural network (7)
convolutional neural network (6)
bidirectional long short-term memory (5)
acoustic feature (5)
audio classification (5)
speech processing (4)
long short-term memory (4)
speech corpus (4)
deception detection (4)
feature extraction (4)
transfer learning (3)
binary classification (3)
deep learning (3)
autism spectrum disorder (3)
data augmentation (3)
sentiment analysis (3)
Papers
Discovering and Causally Validating Emotion-Sensitive Neurons in Large Audio-Language Models
ACL 2026
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis
AAAI 2025
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
ACL 2024
EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation
ECCV 2024
Hierarchical Distribution Adaptation for Unsupervised Cross-corpus Speech Emotion Recognition
INTERSPEECH 2024
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition
INTERSPEECH 2024
MFDR: Multiple-stage Fusion and Dynamically Refined Network for Multimodal Emotion Recognition
INTERSPEECH 2024
Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation
INTERSPEECH 2024
This Paper Had the Smartest Reviewers - Flattery Detection Utilising an Audio-Textual Transformer-Based Approach
INTERSPEECH 2024
Neural Compression Augmentation for Contrastive Audio Representation Learning
INTERSPEECH 2024
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition
INTERSPEECH 2024
“So . . . my child . . . ” – How Child ADHD Influences the Way Parents Talk
INTERSPEECH 2024
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition
INTERSPEECH 2024
ParaCLAP – Towards a general language-audio model for computational paralinguistic tasks
INTERSPEECH 2024
Sustained Vowels for Pre- vs Post-Treatment COPD Classification
INTERSPEECH 2024
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
INTERSPEECH 2024
Multi-Type Outer Product-Based Fusion of Respiratory Sounds for Detecting COVID-19
INTERSPEECH 2022
Data Augmentation for Dementia Detection in Spoken Language
INTERSPEECH 2022
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement
INTERSPEECH 2022
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
INTERSPEECH 2022
Uncertainty Aware Review Hallucination for Science Article Classification
IJCNLP 2021
Uncertainty Aware Review Hallucination for Science Article Classification
ACL 2021
A Walkthrough for the Principle of Logit Separation
IJCAI 2019
Annotator Trustability-based Cooperative Learning Solutions for Intelligent Audio Analysis
INTERSPEECH 2018
Categorical vs Dimensional Perception of Italian Emotional Speech
INTERSPEECH 2018
auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks
JMLR 2018
The INTERSPEECH 2018 Computational Paralinguistics Challenge: Atypical & Self-Assessed Affect, Crying & Heart Beats
INTERSPEECH 2018
Evolving Learning for Analysing Mood-Related Infant Vocalisation
INTERSPEECH 2018
State of Mind: Classification through Self-reported Affect and Word Use in Speech
INTERSPEECH 2018
Towards Temporal Modelling of Categorical Speech Emotion Recognition
INTERSPEECH 2018
Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks
INTERSPEECH 2018
Automated Classification of Children’s Linguistic versus Non-Linguistic Vocalisations
INTERSPEECH 2018
The Perception and Analysis of the Likeability and Human Likeness of Synthesized Speech
INTERSPEECH 2018
Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech
INTERSPEECH 2018
How Did You like 2017? Detection of Language Markers of Depression and Narcissism in Personal Narratives
INTERSPEECH 2018
Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World
INTERSPEECH 2017
Implementing Gender-Dependent Vowel-Level Analysis for Boosting Speech-Based Depression Recognition
INTERSPEECH 2017
Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach
INTERSPEECH 2017
Automatic Classification of Autistic Child Vocalisations: A Novel Database and Results
INTERSPEECH 2017
“Did you laugh enough today?” — Deep Neural Networks for Mobile and Wearable Laughter Trackers
INTERSPEECH 2017
Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective
INTERSPEECH 2017
Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings
INTERSPEECH 2017
Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation
INTERSPEECH 2017
The Perception of Emotions in Noisified Nonsense Speech
INTERSPEECH 2017
The INTERSPEECH 2017 Computational Paralinguistics Challenge: Addressee, Cold & Snoring
INTERSPEECH 2017
An ‘End-to-Evolution’ Hybrid Approach for Snore Sound Classification
INTERSPEECH 2017
Snore Sound Classification Using Image-Based Deep Spectrum Features
INTERSPEECH 2017
Discussion
INTERSPEECH 2017
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding
ICCV 2017
openXBOW -- Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit
JMLR 2017
Contextual Bidirectional Long Short-Term Memory Recurrent Neural Network Language Models: A Generative Approach to Sentiment Analysis
EACL 2017
The INTERSPEECH 2016 Computational Paralinguistics Challenge: A Summary of Results
INTERSPEECH 2016
Convolutional Neural Networks with Data Augmentation for Classifying Speakers’ Native Language
INTERSPEECH 2016
The Native Language Sub-Challenge: The Data
INTERSPEECH 2016
Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective
INTERSPEECH 2016
The Sincerity Sub-Challenge: The Data
INTERSPEECH 2016
Is Deception Emotional? An Emotion-Driven Predictive Approach
INTERSPEECH 2016
The Deception Sub-Challenge: The Data
INTERSPEECH 2016
The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language
INTERSPEECH 2016
Does She Speak RTT? Towards an Earlier Identification of Rett Syndrome Through Intelligent Pre-Linguistic Vocalisation Analysis
INTERSPEECH 2016
Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children
INTERSPEECH 2016
Real-Time Tracking of Speakers’ Emotions, States, and Traits on Mobile Platforms
INTERSPEECH 2016
At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech
INTERSPEECH 2016
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio
IJCAI 2016
Driver Frustration Detection from Audio and Video in the Wild
IJCAI 2016
Manual versus Automated: The Challenging Routine of Infant Vocalisation Segmentation in Home Videos to Study Neuro(mal)development
INTERSPEECH 2016
Enhancing Multilingual Recognition of Emotion in Speech by Language Identification
INTERSPEECH 2016
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks
INTERSPEECH 2016
Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion Utilizing Complex Many-to-Many Alignments
INTERSPEECH 2016