Björn Schuller

69 papers · 2016–2026 · 9 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (19) 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (9)

🌟 Keyword Trendsetter Combo (9) 🏠 Conference Loyalist (56) 👥 Mega-Team (22) 🔬 Deep Specialist (14) 🤝 Dynamic Duo (13) 🏆 Keyword Champion (2) ⚡ Prolific Year (16) ❓ The Questioner (6) 🚀 Conference Pioneer 🗃️ Keyword Collector (79) 💎 Century Club (68) 📈 Trend Setter

Conferences

INTERSPEECH (56) ACL (3) IJCAI (3) JMLR (2) AAAI (1) EACL (1) ECCV (1) ICCV (1) IJCNLP (1)

Top co-authors

Alice Baird (13) Anton Batliner (13) Nicholas Cummins (13) Shahin Amiriparian (11) Yue Zhang (8) Erik Marchi (7) Stefan Steidl (7) Maximilian Schmitt (6) Andreas Triantafyllopoulos (6) Zixing Zhang (6)

Research topics

Analysis (1)

Keywords

emotion recognition (11) speech analysis (11) speech emotion recognition (8) support vector machine (8) recurrent neural network (7) convolutional neural network (6) bidirectional long short-term memory (5) acoustic feature (5) audio classification (5) speech processing (4) long short-term memory (4) speech corpus (4) deception detection (4) feature extraction (4) transfer learning (3) binary classification (3) deep learning (3) autism spectrum disorder (3) data augmentation (3) sentiment analysis (3)

Papers

Discovering and Causally Validating Emotion-Sensitive Neurons in Large Audio-Language Models ACL 2026
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis AAAI 2025
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning ACL 2024
EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation ECCV 2024
Hierarchical Distribution Adaptation for Unsupervised Cross-corpus Speech Emotion Recognition INTERSPEECH 2024
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition INTERSPEECH 2024
MFDR: Multiple-stage Fusion and Dynamically Refined Network for Multimodal Emotion Recognition INTERSPEECH 2024
Real-world PTSD Recognition: A Cross-corpus and Cross-linguistic Evaluation INTERSPEECH 2024
This Paper Had the Smartest Reviewers - Flattery Detection Utilising an Audio-Textual Transformer-Based Approach INTERSPEECH 2024
Neural Compression Augmentation for Contrastive Audio Representation Learning INTERSPEECH 2024
Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition INTERSPEECH 2024
“So . . . my child . . . ” – How Child ADHD Influences the Way Parents Talk INTERSPEECH 2024
INTERSPEECH 2009 Emotion Challenge Revisited: Benchmarking 15 Years of Progress in Speech Emotion Recognition INTERSPEECH 2024
ParaCLAP – Towards a general language-audio model for computational paralinguistic tasks INTERSPEECH 2024
Sustained Vowels for Pre- vs Post-Treatment COPD Classification INTERSPEECH 2024
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition INTERSPEECH 2024
Multi-Type Outer Product-Based Fusion of Respiratory Sounds for Detecting COVID-19 INTERSPEECH 2022
Data Augmentation for Dementia Detection in Spoken Language. INTERSPEECH 2022
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement INTERSPEECH 2022
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning INTERSPEECH 2022
Uncertainty Aware Review Hallucination for Science Article Classification IJCNLP 2021
Uncertainty Aware Review Hallucination for Science Article Classification ACL 2021
A Walkthrough for the Principle of Logit Separation IJCAI 2019
Annotator Trustability-based Cooperative Learning Solutions for Intelligent Audio Analysis INTERSPEECH 2018
Categorical vs Dimensional Perception of Italian Emotional Speech INTERSPEECH 2018
auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks JMLR 2018
The INTERSPEECH 2018 Computational Paralinguistics Challenge: Atypical & Self-Assessed Affect, Crying & Heart Beats INTERSPEECH 2018
Evolving Learning for Analysing Mood-Related Infant Vocalisation INTERSPEECH 2018
State of Mind: Classification through Self-reported Affect and Word Use in Speech. INTERSPEECH 2018
Towards Temporal Modelling of Categorical Speech Emotion Recognition INTERSPEECH 2018
Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks INTERSPEECH 2018
Automated Classification of Children’s Linguistic versus Non-Linguistic Vocalisations INTERSPEECH 2018
The Perception and Analysis of the Likeability and Human Likeness of Synthesized Speech INTERSPEECH 2018
Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech INTERSPEECH 2018
How Did You like 2017? Detection of Language Markers of Depression and Narcissism in Personal Narratives INTERSPEECH 2018
Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World INTERSPEECH 2017
Implementing Gender-Dependent Vowel-Level Analysis for Boosting Speech-Based Depression Recognition INTERSPEECH 2017
Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach INTERSPEECH 2017
Automatic Classification of Autistic Child Vocalisations: A Novel Database and Results INTERSPEECH 2017
“Did you laugh enough today?” – Deep Neural Networks for Mobile and Wearable Laughter Trackers INTERSPEECH 2017
Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective INTERSPEECH 2017
Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings INTERSPEECH 2017
Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation INTERSPEECH 2017
The Perception of Emotions in Noisified Nonsense Speech INTERSPEECH 2017
The INTERSPEECH 2017 Computational Paralinguistics Challenge: Addressee, Cold & Snoring INTERSPEECH 2017
An ‘End-to-Evolution’ Hybrid Approach for Snore Sound Classification INTERSPEECH 2017
Snore Sound Classification Using Image-Based Deep Spectrum Features INTERSPEECH 2017
Discussion INTERSPEECH 2017
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding ICCV 2017
openXBOW -- Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit JMLR 2017
Contextual Bidirectional Long Short-Term Memory Recurrent Neural Network Language Models: A Generative Approach to Sentiment Analysis EACL 2017
The INTERSPEECH 2016 Computational Paralinguistics Challenge: A Summary of Results INTERSPEECH 2016
Convolutional Neural Networks with Data Augmentation for Classifying Speakers’ Native Language INTERSPEECH 2016
The Native Language Sub-Challenge: The Data INTERSPEECH 2016
Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective INTERSPEECH 2016
The Sincerity Sub-Challenge: The Data INTERSPEECH 2016
Is Deception Emotional? An Emotion-Driven Predictive Approach INTERSPEECH 2016
The Deception Sub-Challenge: The Data INTERSPEECH 2016
The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language INTERSPEECH 2016
Does She Speak RTT? Towards an Earlier Identification of Rett Syndrome Through Intelligent Pre-Linguistic Vocalisation Analysis INTERSPEECH 2016
Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children INTERSPEECH 2016
Real-Time Tracking of Speakers’ Emotions, States, and Traits on Mobile Platforms INTERSPEECH 2016
At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech INTERSPEECH 2016
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio IJCAI 2016
Driver Frustration Detection from Audio and Video in the Wild IJCAI 2016
Manual versus Automated: The Challenging Routine of Infant Vocalisation Segmentation in Home Videos to Study Neuro(mal)development INTERSPEECH 2016
Enhancing Multilingual Recognition of Emotion in Speech by Language Identification INTERSPEECH 2016
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks INTERSPEECH 2016
Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion Utilizing Complex Many-to-Many Alignments INTERSPEECH 2016