Longbiao Wang

59 papers · 2016–2026 · 7 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌍 Conference Polyglot (7) 🗺️ Taxonomy Completionist (20) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (9)

🌈 Renaissance Researcher (9) 🗺️ Taxonomy Completionist (20) 🧭 Keyword Pioneer 🏠 Conference Loyalist (43) 🤝 Dynamic Duo (45) 🧬 Topic Evolution 🔬 Deep Specialist (13) 🏆 Keyword Champion (2) 🔥 Unstoppable (8) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (8) 💎 Century Club (57) 🗃️ Keyword Collector (63)

Conferences

INTERSPEECH (43) IJCAI (5) AAAI (4) ACL (3) COLING (2) EMNLP (1) IJCNLP (1)

Top co-authors

Jianwu Dang (47) Meng Ge (16) Xiaobao Wang (10) Di Jin (9) Gaoyan Zhang (8) Meng Liu (6) Hao Shi (6) Sheng Li (6) Cheng Gong (6) Yuke Si (5)

Keywords

convolutional neural network (7) neural network (5) speech emotion recognition (4) sentiment analysis (4) attention mechanism (4) knowledge distillation (4) speech enhancement (4) graph neural network (4) semi-supervised learning (3) automatic speech recognition (3) multimodal learning (3) bidirectional long short-term memory (3) speech recognition (3) speaker extraction (3) representation learning (3) speaker recognition (3) multi-task learning (3) sarcasm detection (2) metric learning (2) feature extraction (2)

Papers

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions ACL 2026 Evaluating the Expressive Appropriateness of Speech in Rich Contexts ACL 2026 Enriching Multimodal Sentiment Analysis Through Textual Emotional Descriptions of Visual-Audio Content AAAI 2025 Rethinking Contrastive Learning in Graph Anomaly Detection: A Clean-View Perspective IJCAI 2025 HeterGP: Bridging Heterogeneity in Graph Neural Networks with Multi-View Prompting AAAI 2025 Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework IJCAI 2025 G^2SAM: Graph-Based Global Semantic Awareness Method for Multimodal Sarcasm Detection AAAI 2024 An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios INTERSPEECH 2024 VoiCor: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement INTERSPEECH 2024 Exploring Pre-trained Speech Model for Articulatory Feature Extraction in Dysarthric Speech Using ASR INTERSPEECH 2024 Multi-Modal Sarcasm Detection Based on Dual Generative Processes IJCAI 2024 Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition INTERSPEECH 2024 SDNet: Stream-attention and Dual-feature Learning Network for Ad-hoc Array Speech Separation INTERSPEECH 2023 Commonsense Knowledge Enhanced Sentiment Dependency Graph for Sarcasm Detection IJCAI 2023 Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation INTERSPEECH 2023 Rethinking the Visual Cues in Audio-Visual Speaker Extraction INTERSPEECH 2023 Discrimination of the Different Intents Carried by the Same Text Through Integrating Multimodal Information INTERSPEECH 2023 Improving Zero-shot Cross-domain Slot Filling via Transformer-based Slot Semantics Fusion INTERSPEECH 2023 Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions INTERSPEECH 2023 Auditory Attention Detection in Real-Life Scenarios Using Common Spatial Patterns from EEG INTERSPEECH 2023 Augmenting Affective Dependency Graph via Iterative Incongruity Graph Learning for Sarcasm Detection AAAI 2023 Tackling Modality Heterogeneity with Multi-View Calibration Network for Multimodal Sentiment Detection ACL 2023 Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training INTERSPEECH 2022 Language-specific Characteristic Assistance for Code-switching Speech Recognition INTERSPEECH 2022 Hierarchical Tagger with Multi-task Learning for Cross-domain Slot Filling INTERSPEECH 2022 Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition INTERSPEECH 2022 VCSE: Time-Domain Visual-Contextual Speaker Extraction Network INTERSPEECH 2022 Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model INTERSPEECH 2022 TopicKS: Topic-driven Knowledge Selection for Knowledge-grounded Dialogue Generation INTERSPEECH 2022 Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction INTERSPEECH 2022 Iterative Sound Source Localization for Unknown Number of Sources INTERSPEECH 2022 Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network INTERSPEECH 2022 MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources INTERSPEECH 2022 Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection INTERSPEECH 2022 Information Sieve: Content Leakage Reduction in End-to-End Prosody Transfer for Expressive Speech Synthesis INTERSPEECH 2021 TacoLPCNet: Fast and Stable TTS by Conditioning LPCNet on Mel Spectrogram Predictions INTERSPEECH 2021 Domain-Specific Multi-Agent Dialog Policy Learning in Multi-Domain Task-Oriented Scenarios INTERSPEECH 2021 Joint Feature Enhancement and Speaker Recognition with Multi-Objective Task-Oriented Network INTERSPEECH 2021 Metric Learning Based Feature Representation with Gated Fusion Model for Speech Emotion Recognition INTERSPEECH 2021 Time-Frequency Representation Learning with Graph Convolutional Network for Dialogue-Level Speech Emotion Recognition INTERSPEECH 2021 Dynamic Margin Softmax Loss for Speaker Verification INTERSPEECH 2020 Staged Knowledge Distillation for End-to-End Dysarthric Speech Recognition and Speech Attribute Transcription INTERSPEECH 2020 EEG-Based Short-Time Auditory Attention Detection Using Multi-Task Deep Learning INTERSPEECH 2020 Singing Voice Extraction with Attention-Based Spectrograms Fusion INTERSPEECH 2020 Temporal Attention Convolutional Network for Speech Emotion Recognition with Latent Representation INTERSPEECH 2020 SpEx+: A Complete Time Domain Speaker Extraction Network INTERSPEECH 2020 Adversarial Separation Network for Speaker Recognition INTERSPEECH 2020 ARET: Aggregated Residual Extended Time-Delay Neural Networks for Speaker Verification INTERSPEECH 2020 Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement INTERSPEECH 2019 CNN-BLSTM Based Question Detection from Dialogs Considering Phase and Context Information INTERSPEECH 2019 A Semi-Supervised Stable Variational Network for Promoting Replier-Consistency in Dialogue Generation IJCNLP 2019 A Semi-Supervised Stable Variational Network for Promoting Replier-Consistency in Dialogue Generation EMNLP 2019 Integrative Network Embedding via Deep Joint Reconstruction IJCAI 2018 Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement INTERSPEECH 2018 Interaction-Aware Topic Model for Microblog Conversations through Network Embedding and User Attention COLING 2018 Multiple Phase Information Combination for Replay Attacks Detection INTERSPEECH 2018 Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning COLING 2018 Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network INTERSPEECH 2018 DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification INTERSPEECH 2016