Longbiao Wang
59 papers · 2016–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (7) πΊοΈ Taxonomy Completionist (20) π Interdisciplinary Bridge π§ Keyword Pioneer π Academic Marathon (9)
π
Renaissance Researcher
(9)
πΊοΈ
Taxonomy Completionist
(20)
π§
Keyword Pioneer
π
Conference Loyalist
(43)
π€
Dynamic Duo
(45)
π§¬
Topic Evolution
π¬
Deep Specialist
(13)
π
Keyword Champion
(2)
π₯
Unstoppable
(8)
π
Trend Setter
π
Conference Pioneer
β‘
Prolific Year
(8)
π
Century Club
(57)
ποΈ
Keyword Collector
(63)
Conferences
INTERSPEECH (43)
IJCAI (5)
AAAI (4)
ACL (3)
COLING (2)
EMNLP (1)
IJCNLP (1)
Top co-authors
Keywords
convolutional neural network
(7)
neural network
(5)
speech emotion recognition
(4)
sentiment analysis
(4)
attention mechanism
(4)
knowledge distillation
(4)
speech enhancement
(4)
graph neural network
(4)
semi-supervised learning
(3)
automatic speech recognition
(3)
multimodal learning
(3)
bidirectional long short-term memory
(3)
speech recognition
(3)
speaker extraction
(3)
representation learning
(3)
speaker recognition
(3)
multi-task learning
(3)
sarcasm detection
(2)
metric learning
(2)
feature extraction
(2)
Papers
UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions
ACL 2026
Evaluating the Expressive Appropriateness of Speech in Rich Contexts
ACL 2026
Enriching Multimodal Sentiment Analysis Through Textual Emotional Descriptions of Visual-Audio Content
AAAI 2025
Rethinking Contrastive Learning in Graph Anomaly Detection: A Clean-View Perspective
IJCAI 2025
HeterGP: Bridging Heterogeneity in Graph Neural Networks with Multi-View Prompting
AAAI 2025
Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework
IJCAI 2025
G^2SAM: Graph-Based Global Semantic Awareness Method for Multimodal Sarcasm Detection
AAAI 2024
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios
INTERSPEECH 2024
VoiCor: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
INTERSPEECH 2024
Exploring Pre-trained Speech Model for Articulatory Feature Extraction in Dysarthric Speech Using ASR
INTERSPEECH 2024
Multi-Modal Sarcasm Detection Based on Dual Generative Processes
IJCAI 2024
Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition
INTERSPEECH 2024
SDNet: Stream-attention and Dual-feature Learning Network for Ad-hoc Array Speech Separation
INTERSPEECH 2023
Commonsense Knowledge Enhanced Sentiment Dependency Graph for Sarcasm Detection
IJCAI 2023
Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation
INTERSPEECH 2023
Rethinking the Visual Cues in Audio-Visual Speaker Extraction
INTERSPEECH 2023
Discrimination of the Different Intents Carried by the Same Text Through Integrating Multimodal Information
INTERSPEECH 2023
Improving Zero-shot Cross-domain Slot Filling via Transformer-based Slot Semantics Fusion
INTERSPEECH 2023
Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions
INTERSPEECH 2023
Auditory Attention Detection in Real-Life Scenarios Using Common Spatial Patterns from EEG
INTERSPEECH 2023
Augmenting Affective Dependency Graph via Iterative Incongruity Graph Learning for Sarcasm Detection
AAAI 2023
Tackling Modality Heterogeneity with Multi-View Calibration Network for Multimodal Sentiment Detection
ACL 2023
Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training
INTERSPEECH 2022
Language-specific Characteristic Assistance for Code-switching Speech Recognition
INTERSPEECH 2022
Hierarchical Tagger with Multi-task Learning for Cross-domain Slot Filling
INTERSPEECH 2022
Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition
INTERSPEECH 2022
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
INTERSPEECH 2022
Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model
INTERSPEECH 2022
TopicKS: Topic-driven Knowledge Selection for Knowledge-grounded Dialogue Generation
INTERSPEECH 2022
Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction
INTERSPEECH 2022
Iterative Sound Source Localization for Unknown Number of Sources
INTERSPEECH 2022
Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network
INTERSPEECH 2022
MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources
INTERSPEECH 2022
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection
INTERSPEECH 2022
Information Sieve: Content Leakage Reduction in End-to-End Prosody Transfer for Expressive Speech Synthesis
INTERSPEECH 2021
TacoLPCNet: Fast and Stable TTS by Conditioning LPCNet on Mel Spectrogram Predictions
INTERSPEECH 2021
Domain-Specific Multi-Agent Dialog Policy Learning in Multi-Domain Task-Oriented Scenarios
INTERSPEECH 2021
Joint Feature Enhancement and Speaker Recognition with Multi-Objective Task-Oriented Network
INTERSPEECH 2021
Metric Learning Based Feature Representation with Gated Fusion Model for Speech Emotion Recognition
INTERSPEECH 2021
Time-Frequency Representation Learning with Graph Convolutional Network for Dialogue-Level Speech Emotion Recognition
INTERSPEECH 2021
Dynamic Margin Softmax Loss for Speaker Verification
INTERSPEECH 2020
Staged Knowledge Distillation for End-to-End Dysarthric Speech Recognition and Speech Attribute Transcription
INTERSPEECH 2020
EEG-Based Short-Time Auditory Attention Detection Using Multi-Task Deep Learning
INTERSPEECH 2020
Singing Voice Extraction with Attention-Based Spectrograms Fusion
INTERSPEECH 2020
Temporal Attention Convolutional Network for Speech Emotion Recognition with Latent Representation
INTERSPEECH 2020
SpEx+: A Complete Time Domain Speaker Extraction Network
INTERSPEECH 2020
Adversarial Separation Network for Speaker Recognition
INTERSPEECH 2020
ARET: Aggregated Residual Extended Time-Delay Neural Networks for Speaker Verification
INTERSPEECH 2020
Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement
INTERSPEECH 2019
CNN-BLSTM Based Question Detection from Dialogs Considering Phase and Context Information
INTERSPEECH 2019
A Semi-Supervised Stable Variational Network for Promoting Replier-Consistency in Dialogue Generation
IJCNLP 2019
A Semi-Supervised Stable Variational Network for Promoting Replier-Consistency in Dialogue Generation
EMNLP 2019
Integrative Network Embedding via Deep Joint Reconstruction
IJCAI 2018
Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement
INTERSPEECH 2018
Interaction-Aware Topic Model for Microblog Conversations through Network Embedding and User Attention
COLING 2018
Multiple Phase Information Combination for Replay Attacks Detection
INTERSPEECH 2018
Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning
COLING 2018
Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network
INTERSPEECH 2018
DNN-Based Amplitude and Phase Feature Enhancement for Noise Robust Speaker Identification
INTERSPEECH 2016