Bo Xu
145 papers · 2002–2026 · 17 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (23)
Keyword Pioneer
Conference Polyglot (17)
Cross-Pollinator (5)
Renaissance Researcher (6)
Interdisciplinary Bridge
Conference Loyalist (20)
Keyword Trendsetter Combo (5)
Dynamic Duo (24)
Triple Crown
Topic Evolution
Grand Slam
Deep Specialist (24)
Keyword Champion (2)
Century Club (139)
Conference Pioneer
Prolific Year (6)
The Questioner (2)
Trend Setter
Keyword Collector (66)
Unstoppable (14)
Conferences
AAAI (20)
COLING (19)
INTERSPEECH (19)
ACL (18)
EMNLP (16)
IJCAI (11)
NAACL (7)
NIPS (6)
IJCNLP (5)
WACV (4)
SEMEVAL (4)
ICLR (4)
ECCV (3)
CVPR (3)
ICML (2)
ICCV (2)
CONLL (2)
Keywords
large language model (11)
spiking neural network (10)
multimodal learning (9)
attention mechanism (8)
text classification (7)
recurrent neural network (7)
speech recognition (7)
neural machine translation (6)
knowledge distillation (6)
reinforcement learning (5)
contrastive learning (5)
speech separation (5)
named entity recognition (4)
in-context learning (4)
adversarial training (4)
multi-task learning (4)
sentiment analysis (4)
multi-label classification (4)
energy efficiency (4)
end-to-end learning (4)
Papers
Rose-SQL: Role-State Evolution Guided Structured Reasoning for Multi-Turn Text-to-SQL
ACL 2026
Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition
AAAI 2026
TinyChemVL: Advancing Chemical Vision-Language Models via Efficient Visual Token Reduction and Complex Reaction Tasks
AAAI 2026
KNNDA: A New Perspective of Alignment Recovery for Partially View-Aligned Clustering
AAAI 2026
MrCoM: A Meta-Regularized World-Model Generalizing Across Multi-Scenarios
AAAI 2026
MetaGPT: A Large Vision-Language Model for Meme Metaphor Understanding
AAAI 2026
Boosting Text-to-SQL through Multi-grained Error Identification
COLING 2025
HyperHatePrompt: A Hypergraph-based Prompting Fusion Model for Multimodal Hate Detection
COLING 2025
Skeletons Matter: Dynamic Data Augmentation for Text-to-Query
EMNLP 2025
CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries
ICML 2025
Consistency-Aware Padding for Incomplete Multi-Modal Alignment Clustering Based on Self-Repellent Greedy Anchor Search
IJCAI 2025
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
ICLR 2025
Episodic Novelty Through Temporal Distance
ICLR 2025
Gen-SQL: Efficient Text-to-SQL By Bridging Natural Language Question And Database Schema With Pseudo-Schema
COLING 2025
Enhancing Multimodal Named Entity Recognition through Adaptive Mixup Image Augmentation
COLING 2025
Text-Guided Fine-grained Counterfactual Inference for Short Video Fake News Detection
AAAI 2025
Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation
AAAI 2025
Efficient 3D Recognition with Event-driven Spike Sparse Convolution
AAAI 2025
Incomplete and Unpaired Multi-View Graph Clustering with Cross-View Feature Fusion
AAAI 2025
GuideNER: Annotation Guidelines Are Better than Examples for In-Context Named Entity Recognition
AAAI 2025
Leveraging Attention to Effectively Compress Prompts for Long-Context LLMs
AAAI 2025
Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall
EMNLP 2025
S-EPOA: Overcoming the Indistinguishability of Segments with Skill-Driven Preference-Based Reinforcement Learning
IJCAI 2025
Dual Robust Unbiased Multi-View Clustering for Incomplete and Unpaired Information
IJCAI 2025
DUTIR831 at SemEval-2025 Task 5: A Multi-Stage LLM Approach to GND Subject Assignment for TIBKAT Records
SEMEVAL 2025
111DUT at SemEval-2025 Task 8: Hierarchical Chain-of-Thought Reasoning and Multi-Model Deliberation for Robust TableQA
SEMEVAL 2025
EchoGPT: An Interactive Cardiac Function Assessment Model for Echocardiogram Videos
IJCAI 2025
Unveiling Maternity and Infant Care Conversations: A Chinese Dialogue Dataset for Enhanced Parenting Support
IJCAI 2025
MMDEND: Dendrite-Inspired Multi-Branch Multi-Compartment Parallel Spiking Neuron for Sequence Modeling
ACL 2025
Is LLM an Overconfident Judge? Unveiling the Capabilities of LLMs in Detecting Offensive Language with Annotation Disagreement
ACL 2025
111DUT at SemEval-2025 Task 8: Hierarchical Chain-of-Thought Reasoning and Multi-Model Deliberation for Robust TableQA
ACL 2025
Coarse-to-Fine Grounded Memory for LLM Agent Planning
EMNLP 2025
DUTIR831 at SemEval-2025 Task 5: A Multi-Stage LLM Approach to GND Subject Assignment for TIBKAT Records
ACL 2025
Learnable Infinite Taylor Gaussian for Dynamic View Rendering
CVPR 2025
MRE-MI: A Multi-image Dataset for Multimodal Relation Extraction in Social Media Posts
NAACL 2025
Prototype Tuning: A Meta-Learning Approach for Few-Shot Document-Level Relation Extraction with Large Language Models
NAACL 2025
Dialect-SQL: An Adaptive Framework for Bridging the Dialect Gap in Text-to-SQL
EMNLP 2025
Conditional Semantic Textual Similarity via Conditional Contrastive Learning
COLING 2025
Temporal Knowledge Graph Reasoning with Dynamic Hypergraph Embedding
COLING 2024
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips
ICLR 2024
Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration
NIPS 2024
Privileged Prior Information Distillation for Image Matting
AAAI 2024
Video-Context Aligned Transformer for Video Question Answering
AAAI 2024
SDNet: An Extremely Efficient Portrait Matting Model via Self-Distillation
WACV 2024
SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural Network
ACL 2024
Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection
ACL 2024
DUTIR938 at SemEval-2024 Task 4: Semi-Supervised Learning and Model Ensemble for Persuasion Techniques Detection in Memes
SEMEVAL 2024
DUTIR938 at SemEval-2024 Task 4: Semi-Supervised Learning and Model Ensemble for Persuasion Techniques Detection in Memes
NAACL 2024
Adaptive Reinforcement Tuning Language Models as Hard Data Generators for Sentence Representation
COLING 2024
Beyond Linguistic Cues: Fine-grained Conversational Emotion Recognition via Belief-Desire Modelling
COLING 2024
MNER-MI: A Multi-image Dataset for Multimodal Named Entity Recognition in Social Media
COLING 2024
RENN: A Rule Embedding Enhanced Neural Network Framework for Temporal Knowledge Graph Completion
COLING 2024
Take Its Essence, Discard Its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect
COLING 2024
Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection
ECCV 2024
URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields
ECCV 2024
GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation
ECCV 2024
High-Performance Temporal Reversible Spiking Neural Networks with $\mathcal{O}(L)$ Training Memory and $\mathcal{O}(1)$ Inference Cost
ICML 2024
Towards Comprehensive Detection of Chinese Harmful Memes
NIPS 2024
Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks
EMNLP 2024
Breaking the Boundaries: A Unified Framework for Chinese Named Entity Recognition Across Text and Speech
EMNLP 2024
PclGPT: A Large Language Model for Patronizing and Condescending Language Detection
EMNLP 2024
MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map
NIPS 2024
Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks
ACL 2023
Just Like a Human Would, Direct Access to Sarcasm Augmented with Potential Result and Reaction
ACL 2023
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers
INTERSPEECH 2023
P-vectors: A Parallel-coupled TDNN/Transformer Network for Speaker Verification
INTERSPEECH 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
INTERSPEECH 2023
A Rotation-Translation-Decoupled Solution for Robust and Efficient Visual-Inertial Initialization
CVPR 2023
Spike-driven Transformer
NIPS 2023
Video Object Matting via Hierarchical Space-Time Semantic Guidance
WACV 2023
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction
AAAI 2023
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition
AAAI 2023
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay
ICLR 2023
Inherent Redundancy in Spiking Neural Networks
ICCV 2023
ODE-based Recurrent Model-free Reinforcement Learning for POMDPs
NIPS 2023
GUTS at SemEval-2022 Task 4: Adversarial Training and Balancing Methods for Patronizing and Condescending Language Detection
SEMEVAL 2022
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
INTERSPEECH 2022
Deep Two-Stream Video Inference for Human Body Pose and Shape Estimation
WACV 2022
RealMedDial: A Real Telemedical Dialogue Dataset Collected from Online Chinese Short-Video Clips
COLING 2022
Different Data, Different Modalities! Reinforced Data Splitting for Effective Multimodal Information Extraction from Social Media Posts
COLING 2022
GUTS at SemEval-2022 Task 4: Adversarial Training and Balancing Methods for Patronizing and Condescending Language Detection
NAACL 2022
Multi-Sacle Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning
AAAI 2022
Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network
NAACL 2021
Consecutive Decoding for Speech-to-text Translation
AAAI 2021
Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation
AAAI 2021
Locality Preserving Sentence Encoding
EMNLP 2021
Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction
ICCV 2021
MIMO Self-Attentive RNN Beamformer for Multi-Speaker Speech Separation
INTERSPEECH 2021
Exploring wav2vec 2.0 on Speaker Verification and Language Identification
INTERSPEECH 2021
Watch to Listen Clearly: Visual Speech Enhancement Driven Multi-modality Speech Recognition
WACV 2020
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments
INTERSPEECH 2020
Speaker-Conditional Chain Model for Speech Separation and Extraction
INTERSPEECH 2020
DMRM: A Dual-Channel Multi-Hop Reasoning Model for Visual Dialog
AAAI 2020
Discriminative Multi-Modality Speech Recognition
CVPR 2020
LISNN: Improving Spiking Neural Networks with Lateral Interactions for Robust Object Recognition
IJCAI 2020
Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation
EMNLP 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
NIPS 2020
Knowledge Aware Emotion Recognition in Textual Conversations via Multi-Task Incremental Transformer
COLING 2020
Ectc-Docd: An End-to-End Structure with CTC Encoder and OCD Decoder for Speech Recognition
INTERSPEECH 2019
A Working Memory Model for Task-oriented Dialog Response Generation
ACL 2019
The World in My Mind: Visual Dialog with Adversarial Multi-modal Feature Encoding
NAACL 2019
Adapting Translation Models for Transcript Disfluency Detection
AAAI 2019
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring
INTERSPEECH 2019
Which Ones Are Speaking? Speaker-Inferred Model for Multi-Talker Speech Separation
INTERSPEECH 2019
Cascaded Mutual Modulation for Visual Reasoning
EMNLP 2018
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
INTERSPEECH 2018
Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets
NAACL 2018
Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation
IJCAI 2018
Single-channel Speech Dereverberation via Generative Adversarial Training
INTERSPEECH 2018
Brain-inspired Balanced Tuning for Spiking Neural Networks
IJCAI 2018
Construction of a Chinese Corpus for the Analysis of the Emotionality of Metaphorical Expressions
ACL 2018
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese
INTERSPEECH 2018
Semi-Supervised Disfluency Detection
COLING 2018
Unsupervised Neural Machine Translation with Weight Sharing
ACL 2018
WECA: A WordNet-Encoded Collocation-Attention Network for Homographic Pun Recognition
EMNLP 2018
Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme
ACL 2017
Towards Compact and Fast Neural Machine Translation Using a Combined Method
EMNLP 2017
Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation
IJCNLP 2017
Multilingual Recurrent Neural Networks with Residual Learning for Low-Resource Speech Recognition
INTERSPEECH 2017
Hierarchical Memory Networks for Answer Selection on Unknown Words
COLING 2016
Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling
COLING 2016
A Character-Aware Encoder for Neural Machine Translation
COLING 2016
Learning Defining Features for Categories
IJCAI 2016
Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification
ACL 2016
End-to-End Language Identification Using Attention-Based Recurrent Neural Networks
INTERSPEECH 2016
First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention
INTERSPEECH 2016
Multidimensional Residual Learning Based on Recurrent Neural Networks for Acoustic Modeling
INTERSPEECH 2016
Gating Recurrent Enhanced Memory Neural Networks on Language Identification
INTERSPEECH 2016
Convolutional Neural Networks for Text Hashing
IJCAI 2015
Dialogue Management based on Sentence Clustering
IJCNLP 2015
Semantic Clustering and Convolutional Neural Network for Short Text Categorization
IJCNLP 2015
Semi-supervised Chinese Word Segmentation based on Bilingual Information
EMNLP 2015
Dialogue Management based on Sentence Clustering
ACL 2015
Semantic Clustering and Convolutional Neural Network for Short Text Categorization
ACL 2015
Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machine Translation
ACL 2014
Phrase-based Parallel Fragments Extraction from Comparable Corpora
IJCNLP 2013
Joint and Coupled Bilingual Topic Model Based Sentence Representations for Language Model Adaptation
IJCAI 2013
Automated Essay Scoring Based on Finite State Transducer: towards ASR Transcription of Oral English Speech
ACL 2012
Translation Model Based Cross-Lingual Language Model Adaptation: from Word Models to Phrase Models
EMNLP 2012
Translation Model Based Cross-Lingual Language Model Adaptation: from Word Models to Phrase Models
CONLL 2012
Probabilistic Parsing Action Models for Multi-Lingual Dependency Parsing
EMNLP 2007
Probabilistic Parsing Action Models for Multi-Lingual Dependency Parsing
CONLL 2007
Chinese Named Entity Recognition with Multiple Features
EMNLP 2005
Product Named Entity Recognition Based on Hierarchical Hidden Markov Model
IJCNLP 2005
Chinese Syntactic Parsing Based on Extended GLR Parsing Algorithm with PCFG*
COLING 2002