Shuai Wang

123 papers · 2015–2026 · 20 conferences · across top CS/AI conferences

Achievements

+17 more ↓

🗺️ Taxonomy Completionist (40) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (8) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (10) 🐝 Cross-Pollinator (7) 🏠 Conference Loyalist (20) 🤝 Dynamic Duo (11) 🧬 Topic Evolution 🏆 Keyword Champion (4) 🏆 Grand Slam 👥 Mega-Team (27) 🔬 Deep Specialist (17) ❓ The Questioner 🚀 Conference Pioneer ⚡ Prolific Year (19) 🔥 Unstoppable (9) 🗃️ Keyword Collector (89) 💎 Century Club (106) 📈 Trend Setter

Conferences

AAAI (21) INTERSPEECH (20) ACL (17) EMNLP (11) NIPS (6) EACL (6) MICCAI (5) CVPR (5) IJCAI (5) ICML (4) ICLR (4) ICCV (4) IJCNLP (3) NAACL (3) COLING (2) ACML (2) NSDI (2) ECCV (1) AISTATS (1) OSDI (1)

Top co-authors

Yanmin Qian (12) Kai Yu (10) Miguel Ballesteros (9) Malu Zhang (8) Wenjie Wei (7) Bing Liu (6) Meihan Tong (6) Zhengyang Chen (5) Yixin Cao (5) Dehao Zhang (5)

Research topics

Differential Privacy (1)

Keywords

large language model (14) speaker verification (9) speaker embedding (7) knowledge distillation (6) domain adaptation (6) attention mechanism (6) neural network (5) convolutional neural network (5) named entity recognition (5) model compression (5) adversarial training (4) federated learning (4) spiking neural network (4) few-shot learning (4) multi-task learning (4) sentiment classification (4) visual question answering (3) semantic segmentation (3) zero-shot learning (3) catastrophic forgetting (3)

Papers

NeuPAN: Direct Point Robot Navigation with End-to-End Model-Based Learning (Abstract Reprint) AAAI 2026 Training-Free ANN-to-SNN Conversion for High-Performance Spiking Transformers AAAI 2026 Towards Training-Free and Accurate ANN-to-SNN Conversion via Activation-Aware Redistribution AAAI 2026 When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors ACL 2026 DigimonGPT: An Evolvable Agent with Hierarchical Human-like Memory for Video Question Answering AAAI 2026 Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment AAAI 2026 Diffusion Language Model Inference with Monte Carlo Tree Search EACL 2026 LMGL-WD: LLM-Guided Multi-Task Graph Learning for Category-Level Warehouse Demand Prediction in E-Commerce AAAI 2026 Scaling Law Analysis in Federated Learning: How to Select the Optimal Model Size? AAAI 2026 Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary AAAI 2026 USE: A Unified Model for Universal Sound Separation and Extraction AAAI 2026 ORTCL: Towards Continual Learning of Time Series Foundation Models on Streaming Data via Orthogonal Rotation AAAI 2026 AHAMask: Reliable Task Specification for Large Audio Language Models Without Instructions AAAI 2026 SQL-Trail: Multi-Turn Reinforcement Learning with Interleaved Feedback for Text-to-SQL ACL 2026 PromptPrism: A Linguistically-Inspired Taxonomy for Prompts EACL 2026 AutoBool: Reinforcement-Learned LLM for Effective Automatic Systematic Reviews Boolean Query Generation EACL 2026 JARVIS or Ultron? A Survey on the Safety and Security Threats of Computer-Using Agents ACL 2026 SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor AAAI 2025 VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis AAAI 2025 LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs – No Silver Bullet for LC or RAG Routing ICML 2025 BSO: Binary Spiking Online Optimization Algorithm ICML 2025 Differentiable Solver Search for Fast Diffusion Sampling ICML 2025 Drop the Beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation AAAI 2025 ToolACE: Winning the Points of LLM Function Calling ICLR 2025 CD-PolypNet: Cross-Domain Polyp Segmentation Network with Internal Feature Distillation and Dual-Stream Boundary Focus via Large Vision Model MICCAI 2025 Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models NAACL 2025 Aligning to Constraints for Data-Efficient Language Model Customization NAACL 2025 iQUEST: An Iterative Question-Guided Framework for Knowledge Base Question Answering ACL 2025 Can’t See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs ACL 2025 MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization ACL 2025 SocialEval: Evaluating Social Intelligence of Large Language Models ACL 2025 NovelCR: A Large-Scale Bilingual Dataset Tailored for Long-Span Coreference Resolution ACL 2025 Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing ACL 2025 Rethinking Spiking Self-Attention Mechanism: Implementing a-XNOR Similarity Calculation in Spiking Transformers CVPR 2025 Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism AAAI 2025 Region-Based Text-Consistent Augmentation for Multimodal Medical Segmentation MICCAI 2025 A Systematic Survey of Automatic Prompt Optimization Techniques EMNLP 2025 Plugging Schema Graph into Multi-Table QA: A Human-Guided Framework for Reducing LLM Reliance EMNLP 2025 Less is More: Empowering GUI Agent with Context-Aware Simplification ICCV 2025 Spiking Vision Transformer with Saccadic Attention ICLR 2025 SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION ICLR 2025 Tackling Data Heterogeneity in Federated Learning via Loss Decomposition MICCAI 2024 A Weak Supervision Approach for Few-Shot Aspect Based Sentiment Analysis EACL 2024 Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models INTERSPEECH 2024 WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark INTERSPEECH 2024 OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models ACL 2024 WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction INTERSPEECH 2024 DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion INTERSPEECH 2024 UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding AAAI 2024 Generalized Robust Fundus Photography-based Vision Loss Estimation for High Myopia MICCAI 2024 Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting INTERSPEECH 2024 Split and Merge: Aligning Position Biases in LLM-based Evaluators EMNLP 2024 BERGEN: A Benchmarking Library for Retrieval-Augmented Generation EMNLP 2024 Joint Input and Output Coordination for Class-Incremental Learning IJCAI 2024 Exploring DCN-like architecture for fast image generation with arbitrary resolution NIPS 2024 Benchmarking the Simplification of Dutch Municipal Text COLING 2024 Spike-based Neuromorphic Model for Sound Source Localization NIPS 2024 On the Effectiveness of Acoustic BPE in Decoder-Only TTS INTERSPEECH 2024 MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer MICCAI 2024 Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing CVPR 2024 ESP-PCT: Enhanced VR Semantic Performance through Efficient Compression of Temporal and Spatial Redundancies in Point Cloud Transformers IJCAI 2024 NN-Defined Modulator: Reconfigurable and Portable Software Modulator on IoT Gateways NSDI 2024 Taxonomy Expansion for Named Entity Recognition EMNLP 2023 Explain Any Concept: Segment Anything Meets Concept-Based Explanation NIPS 2023 Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents ACL 2023 Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis ACL 2023 Detecting and Repairing Deviated Outputs of Compressed Models ACML 2023 Byzantine-Robust Federated Learning with Optimal Statistical Rates AISTATS 2023 Feature Alignment and Uniformity for Test Time Adaptation CVPR 2023 Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views EACL 2023 Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization EACL 2023 DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning NIPS 2023 InsightPilot: An LLM-Empowered Automated Data Exploration System EMNLP 2023 Deep Equilibrium Object Detection ICCV 2023 Towards Open-Vocabulary Video Instance Segmentation ICCV 2023 Secure Federated Correlation Test and Entropy Estimation ICML 2023 Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning AAAI 2023 Teaching What You Should Teach: A Data-Based Distillation Method IJCAI 2023 Beyond ADMM: A Unified Client-Variance-Reduced Adaptive Federated Learning Framework AAAI 2023 DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding INTERSPEECH 2023 Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor INTERSPEECH 2023 Buffer-based End-to-end Request Event Monitoring in the Cloud NSDI 2022 DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction NAACL 2022 DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design INTERSPEECH 2022 SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles NIPS 2022 Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior ECCV 2022 Context-aware Multimodal Fusion for Emotion Recognition INTERSPEECH 2022 Multi-Task Learning and Adapted Knowledge Models for Emotion-Cause Extraction ACL 2021 Detecting Domain Polarity-Changes of Words in a Sentiment Lexicon ACL 2021 Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition ACL 2021 Multi-Task Learning and Adapted Knowledge Models for Emotion-Cause Extraction IJCNLP 2021 Detecting Domain Polarity-Changes of Words in a Sentiment Lexicon IJCNLP 2021 Private Image Reconstruction from System Side Channels Using Generative Models ICLR 2021 Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition IJCNLP 2021 SANRAZOR: Reducing Redundant Sanitizer Checks in C/C++ Programs OSDI 2021 A General Recurrent Tracking Framework Without Real Data ICCV 2021 Sequential Cross-Document Coreference Resolution EMNLP 2021 Multi-Domain Multi-Task Rehearsal for Lifelong Learning AAAI 2021 Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing CVPR 2021 Resource-Enhanced Neural Model for Event Argument Extraction EMNLP 2020 CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines AAAI 2020 Image Enhanced Event Detection in News Articles AAAI 2020 Improving Event Detection via Open-domain Trigger Knowledge ACL 2020 Bayes-enhanced Lifelong Attention Networks for Sentiment Classification COLING 2020 Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions Only CVPR 2020 Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of Events EMNLP 2020 A Knowledge-Driven Approach to Classifying Object and Attribute Coreferences in Opinion Mining EMNLP 2020 Automatic recognition of abdominal lymph nodes from clinical text EMNLP 2020 Metamorphic Testing and Certified Mitigation of Fairness Violations in NLP Models IJCAI 2020 Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection INTERSPEECH 2020 Multi-Modality Matters: A Performance Leap on VoxCeleb INTERSPEECH 2020 Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network INTERSPEECH 2020 Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training INTERSPEECH 2019 Forward and Backward Knowledge Transfer for Sentiment Classification ACML 2019 On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction INTERSPEECH 2019 The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge INTERSPEECH 2019 Bayesian HMM Based x-Vector Clustering for Speaker Diarization INTERSPEECH 2019 Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification INTERSPEECH 2019 Target-Sensitive Memory Networks for Aspect Sentiment Classification ACL 2018 Angular Softmax for Short-Duration Text-independent Speaker Verification INTERSPEECH 2018 BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training NIPS 2018 What Does the Speaker Embedding Encode? INTERSPEECH 2017 A Unified Probabilistic Model of User Activities and Relations on Social Networking Sites IJCAI 2015