Shuai Wang
123 papers · 2015–2026 · 20 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (40)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (8)
Hot Topic Early Bird
Academic Marathon (10)
Cross-Pollinator (7)
Conference Loyalist (20)
Dynamic Duo (11)
Topic Evolution
Keyword Champion (4)
Grand Slam
Mega-Team (27)
Deep Specialist (17)
The Questioner
Conference Pioneer
Prolific Year (19)
Unstoppable (9)
Keyword Collector (89)
Century Club (106)
Trend Setter
Conferences
AAAI (21)
INTERSPEECH (20)
ACL (17)
EMNLP (11)
NIPS (6)
EACL (6)
MICCAI (5)
CVPR (5)
IJCAI (5)
ICML (4)
ICLR (4)
ICCV (4)
IJCNLP (3)
NAACL (3)
COLING (2)
ACML (2)
NSDI (2)
ECCV (1)
AISTATS (1)
OSDI (1)
Research topics
Keywords
large language model (14)
speaker verification (9)
speaker embedding (7)
knowledge distillation (6)
domain adaptation (6)
attention mechanism (6)
neural network (5)
convolutional neural network (5)
named entity recognition (5)
model compression (5)
adversarial training (4)
federated learning (4)
spiking neural network (4)
few-shot learning (4)
multi-task learning (4)
sentiment classification (4)
visual question answering (3)
semantic segmentation (3)
zero-shot learning (3)
catastrophic forgetting (3)
Papers
NeuPAN: Direct Point Robot Navigation with End-to-End Model-Based Learning (Abstract Reprint)
AAAI 2026
Training-Free ANN-to-SNN Conversion for High-Performance Spiking Transformers
AAAI 2026
Towards Training-Free and Accurate ANN-to-SNN Conversion via Activation-Aware Redistribution
AAAI 2026
When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors
ACL 2026
DigimonGPT: An Evolvable Agent with Hierarchical Human-like Memory for Video Question Answering
AAAI 2026
Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment
AAAI 2026
Diffusion Language Model Inference with Monte Carlo Tree Search
EACL 2026
LMGL-WD: LLM-Guided Multi-Task Graph Learning for Category-Level Warehouse Demand Prediction in E-Commerce
AAAI 2026
Scaling Law Analysis in Federated Learning: How to Select the Optimal Model Size?
AAAI 2026
Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
AAAI 2026
USE: A Unified Model for Universal Sound Separation and Extraction
AAAI 2026
ORTCL: Towards Continual Learning of Time Series Foundation Models on Streaming Data via Orthogonal Rotation
AAAI 2026
AHAMask: Reliable Task Specification for Large Audio Language Models Without Instructions
AAAI 2026
SQL-Trail: Multi-Turn Reinforcement Learning with Interleaved Feedback for Text-to-SQL
ACL 2026
PromptPrism: A Linguistically-Inspired Taxonomy for Prompts
EACL 2026
AutoBool: Reinforcement-Learned LLM for Effective Automatic Systematic Reviews Boolean Query Generation
EACL 2026
JARVIS or Ultron? A Survey on the Safety and Security Threats of Computer-Using Agents
ACL 2026
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor
AAAI 2025
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
AAAI 2025
LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing
ICML 2025
BSO: Binary Spiking Online Optimization Algorithm
ICML 2025
Differentiable Solver Search for Fast Diffusion Sampling
ICML 2025
Drop the Beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation
AAAI 2025
ToolACE: Winning the Points of LLM Function Calling
ICLR 2025
CD-PolypNet: Cross-Domain Polyp Segmentation Network with Internal Feature Distillation and Dual-Stream Boundary Focus via Large Vision Model
MICCAI 2025
Black-Box Visual Prompt Engineering for Mitigating Object Hallucination in Large Vision Language Models
NAACL 2025
Aligning to Constraints for Data-Efficient Language Model Customization
NAACL 2025
iQUEST: An Iterative Question-Guided Framework for Knowledge Base Question Answering
ACL 2025
Can't See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs
ACL 2025
MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization
ACL 2025
SocialEval: Evaluating Social Intelligence of Large Language Models
ACL 2025
NovelCR: A Large-Scale Bilingual Dataset Tailored for Long-Span Coreference Resolution
ACL 2025
Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing
ACL 2025
Rethinking Spiking Self-Attention Mechanism: Implementing a-XNOR Similarity Calculation in Spiking Transformers
CVPR 2025
Towards Accurate Binary Spiking Neural Networks: Learning with Adaptive Gradient Modulation Mechanism
AAAI 2025
Region-Based Text-Consistent Augmentation for Multimodal Medical Segmentation
MICCAI 2025
A Systematic Survey of Automatic Prompt Optimization Techniques
EMNLP 2025
Plugging Schema Graph into Multi-Table QA: A Human-Guided Framework for Reducing LLM Reliance
EMNLP 2025
Less is More: Empowering GUI Agent with Context-Aware Simplification
ICCV 2025
Spiking Vision Transformer with Saccadic Attention
ICLR 2025
SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation
ICLR 2025
Tackling Data Heterogeneity in Federated Learning via Loss Decomposition
MICCAI 2024
A Weak Supervision Approach for Few-Shot Aspect Based Sentiment Analysis
EACL 2024
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models
INTERSPEECH 2024
WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark
INTERSPEECH 2024
OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models
ACL 2024
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
INTERSPEECH 2024
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion
INTERSPEECH 2024
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
AAAI 2024
Generalized Robust Fundus Photography-based Vision Loss Estimation for High Myopia
MICCAI 2024
Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting
INTERSPEECH 2024
Split and Merge: Aligning Position Biases in LLM-based Evaluators
EMNLP 2024
BERGEN: A Benchmarking Library for Retrieval-Augmented Generation
EMNLP 2024
Joint Input and Output Coordination for Class-Incremental Learning
IJCAI 2024
Exploring DCN-like architecture for fast image generation with arbitrary resolution
NIPS 2024
Benchmarking the Simplification of Dutch Municipal Text
COLING 2024
Spike-based Neuromorphic Model for Sound Source Localization
NIPS 2024
On the Effectiveness of Acoustic BPE in Decoder-Only TTS
INTERSPEECH 2024
MMFusion: Multi-modality Diffusion Model for Lymph Node Metastasis Diagnosis in Esophageal Cancer
MICCAI 2024
Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing
CVPR 2024
ESP-PCT: Enhanced VR Semantic Performance through Efficient Compression of Temporal and Spatial Redundancies in Point Cloud Transformers
IJCAI 2024
NN-Defined Modulator: Reconfigurable and Portable Software Modulator on IoT Gateways
NSDI 2024
Taxonomy Expansion for Named Entity Recognition
EMNLP 2023
Explain Any Concept: Segment Anything Meets Concept-Based Explanation
NIPS 2023
Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents
ACL 2023
Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis
ACL 2023
Detecting and Repairing Deviated Outputs of Compressed Models
ACML 2023
Byzantine-Robust Federated Learning with Optimal Statistical Rates
AISTATS 2023
Feature Alignment and Uniformity for Test Time Adaptation
CVPR 2023
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views
EACL 2023
Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization
EACL 2023
DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning
NIPS 2023
InsightPilot: An LLM-Empowered Automated Data Exploration System
EMNLP 2023
Deep Equilibrium Object Detection
ICCV 2023
Towards Open-Vocabulary Video Instance Segmentation
ICCV 2023
Secure Federated Correlation Test and Entropy Estimation
ICML 2023
Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning
AAAI 2023
Teaching What You Should Teach: A Data-Based Distillation Method
IJCAI 2023
Beyond ADMM: A Unified Client-Variance-Reduced Adaptive Federated Learning Framework
AAAI 2023
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding
INTERSPEECH 2023
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
INTERSPEECH 2023
Buffer-based End-to-end Request Event Monitoring in the Cloud
NSDI 2022
DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction
NAACL 2022
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design
INTERSPEECH 2022
SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles
NIPS 2022
Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior
ECCV 2022
Context-aware Multimodal Fusion for Emotion Recognition
INTERSPEECH 2022
Multi-Task Learning and Adapted Knowledge Models for Emotion-Cause Extraction
ACL 2021
Detecting Domain Polarity-Changes of Words in a Sentiment Lexicon
ACL 2021
Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition
ACL 2021
Multi-Task Learning and Adapted Knowledge Models for Emotion-Cause Extraction
IJCNLP 2021
Detecting Domain Polarity-Changes of Words in a Sentiment Lexicon
IJCNLP 2021
Private Image Reconstruction from System Side Channels Using Generative Models
ICLR 2021
Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition
IJCNLP 2021
SANRAZOR: Reducing Redundant Sanitizer Checks in C/C++ Programs
OSDI 2021
A General Recurrent Tracking Framework Without Real Data
ICCV 2021
Sequential Cross-Document Coreference Resolution
EMNLP 2021
Multi-Domain Multi-Task Rehearsal for Lifelong Learning
AAAI 2021
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing
CVPR 2021
Resource-Enhanced Neural Model for Event Argument Extraction
EMNLP 2020
CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines
AAAI 2020
Image Enhanced Event Detection in News Articles
AAAI 2020
Improving Event Detection via Open-domain Trigger Knowledge
ACL 2020
Bayes-enhanced Lifelong Attention Networks for Sentiment Classification
COLING 2020
Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions Only
CVPR 2020
Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of Events
EMNLP 2020
A Knowledge-Driven Approach to Classifying Object and Attribute Coreferences in Opinion Mining
EMNLP 2020
Automatic recognition of abdominal lymph nodes from clinical text
EMNLP 2020
Metamorphic Testing and Certified Mitigation of Fairness Violations in NLP Models
IJCAI 2020
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection
INTERSPEECH 2020
Multi-Modality Matters: A Performance Leap on VoxCeleb
INTERSPEECH 2020
Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network
INTERSPEECH 2020
Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training
INTERSPEECH 2019
Forward and Backward Knowledge Transfer for Sentiment Classification
ACML 2019
On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction
INTERSPEECH 2019
The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge
INTERSPEECH 2019
Bayesian HMM Based x-Vector Clustering for Speaker Diarization
INTERSPEECH 2019
Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification
INTERSPEECH 2019
Target-Sensitive Memory Networks for Aspect Sentiment Classification
ACL 2018
Angular Softmax for Short-Duration Text-independent Speaker Verification
INTERSPEECH 2018
BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training
NIPS 2018
What Does the Speaker Embedding Encode?
INTERSPEECH 2017
A Unified Probabilistic Model of User Activities and Relations on Social Networking Sites
IJCAI 2015