SHUAI ZHANG

97 papers · 2018–2026 · 17 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird

🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (17) 🤝 Dynamic Duo (11) 👑 Triple Crown 🏆 Keyword Champion (2) 🏆 Grand Slam 🔬 Deep Specialist (14) 🧬 Topic Evolution ⚡ Prolific Year (13) 🔥 Unstoppable (8) ❓ The Questioner (3) 🗃️ Keyword Collector (347) 💎 Century Club (88) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

AAAI (17) ACL (17) NIPS (12) ICLR (9) ICML (8) IJCAI (7) INTERSPEECH (7) EMNLP (4) NAACL (4) CVPR (2) ICCV (2) IJCNLP (2) MICCAI (2) EACL (1) CONLL (1) AISTATS (1) WACV (1)

Top co-authors

Jianhua Tao (16) Zhengqi Wen (11) Yi Tay (10) Boran Han (9) Meng Wang (8) Jiangyan Yi (8) Jinyang Wu (8) Pin-Yu Chen (8) Sijia Liu (7) Aston Zhang (7)

Keywords

large language model (9) attention mechanism (6) model compression (6) reinforcement learning (5) contrastive learning (5) federated learning (4) sample complexity (4) named entity recognition (4) automatic speech recognition (4) sentiment analysis (4) deep learning (3) knowledge distillation (3) representation learning (3) variational autoencoder (3) transformer architecture (3) machine translation (3) data augmentation (3) few-shot learning (3) natural language processing (3) speech recognition (3)

Papers

Beyond Examples: Towards Automated Thought-level In-Context Reasoning for Large Language Models ACL 2026
LGSA: Label Geometry Structuring and Aligning for Hierarchical Text Classification ACL 2026
From Imitation to Discrimination: Toward a Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks AAAI 2026
AStar: Boosting Multimodal Reasoning with Automated Structured Thinking AAAI 2026
Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices AAAI 2026
Efficient Table Retrieval and Understanding with Multimodal Large Language Models EACL 2026
SPARK: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning ACL 2026
ReFL: Reflective Feedback Learning for Hallucination Detection of Large Language Models ACL 2026
Two-Stage Regularization-Based Structured Pruning for LLMs ACL 2026
Iterative Substructure Extraction for Molecular Relational Learning with Interactive Graph Information Bottleneck ICLR 2025
Prompt Tuning In a Compact Attribute Space AAAI 2025
S²MILE: Semantic-and-Structure-Aware Music-Driven Lyric Generation AAAI 2025
MalDetectFormer: Leveraging Sparse SpatioTemporal Information for Effective Malicious Traffic Detection AAAI 2025
Code-switching Mediated Sentence-level Semantic Learning AAAI 2025
AoI-MDP: An AoI Optimized Markov Decision Process Dedicated in the Underwater Task (Student Abstract) AAAI 2025
ERFSL: An Efficient Reward Function Searcher via Large Language Models for Custom-Environment Multi-Objective Reinforcement Learning (Student Abstract) AAAI 2025
UACOF: A USV-AUV Collaboration Framework for Underwater Tasks Under Extreme Sea Conditions (Student Abstract) AAAI 2025
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers ICLR 2025
RadialRouter: Structured Representation for Efficient and Robust Large Language Models Routing EMNLP 2025
PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms ICLR 2025
Unlearning through Knowledge Overwriting: Reversible Federated Unlearning via Selective Sparse Adapter CVPR 2025
Adapting to Online Distribution Shifts in Deep Learning: A Black-Box Approach AISTATS 2025
Multi-level Relevance Document Identifier Learning for Generative Retrieval ACL 2025
Pandora’s Box or Aladdin’s Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models ACL 2025
RetrieverGuard: Empowering Information Retrieval to Combat LLM-Generated Misinformation NAACL 2025
3D Acetabular Surface Reconstruction from 2D Pre-operative X-ray Images using SRVF Elastic Registration and Deformation Graph MICCAI 2025
Sharpness-aware Zeroth-order Optimization for Graph Transformers IJCAI 2025
Conformal Anomaly Detection in Event Sequences ICML 2025
Bilateral Masking with prompt for Knowledge Graph Completion NAACL 2024
Unraveling the Gradient Descent Dynamics of Transformers NIPS 2024
MobileInst: Video Instance Segmentation on the Mobile AAAI 2024
Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators AAAI 2024
How to Trade Off the Quantity and Capacity of Teacher Ensemble: Learning Categorical Distribution to Stochastically Employ a Teacher for Distillation AAAI 2024
CaMML: Context-Aware Multimodal Learner for Large Models ACL 2024
MolTC: Towards Molecular Relational Modeling In Language Models ACL 2024
Bridging Remote Sensors with Multisensor Geospatial Foundation Models CVPR 2024
SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support EMNLP 2024
Understanding the Therapeutic Relationship between Counselors and Clients in Online Text-based Counseling using LLMs EMNLP 2024
Discovering Bias in Latent Space: An Unsupervised Debiasing Approach ICML 2024
FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. Data ICML 2024
Transferring Knowledge From Large Foundation Models to Small Downstream Models ICML 2024
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning ICML 2024
Neural Jump-Diffusion Temporal Point Processes ICML 2024
MMGNN: A Molecular Merged Graph Neural Network for Explainable Solvation Free Energy Prediction IJCAI 2024
Gaussian Pancakes: Geometrically-Regularized 3D Gaussian Splatting for Realistic Endoscopic Reconstruction MICCAI 2024
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving NAACL 2024
Data Augmentation for Object Detection via Controllable Diffusion Models WACV 2024
Understanding Client Reactions in Online Mental Health Counseling ACL 2023
Rethinking Document-Level Relation Extraction: A Reality Check ACL 2023
Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features INTERSPEECH 2023
MAS: Towards Resource-Efficient Federated Multiple-Task Learning ICCV 2023
Offline Imitation Learning with Variational Counterfactual Reasoning NIPS 2023
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection INTERSPEECH 2023
SKDBERT: Compressing BERT via Stochastic Knowledge Distillation AAAI 2023
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks ICML 2023
Data-Informed Geometric Space Selection NIPS 2023
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $\epsilon$-Greedy Exploration NIPS 2023
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition NIPS 2023
Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning NIPS 2023
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks ICLR 2023
Divergence-aware Federated Self-Supervised Learning ICLR 2022
ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer ACL 2022
Syntax-guided Contrastive Learning for Pre-trained Language Model ACL 2022
Neural Methods for Logical Reasoning over Knowledge Graphs ICLR 2022
Jump Self-attention: Capturing High-order Statistics in Transformers NIPS 2022
AutoST: Towards the Universal Modeling of Spatio-temporal Sequences NIPS 2022
Reducing Multilingual Context Confusion for End-to-End Code-Switching Automatic Speech Recognition INTERSPEECH 2022
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis ICLR 2022
De-Bias for Generative Extraction in Unified NER Task ACL 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP EMNLP 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP CONLL 2022
Collaborative Unsupervised Visual Representation Learning From Decentralized Data ICCV 2021
Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters ICLR 2021
Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition ACL 2021
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting AAAI 2021
A Sequence-to-Set Network for Nested Named Entity Recognition IJCAI 2021
Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition IJCNLP 2021
On Orthogonality Constraints for Transformers IJCNLP 2021
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition INTERSPEECH 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization INTERSPEECH 2021
Self-Instantiated Recurrent Units with Dynamic Soft Recursion NIPS 2021
Knowledge Router: Learning Disentangled Representations for Knowledge Graphs NAACL 2021
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks NIPS 2021
On Orthogonality Constraints for Transformers ACL 2021
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition INTERSPEECH 2020
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case ICML 2020
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition INTERSPEECH 2020
TRP: Trained Rank Pruning for Efficient Deep Neural Networks IJCAI 2020
Symmetric Metric Learning with Adaptive Margin for Recommendation AAAI 2020
Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks ACL 2019
Holographic Factorization Machines for Recommendation AAAI 2019
Understanding Straight-Through Estimator in Training Activation Quantized Neural Nets ICLR 2019
Quaternion Knowledge Graph Embeddings NIPS 2019
DeepRec: An Open-source Toolkit for Deep Learning based Recommendation IJCAI 2019
Quaternion Collaborative Filtering for Recommendation IJCAI 2019
A Tensorized Transformer for Language Modeling NIPS 2019
NeuRec: On Nonlinear Transformation for Personalized Ranking IJCAI 2018