SHUAI ZHANG
97 papers · 2018–2026 · 17 top CS/AI conferences
Achievements
Taxonomy Completionist (17)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (6)
Hot Topic Early Bird
Dynamic Duo (11)
Triple Crown
Keyword Champion (2)
Grand Slam
Deep Specialist (14)
Topic Evolution
Prolific Year (13)
Unstoppable (8)
The Questioner (3)
Keyword Collector (347)
Century Club (88)
Trend Setter
Conference Pioneer
Conferences
AAAI (17)
ACL (17)
NIPS (12)
ICLR (9)
ICML (8)
IJCAI (7)
INTERSPEECH (7)
EMNLP (4)
NAACL (4)
CVPR (2)
ICCV (2)
IJCNLP (2)
MICCAI (2)
EACL (1)
CONLL (1)
AISTATS (1)
WACV (1)
Keywords
large language model (9)
attention mechanism (6)
model compression (6)
reinforcement learning (5)
contrastive learning (5)
federated learning (4)
sample complexity (4)
named entity recognition (4)
automatic speech recognition (4)
sentiment analysis (4)
deep learning (3)
knowledge distillation (3)
representation learning (3)
variational autoencoder (3)
transformer architecture (3)
machine translation (3)
data augmentation (3)
few-shot learning (3)
natural language processing (3)
speech recognition (3)
Papers
Beyond Examples: Towards Automated Thought-level In-Context Reasoning for Large Language Models
ACL 2026
LGSA: Label Geometry Structuring and Aligning for Hierarchical Text Classification
ACL 2026
From Imitation to Discrimination: Toward a Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks
AAAI 2026
AStar: Boosting Multimodal Reasoning with Automated Structured Thinking
AAAI 2026
Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
AAAI 2026
Efficient Table Retrieval and Understanding with Multimodal Large Language Models
EACL 2026
SPARK: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning
ACL 2026
ReFL: Reflective Feedback Learning for Hallucination Detection of Large Language Models
ACL 2026
Two-Stage Regularization-Based Structured Pruning for LLMs
ACL 2026
Iterative Substructure Extraction for Molecular Relational Learning with Interactive Graph Information Bottleneck
ICLR 2025
Prompt Tuning In a Compact Attribute Space
AAAI 2025
S²MILE: Semantic-and-Structure-Aware Music-Driven Lyric Generation
AAAI 2025
MalDetectFormer: Leveraging Sparse SpatioTemporal Information for Effective Malicious Traffic Detection
AAAI 2025
Code-switching Mediated Sentence-level Semantic Learning
AAAI 2025
AoI-MDP: An AoI Optimized Markov Decision Process Dedicated in the Underwater Task (Student Abstract)
AAAI 2025
ERFSL: An Efficient Reward Function Searcher via Large Language Models for Custom-Environment Multi-Objective Reinforcement Learning (Student Abstract)
AAAI 2025
UACOF: A USV-AUV Collaboration Framework for Underwater Tasks Under Extreme Sea Conditions (Student Abstract)
AAAI 2025
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
ICLR 2025
RadialRouter: Structured Representation for Efficient and Robust Large Language Models Routing
EMNLP 2025
PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms
ICLR 2025
Unlearning through Knowledge Overwriting: Reversible Federated Unlearning via Selective Sparse Adapter
CVPR 2025
Adapting to Online Distribution Shifts in Deep Learning: A Black-Box Approach
AISTATS 2025
Multi-level Relevance Document Identifier Learning for Generative Retrieval
ACL 2025
Pandora’s Box or Aladdin’s Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models
ACL 2025
RetrieverGuard: Empowering Information Retrieval to Combat LLM-Generated Misinformation
NAACL 2025
3D Acetabular Surface Reconstruction from 2D Pre-operative X-ray Images using SRVF Elastic Registration and Deformation Graph
MICCAI 2025
Sharpness-aware Zeroth-order Optimization for Graph Transformers
IJCAI 2025
Conformal Anomaly Detection in Event Sequences
ICML 2025
Bilateral Masking with prompt for Knowledge Graph Completion
NAACL 2024
Unraveling the Gradient Descent Dynamics of Transformers
NIPS 2024
MobileInst: Video Instance Segmentation on the Mobile
AAAI 2024
Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators
AAAI 2024
How to Trade Off the Quantity and Capacity of Teacher Ensemble: Learning Categorical Distribution to Stochastically Employ a Teacher for Distillation
AAAI 2024
CaMML: Context-Aware Multimodal Learner for Large Models
ACL 2024
MolTC: Towards Molecular Relational Modeling In Language Models
ACL 2024
Bridging Remote Sensors with Multisensor Geospatial Foundation Models
CVPR 2024
SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support
EMNLP 2024
Understanding the Therapeutic Relationship between Counselors and Clients in Online Text-based Counseling using LLMs
EMNLP 2024
Discovering Bias in Latent Space: An Unsupervised Debiasing Approach
ICML 2024
FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. Data
ICML 2024
Transferring Knowledge From Large Foundation Models to Small Downstream Models
ICML 2024
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
ICML 2024
Neural Jump-Diffusion Temporal Point Processes
ICML 2024
MMGNN: A Molecular Merged Graph Neural Network for Explainable Solvation Free Energy Prediction
IJCAI 2024
Gaussian Pancakes: Geometrically-Regularized 3D Gaussian Splatting for Realistic Endoscopic Reconstruction
MICCAI 2024
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving
NAACL 2024
Data Augmentation for Object Detection via Controllable Diffusion Models
WACV 2024
Understanding Client Reactions in Online Mental Health Counseling
ACL 2023
Rethinking Document-Level Relation Extraction: A Reality Check
ACL 2023
Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features
INTERSPEECH 2023
MAS: Towards Resource-Efficient Federated Multiple-Task Learning
ICCV 2023
Offline Imitation Learning with Variational Counterfactual Reasoning
NIPS 2023
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection
INTERSPEECH 2023
SKDBERT: Compressing BERT via Stochastic Knowledge Distillation
AAAI 2023
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks
ICML 2023
Data-Informed Geometric Space Selection
NIPS 2023
On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $\epsilon$-Greedy Exploration
NIPS 2023
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
NIPS 2023
Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning
NIPS 2023
Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks
ICLR 2023
Divergence-aware Federated Self-Supervised Learning
ICLR 2022
ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer
ACL 2022
Syntax-guided Contrastive Learning for Pre-trained Language Model
ACL 2022
Neural Methods for Logical Reasoning over Knowledge Graphs
ICLR 2022
Jump Self-attention: Capturing High-order Statistics in Transformers
NIPS 2022
AutoST: Towards the Universal Modeling of Spatio-temporal Sequences
NIPS 2022
Reducing Multilingual Context Confusion for End-to-End Code-Switching Automatic Speech Recognition
INTERSPEECH 2022
How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis
ICLR 2022
De-Bias for Generative Extraction in Unified NER Task
ACL 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP
EMNLP 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP
CONLL 2022
Collaborative Unsupervised Visual Representation Learning From Decentralized Data
ICCV 2021
Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
ICLR 2021
Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition
ACL 2021
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting
AAAI 2021
A Sequence-to-Set Network for Nested Named Entity Recognition
IJCAI 2021
Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition
IJCNLP 2021
On Orthogonality Constraints for Transformers
IJCNLP 2021
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition
INTERSPEECH 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
INTERSPEECH 2021
Self-Instantiated Recurrent Units with Dynamic Soft Recursion
NIPS 2021
Knowledge Router: Learning Disentangled Representations for Knowledge Graphs
NAACL 2021
Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks
NIPS 2021
On Orthogonality Constraints for Transformers
ACL 2021
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
INTERSPEECH 2020
Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case
ICML 2020
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
INTERSPEECH 2020
TRP: Trained Rank Pruning for Efficient Deep Neural Networks
IJCAI 2020
Symmetric Metric Learning with Adaptive Margin for Recommendation
AAAI 2020
Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks
ACL 2019
Holographic Factorization Machines for Recommendation
AAAI 2019
Understanding Straight-Through Estimator in Training Activation Quantized Neural Nets
ICLR 2019
Quaternion Knowledge Graph Embeddings
NIPS 2019
DeepRec: An Open-source Toolkit for Deep Learning based Recommendation
IJCAI 2019
Quaternion Collaborative Filtering for Recommendation
IJCAI 2019
A Tensorized Transformer for Language Modeling
NIPS 2019
NeuRec: On Nonlinear Transformation for Personalized Ranking
IJCAI 2018