Shujian Zhang

25 papers · 2020–2026 · 8 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌍 Conference Polyglot (8) 🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (7)

🗺️ Taxonomy Completionist (41) 🌍 Conference Polyglot (8) 🏃 Academic Marathon (5) 🤝 Dynamic Duo (13) 👑 Triple Crown 🧬 Topic Evolution 📈 Trend Setter ⚡ Prolific Year (7) 💎 Century Club (24) 🗃️ Keyword Collector (81) 🔥 Unstoppable (6)

Conferences

ICLR (5) ICML (5) NIPS (5) ACL (3) EMNLP (3) NAACL (2) CVPR (1) IJCNLP (1)

Top co-authors

Mingyuan Zhou (13) Chengyue Gong (8) Korawat Tanwisuth (5) Xinjie Fan (5) Xingchao Liu (4) Yihao Feng (4) Wenxuan Zhou (4) Shentao Yang (4) Huangjie Zheng (3) Eunsol Choi (3)

Keywords

offline reinforcement learning (2) preference optimization (2) diffusion model (2) stochastic attention (2) variational inference (2) text generation (2) instruction following (2) model-based reinforcement learning (2) uncertainty estimation (2) ordinary differential equation (2) adversarial robustness (2) reinforcement learning from human feedback (2) preference learning (1) prototype learning (1) policy optimization (1) multi-label learning (1) bayesian inference (1) probabilistic modeling (1) domain adaptation (1) data augmentation (1)

Papers

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe ACL 2026 Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models ICLR 2025 Statistical Advantages of Perturbing Cosine Router in Mixture of Experts ICLR 2025 T-REG: Preference Optimization with Token-Level Reward Regularization ACL 2025 Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy ICLR 2025 Sliced Wasserstein with Random-Path Projecting Directions ICML 2024 LanguageFlow: Advancing Diffusion Language Generation with Probabilistic Flows NAACL 2024 Switchable Decision: Dynamic Neural Generation Networks ICML 2024 WPO: Enhancing RLHF with Weighted Preference Optimization EMNLP 2024 Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems ICLR 2023 Preference-grounded Token-level Guidance for Language Model Fine-tuning NIPS 2023 FlowGrad: Controlling the Output of Generative ODEs With Gradients CVPR 2023 POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models ICML 2023 A Unified Framework for Alternating Offline Model Training and Policy Learning NIPS 2022 Passage-Mask: A Learnable Regularization Strategy for Retriever-Reader Models EMNLP 2022 ALLSH: Active Learning Guided by Local Sensitivity and Hardness NAACL 2022 Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning ICML 2022 Knowing More About Questions Can Help: Improving Calibration in Question Answering ACL 2021 Knowing More About Questions Can Help: Improving Calibration in Question Answering IJCNLP 2021 A Prototype-Oriented Framework for Unsupervised Domain Adaptation NIPS 2021 Contextual Dropout: An Efficient Sample-Dependent Dropout Module ICLR 2021 Learning with Different Amounts of Annotation: From Zero to Many Labels EMNLP 2021 Bayesian Attention Belief Networks ICML 2021 Alignment Attention by Matching Key and Query Distributions NIPS 2021 Bayesian Attention Modules NIPS 2020