Shujian Zhang
25 papers · 2020–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Conference Polyglot (8) π Academic Marathon (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (7)
πΊοΈ
Taxonomy Completionist
(41)
π
Conference Polyglot
(8)
π
Academic Marathon
(5)
π€
Dynamic Duo
(13)
π
Triple Crown
π§¬
Topic Evolution
π
Trend Setter
β‘
Prolific Year
(7)
π
Century Club
(24)
ποΈ
Keyword Collector
(81)
π₯
Unstoppable
(6)
Conferences
ICLR (5)
ICML (5)
NIPS (5)
ACL (3)
EMNLP (3)
NAACL (2)
CVPR (1)
IJCNLP (1)
Top co-authors
Keywords
offline reinforcement learning
(2)
preference optimization
(2)
diffusion model
(2)
stochastic attention
(2)
variational inference
(2)
text generation
(2)
instruction following
(2)
model-based reinforcement learning
(2)
uncertainty estimation
(2)
ordinary differential equation
(2)
adversarial robustness
(2)
reinforcement learning from human feedback
(2)
preference learning
(1)
prototype learning
(1)
policy optimization
(1)
multi-label learning
(1)
bayesian inference
(1)
probabilistic modeling
(1)
domain adaptation
(1)
data augmentation
(1)
Papers
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
ACL 2026
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
ICLR 2025
Statistical Advantages of Perturbing Cosine Router in Mixture of Experts
ICLR 2025
T-REG: Preference Optimization with Token-Level Reward Regularization
ACL 2025
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
ICLR 2025
Sliced Wasserstein with Random-Path Projecting Directions
ICML 2024
LanguageFlow: Advancing Diffusion Language Generation with Probabilistic Flows
NAACL 2024
Switchable Decision: Dynamic Neural Generation Networks
ICML 2024
WPO: Enhancing RLHF with Weighted Preference Optimization
EMNLP 2024
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems
ICLR 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
NIPS 2023
FlowGrad: Controlling the Output of Generative ODEs With Gradients
CVPR 2023
POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models
ICML 2023
A Unified Framework for Alternating Offline Model Training and Policy Learning
NIPS 2022
Passage-Mask: A Learnable Regularization Strategy for Retriever-Reader Models
EMNLP 2022
ALLSH: Active Learning Guided by Local Sensitivity and Hardness
NAACL 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
ICML 2022
Knowing More About Questions Can Help: Improving Calibration in Question Answering
ACL 2021
Knowing More About Questions Can Help: Improving Calibration in Question Answering
IJCNLP 2021
A Prototype-Oriented Framework for Unsupervised Domain Adaptation
NIPS 2021
Contextual Dropout: An Efficient Sample-Dependent Dropout Module
ICLR 2021
Learning with Different Amounts of Annotation: From Zero to Many Labels
EMNLP 2021
Bayesian Attention Belief Networks
ICML 2021
Alignment Attention by Matching Key and Query Distributions
NIPS 2021
Bayesian Attention Modules
NIPS 2020