Huaisheng Zhu
8 papers · 2023–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (6) π Renaissance Researcher (6) π Cross-Pollinator (15) πΊοΈ Taxonomy Completionist (18)
π§
Keyword Pioneer
β
The Questioner
Conferences
EMNLP (2)
NIPS (2)
ACL (1)
ICCV (1)
ICLR (1)
ICML (1)
Top co-authors
Keywords
large language model
(2)
reinforcement learning
(1)
imitation learning
(1)
policy optimization
(1)
direct preference optimization
(1)
self-supervised learning
(1)
preference optimization
(1)
text-to-image generation
(1)
instruction following
(1)
language model alignment
(1)
reinforcement learning from human feedback
(1)
model alignment
(1)
density ratio estimation
(1)
reward model
(1)
multimodal large language model
(1)
proximal policy optimization
(1)
reward shaping
(1)
jailbreak attack
(1)
human preference alignment
(1)
graph contrastive learning
(1)
Papers
Reinforcement Learning for Large Language Models via Group Preference Reward Shaping
EMNLP 2025
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
ICCV 2025
DSPO: Direct Score Preference Optimization for Diffusion Model Alignment
ICLR 2025
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective
EMNLP 2024
Efficient Contrastive Learning for Fast and Accurate Inference on Graphs
ICML 2024
Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment
NIPS 2024
Jailbreak Open-Sourced Large Language Models via Enforced Decoding
ACL 2024
Simple and Asymmetric Graph Contrastive Learning without Augmentations
NIPS 2023