conftrace_

Huaisheng Zhu

8 papers · 2023–2025 · 6 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (6) 🐝 Cross-Pollinator (15) 🗺️ Taxonomy Completionist (18)

🧭 Keyword Pioneer ❓ The Questioner

Conferences

EMNLP (2) NIPS (2) ACL (1) ICCV (1) ICLR (1) ICML (1)

Top co-authors

Teng Xiao (6) Vasant G Honavar (4) Zhimeng Guo (3) Yige Yuan (2) Mingxiao Li (2) Shijie Zhou (2) Hangfan Zhang (2) Suhang Wang (2) Ruiyi Zhang (1) Jian Chen (1)

Keywords

large language model (2) reinforcement learning (1) imitation learning (1) policy optimization (1) direct preference optimization (1) self-supervised learning (1) preference optimization (1) text-to-image generation (1) instruction following (1) language model alignment (1) reinforcement learning from human feedback (1) model alignment (1) density ratio estimation (1) reward model (1) multimodal large language model (1) proximal policy optimization (1) reward shaping (1) jailbreak attack (1) human preference alignment (1) graph contrastive learning (1)

Papers

Reinforcement Learning for Large Language Models via Group Preference Reward Shaping EMNLP 2025 Multimodal LLMs as Customized Reward Models for Text-to-Image Generation ICCV 2025 DSPO: Direct Score Preference Optimization for Diffusion Model Alignment ICLR 2025 How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective EMNLP 2024 Efficient Contrastive Learning for Fast and Accurate Inference on Graphs ICML 2024 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment NIPS 2024 Jailbreak Open-Sourced Large Language Models via Enforced Decoding ACL 2024 Simple and Asymmetric Graph Contrastive Learning without Augmentations NIPS 2023