Hanze Dong

22 papers · 2022–2025 · 10 top CS/AI venues

Achievements

🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (16) 👑 Triple Crown 💎 Century Club (22) ⚡ Prolific Year (12) 🗃️ Keyword Collector (91)

Conferences

ICLR (5) EMNLP (4) ICML (4) JMLR (2) NIPS (2) ACL (1) AISTATS (1) COLT (1) CVPR (1) NAACL (1)

Top co-authors

Tong Zhang (16) Jipeng Zhang (5) Xunpeng Huang (4) Rui Pan (4) Wei Xiong (4) Shizhe Diao (4) Yian Ma (3) Caiming Xiong (3) Renjie Pi (3) Doyen Sahoo (3)

Keywords

large language model (2) generative modeling (1) knowledge distillation (1) representation learning (1) reinforcement learning (1) feature learning (1) offline reinforcement learning (1) policy optimization (1) domain adaptation (1) bayesian inference (1) model calibration (1) object detection (1) preference learning (1) weakly supervised learning (1) langevin dynamics (1) data augmentation (1) multimodal learning (1) convergence analysis (1) distributed learning (1) stochastic optimization (1)

Papers

Automatic Curriculum Expert Iteration for Reliable LLM Reasoning · ICLR 2025
Reward-Guided Speculative Decoding for Efficient LLM Reasoning · ICML 2025
Offline Reinforcement Learning for LLM Multi-step Reasoning · ACL 2025
ThinK: Thinner Key Cache by Query-Driven Pruning · ICLR 2025
FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation · EMNLP 2024
MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance · EMNLP 2024
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models · NAACL 2024
Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference · NIPS 2024
Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo · COLT 2024
Mitigating the Alignment Tax of RLHF · EMNLP 2024
Reverse Diffusion Monte Carlo · ICLR 2024
Spurious Feature Diversification Improves Out-of-distribution Generalization · ICLR 2024
Faster Sampling via Stochastic Gradient Proximal Sampler · ICML 2024
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint · ICML 2024
PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium · JMLR 2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model · NIPS 2024
Particle-based Variational Inference with Preconditioned Functional Gradient Flow · ICLR 2023
Catalyst Acceleration of Error Compensated Methods Leads to Better Communication Complexity · AISTATS 2023
DetGPT: Detect What You Need via Reasoning · EMNLP 2023
Weakly Supervised Disentangled Generative Causal Representation Learning · JMLR 2022
Bayesian Invariant Risk Minimization · CVPR 2022
Local Augmentation for Graph Neural Networks · ICML 2022