Hanze Dong
22 papers · 2022–2025 · 10 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (11)
Keyword Pioneer
Renaissance Researcher (5)
Interdisciplinary Bridge
Hot Topic Early Bird
Dynamic Duo (16)
Triple Crown
Century Club (22)
Prolific Year (12)
Keyword Collector (91)
Conferences
ICLR (5)
EMNLP (4)
ICML (4)
JMLR (2)
NeurIPS (2)
ACL (1)
AISTATS (1)
COLT (1)
CVPR (1)
NAACL (1)
Keywords
large language model (2)
generative modeling (1)
knowledge distillation (1)
representation learning (1)
reinforcement learning (1)
feature learning (1)
offline reinforcement learning (1)
policy optimization (1)
domain adaptation (1)
bayesian inference (1)
model calibration (1)
object detection (1)
preference learning (1)
weakly supervised learning (1)
langevin dynamics (1)
data augmentation (1)
multimodal learning (1)
convergence analysis (1)
distributed learning (1)
stochastic optimization (1)
Papers
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning
ICLR 2025
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
ICML 2025
Offline Reinforcement Learning for LLM Multi-step Reasoning
ACL 2025
ThinK: Thinner Key Cache by Query-Driven Pruning
ICLR 2025
FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation
EMNLP 2024
MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance
EMNLP 2024
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
NAACL 2024
Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference
NeurIPS 2024
Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo
COLT 2024
Mitigating the Alignment Tax of RLHF
EMNLP 2024
Reverse Diffusion Monte Carlo
ICLR 2024
Spurious Feature Diversification Improves Out-of-distribution Generalization
ICLR 2024
Faster Sampling via Stochastic Gradient Proximal Sampler
ICML 2024
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint
ICML 2024
PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium
JMLR 2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
NeurIPS 2024
Particle-based Variational Inference with Preconditioned Functional Gradient Flow
ICLR 2023
Catalyst Acceleration of Error Compensated Methods Leads to Better Communication Complexity
AISTATS 2023
DetGPT: Detect What You Need via Reasoning
EMNLP 2023
Weakly Supervised Disentangled Generative Causal Representation Learning
JMLR 2022
Bayesian Invariant Risk Minimization
CVPR 2022
Local Augmentation for Graph Neural Networks
ICML 2022