Zhanhui Zhou
9 papers · 2024–2026 · 5 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (5)
Interdisciplinary Bridge
Keyword Pioneer
Taxonomy Completionist (20)
Hot Topic Early Bird
Cross-Pollinator (13)
Prolific Year (7)
Conferences
ACL (5)
EMNLP (1)
ICML (1)
NAACL (1)
NIPS (1)
Keywords
large language model (6)
harmful content (2)
fine-grained evaluation (2)
instruction following (2)
mathematical reasoning (1)
direct preference optimization (1)
benchmark evaluation (1)
preference optimization (1)
reward modeling (1)
language modeling (1)
model merging (1)
language model alignment (1)
dialogue state tracking (1)
model alignment (1)
safety alignment (1)
text generation (1)
value function (1)
greedy search (1)
multi-objective optimization (1)
adversarial learning (1)
Papers
dLLM: Simple Diffusion Language Modeling
ACL 2026
Emergent Response Planning in LLMs
ICML 2025
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
ACL 2024
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
ACL 2024
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
NIPS 2024
Inference-Time Language Model Alignment via Integrated Value Guidance
EMNLP 2024
Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
NAACL 2024
Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
ACL 2024
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
ACL 2024