conftrace_

Zhanhui Zhou

9 papers · 2024–2026 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌍 Conference Polyglot (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (20) 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (13) ⚡ Prolific Year (7)

Conferences

ACL (5) EMNLP (1) ICML (1) NAACL (1) NIPS (1)

Top co-authors

Chao Yang (6) Yu Qiao (5) Jie Liu (5) Zhichen Dong (4) Wanli Ouyang (4) Jiaheng Liu (3) Zhixuan Liu (3) Bo Zheng (2) Tiezheng Ge (2) Xingyuan Bu (2)

Keywords

large language model (6) harmful content (2) fine-grained evaluation (2) instruction following (2) mathematical reasoning (1) direct preference optimization (1) benchmark evaluation (1) preference optimization (1) reward modeling (1) language modeling (1) model merging (1) language model alignment (1) dialogue state tracking (1) model alignment (1) safety alignment (1) text generation (1) value function (1) greedy search (1) multi-objective optimization (1) adversarial learning (1)

Papers

dLLM: Simple Diffusion Language Modeling ACL 2026 Emergent Response Planning in LLMs ICML 2025 Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! ACL 2024 ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models ACL 2024 Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models NIPS 2024 Inference-Time Language Model Alignment via Integrated Value Guidance EMNLP 2024 Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey NAACL 2024 Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ACL 2024 MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues ACL 2024