Yuanzhao Zhai
6 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (3) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (15) π§ Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
AAAI (3)
ACL (2)
ICML (1)
Top co-authors
Keywords
large language model
(3)
reinforcement learning
(2)
offline reinforcement learning
(1)
policy optimization
(1)
preference learning
(1)
preference optimization
(1)
out-of-distribution generalization
(1)
markov decision process
(1)
model alignment
(1)
model-based reinforcement learning
(1)
monte carlo tree search
(1)
human feedback
(1)
influence function
(1)
model collapse
(1)
policy regularization
(1)
ai alignment
(1)
citation network
(1)
hypothesis generation
(1)
semantic compatibility
(1)
multi-agent system
(1)
Papers
EvoNarrator: Modeling Scientific Evolution for Feasible Hypothesis Generation
ACL 2026
Correcting Large Language Model Behavior via Influence Function
AAAI 2025
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
AAAI 2025
COPR: Continual Human Preference Learning via Optimal Policy Regularization
ACL 2025
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
AAAI 2024
Iterative Regularized Policy Optimization with Imperfect Demonstrations
ICML 2024