Xingzhou Lou
5 papers · 2023–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(3)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(11)
π§
Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
AAAI (2)
ACL (1)
ICML (1)
NIPS (1)
Top co-authors
Keywords
policy gradient
(2)
policy optimization
(2)
multi-agent system
(2)
mathematical reasoning
(1)
preference alignment
(1)
preference optimization
(1)
reinforcement learning from human feedback
(1)
language model
(1)
partner modeling
(1)
human preference alignment
(1)
centralized critic
(1)
large language model
(1)
multi-agent policy gradient
(1)
human-ai coordination
(1)
agent topology
(1)
implicit reward modeling
(1)
multi-dimensional preference
(1)
centralized-decentralized mismatch
(1)
coalition utility
(1)
zero-shot learning
(1)
Papers
Calibration-Aware Policy Optimization for Reasoning LLMs
ACL 2026
Sequential Preference Optimization: Multi-Dimensional Preference Alignment with Implicit Reward Modeling
AAAI 2025
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
AAAI 2024
Position: Foundation Agents as the Paradigm Shift for Decision Making
ICML 2024
An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination
NIPS 2023