Haosheng Zou
4 papers · 2019–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (4) π Academic Marathon (6) π Cross-Pollinator (6)
π
Renaissance Researcher
(5)
πΊοΈ
Taxonomy Completionist
(17)
Conferences
AAAI (1)
ACL (1)
EMNLP (1)
IJCAI (1)
Top co-authors
Keywords
reinforcement learning
(2)
multi-task learning
(1)
curriculum learning
(1)
mathematical reasoning
(1)
model distillation
(1)
chain-of-thought reasoning
(1)
policy learning
(1)
hierarchical reinforcement learning
(1)
model training
(1)
language model
(1)
task distribution
(1)
supervised fine-tuning
(1)
reward shaping
(1)
intrinsic reward
(1)
credit assignment
(1)
process supervision
(1)
long context
(1)
first-person shooter
(1)
option framework
(1)
chain of thought
(1)
Papers
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond
ACL 2025
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
EMNLP 2025
Learning Task-Distribution Reward Shaping with Meta-Learning
AAAI 2021
Playing FPS Games With Environment-Aware Hierarchical Reinforcement Learning
IJCAI 2019