conftrace_

Haosheng Zou

4 papers · 2019–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🏃 Academic Marathon (6) 🐝 Cross-Pollinator (6)

🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (17)

Conferences

AAAI (1) ACL (1) EMNLP (1) IJCAI (1)

Top co-authors

Hang Su (2) Xiangzheng Zhang (2) Dong Yan (2) Jun Zhu (2) Xiaowei Lv (1) Lifu Tang (1) Junchen Liu (1) XIN HE (1) Qi An (1) Zhenyu Duan (1)

Keywords

reinforcement learning (2) multi-task learning (1) curriculum learning (1) mathematical reasoning (1) model distillation (1) chain-of-thought reasoning (1) policy learning (1) hierarchical reinforcement learning (1) model training (1) language model (1) task distribution (1) supervised fine-tuning (1) reward shaping (1) intrinsic reward (1) credit assignment (1) process supervision (1) long context (1) first-person shooter (1) option framework (1) chain of thought (1)

Papers

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond ACL 2025 Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision EMNLP 2025 Learning Task-Distribution Reward Shaping with Meta-Learning AAAI 2021 Playing FPS Games With Environment-Aware Hierarchical Reinforcement Learning IJCAI 2019