Renqing He
3 papers · 2026–2026 · 1 conference · across top CS/AI conferences
Conferences
ACL (3)
Top co-authors
Keywords
reinforcement learning
(1)
chain-of-thought reasoning
(1)
singular value decomposition
(1)
geometric structure
(1)
low-rank adaptation
(1)
supervised fine-tuning
(1)
credit assignment
(1)
memory management
(1)
large language model
(1)
dense reward
(1)
probabilistic flow
(1)
reinforcement learning with verifiable reward
(1)
reward attribution
(1)
flow-guided decoding
(1)
step-wise information gain
(1)