Boyi Liu
16 papers · 2019–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Academic Marathon (6) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (7) π Cross-Pollinator (6)
πΊοΈ
Taxonomy Completionist
(26)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π
Triple Crown
π€
Dynamic Duo
(13)
ποΈ
Keyword Collector
(50)
π₯
Unstoppable
(5)
π
Century Club
(16)
Conferences
NIPS (6)
ICML (4)
ICLR (2)
COLING (1)
EMNLP (1)
IJCAI (1)
JMLR (1)
Top co-authors
Keywords
model-based reinforcement learning
(3)
neural network
(2)
policy gradient
(2)
graph matching
(2)
function approximation
(1)
reinforcement learning from human feedback
(1)
game theory
(1)
generalization bound
(1)
graph representation learning
(1)
global convergence
(1)
primal-dual optimization
(1)
language model alignment
(1)
constrained reinforcement learning
(1)
relational reasoning
(1)
markov decision process
(1)
direct preference optimization
(1)
mirror descent
(1)
bilevel optimization
(1)
fenchel duality
(1)
reinforcement learning
(1)
Papers
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning
ICML 2025
Towards Database-Free Text-to-SQL Evaluation: A Graph-Based Metric for Functional Correctness
COLING 2025
Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward
EMNLP 2025
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
ICML 2025
Let Models Speak Ciphers: Multiagent Debate through Embeddings
ICLR 2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
NIPS 2024
Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents
ICML 2024
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
NIPS 2023
Achieving Hierarchy-Free Approximation for Bilevel Programs with Equilibrium Constraints
ICML 2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
JMLR 2023
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence
NIPS 2022
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
NIPS 2022
Dynamic Graph Learning Based on Hierarchical Memory for Origin-Destination Demand Prediction
IJCAI 2022
BooVI: Provably Efficient Bootstrapped Value Iteration
NIPS 2021
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy
ICLR 2019
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy
NIPS 2019