conftrace_

Boyi Liu

16 papers · 2019–2025 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+8 more ↓ πŸƒ Academic Marathon (6) πŸŒ‰ Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (6)
πŸ—ΊοΈ Taxonomy Completionist (26) πŸŒ‰ Interdisciplinary Bridge 🧭 Keyword Pioneer πŸ‘‘ Triple Crown 🀝 Dynamic Duo (13) πŸ—ƒοΈ Keyword Collector (50) πŸ”₯ Unstoppable (5) πŸ’Ž Century Club (16)

Conferences

NIPS (6) ICML (4) ICLR (2) COLING (1) EMNLP (1) IJCAI (1) JMLR (1)

Papers

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning ICML 2025 Towards Database-Free Text-to-SQL Evaluation: A Graph-Based Metric for Functional Correctness COLING 2025 Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward EMNLP 2025 Reward-Augmented Data Enhances Direct Preference Alignment of LLMs ICML 2025 Let Models Speak Ciphers: Multiagent Debate through Embeddings ICLR 2024 Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer NIPS 2024 Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents ICML 2024 Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms NIPS 2023 Achieving Hierarchy-Free Approximation for Bilevel Programs with Equilibrium Constraints ICML 2023 Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning JMLR 2023 Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence NIPS 2022 Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL NIPS 2022 Dynamic Graph Learning Based on Hierarchical Memory for Origin-Destination Demand Prediction IJCAI 2022 BooVI: Provably Efficient Bootstrapped Value Iteration NIPS 2021 Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy ICLR 2019 Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy NIPS 2019