conftrace_

Kaito Ariu

17 papers · 2020–2025 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+7 more ↓

🐝 Cross-Pollinator (9) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (7)

🌍 Conference Polyglot (9) 🏃 Academic Marathon (5) 🤝 Dynamic Duo (10) 🏆 Grand Slam 💎 Century Club (17) ⚡ Prolific Year (7) 🗃️ Keyword Collector (51)

Conferences

ICML (6) AAAI (2) ACL (2) AISTATS (2) EMNLP (1) ICLR (1) IJCAI (1) NAACL (1) NIPS (1)

Top co-authors

Kenshi Abe (10) Alexandre Proutiere (5) Yuu Jinnai (5) Mitsuki Sakamoto (4) Tetsuro Morimura (4) Yuma Fujimoto (3) Atsushi Iwasaki (3) Po-An Wang (2) Se-Young Yun (2) Eiji Uchibe (1)

Keywords

nash equilibrium (4) minimum bayes risk (3) game theory (3) regret bound (3) zero-sum game (3) language model alignment (2) text generation (2) learning dynamics (2) machine translation (1) utility optimization (1) direct preference optimization (1) reinforcement learning from human feedback (1) model alignment (1) bayesian inference (1) text summarization (1) preference learning (1) multi-armed bandit (1) lasso regression (1) optimization algorithm (1) multi-agent learning (1)

Papers

Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment NAACL 2025 Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium AAAI 2025 Theoretical Guarantees for Minimum Bayes Risk Decoding ACL 2025 Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games ICLR 2025 Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model ICML 2025 On Universally Optimal Algorithms for A/B Testing ICML 2024 Matroid Semi-Bandits in Sublinear Time ICML 2024 Memory Asymmetry Creates Heteroclinic Orbits to Nash Equilibrium in Learning in Zero-Sum Games AAAI 2024 Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding ACL 2024 Filtered Direct Preference Optimization EMNLP 2024 Adaptively Perturbed Mirror Descent for Learning in Games ICML 2024 Model-Based Minimum Bayes Risk Decoding for Text Generation ICML 2024 Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games AISTATS 2023 Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium IJCAI 2023 Thresholded Lasso Bandit ICML 2022 Regret in Online Recommendation Systems NIPS 2020 Optimal Algorithms for Multiplayer Multi-Armed Bandits AISTATS 2020