Kaito Ariu
17 papers · 2020–2025 · 9 top CS/AI conferences
Achievements
🐝 Cross-Pollinator (9)
🌉 Interdisciplinary Bridge
🌍 Conference Polyglot (9)
🏃 Academic Marathon (5)
🌈 Renaissance Researcher (7)
🤝 Dynamic Duo (10)
🏆 Grand Slam
💎 Century Club (17)
⚡ Prolific Year (7)
🗃️ Keyword Collector (51)
Conferences
ICML (6)
AAAI (2)
ACL (2)
AISTATS (2)
EMNLP (1)
ICLR (1)
IJCAI (1)
NAACL (1)
NIPS (1)
Keywords
nash equilibrium (4)
minimum bayes risk (3)
game theory (3)
regret bound (3)
zero-sum game (3)
language model alignment (2)
text generation (2)
learning dynamics (2)
machine translation (1)
utility optimization (1)
direct preference optimization (1)
reinforcement learning from human feedback (1)
model alignment (1)
bayesian inference (1)
text summarization (1)
preference learning (1)
multi-armed bandit (1)
lasso regression (1)
optimization algorithm (1)
multi-agent learning (1)
Papers
Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment
NAACL 2025
Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium
AAAI 2025
Theoretical Guarantees for Minimum Bayes Risk Decoding
ACL 2025
Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games
ICLR 2025
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
ICML 2025
On Universally Optimal Algorithms for A/B Testing
ICML 2024
Matroid Semi-Bandits in Sublinear Time
ICML 2024
Memory Asymmetry Creates Heteroclinic Orbits to Nash Equilibrium in Learning in Zero-Sum Games
AAAI 2024
Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
ACL 2024
Filtered Direct Preference Optimization
EMNLP 2024
Adaptively Perturbed Mirror Descent for Learning in Games
ICML 2024
Model-Based Minimum Bayes Risk Decoding for Text Generation
ICML 2024
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games
AISTATS 2023
Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium
IJCAI 2023
Thresholded Lasso Bandit
ICML 2022
Regret in Online Recommendation Systems
NIPS 2020
Optimal Algorithms for Multiplayer Multi-Armed Bandits
AISTATS 2020