conftrace_

Kwangjun Ahn

21 papers · 2018–2025 · 6 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+9 more ↓ 🐝 Cross-Pollinator (8) πŸƒ Academic Marathon (7) 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (5)
🌍 Conference Polyglot (6) πŸƒ Academic Marathon (7) 🧭 Keyword Pioneer πŸ‘‘ Triple Crown πŸ”₯ Unstoppable (6) ⚑ Prolific Year (5) πŸ’Ž Century Club (21) ❓ The Questioner πŸ—ƒοΈ Keyword Collector (74)

Conferences

NIPS (9) ICML (5) ICLR (3) COLT (2) JMLR (1) L4DC (1)

Papers

The Belief State Transformer ICLR 2025 General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization ICML 2025 Does SGD really happen in tiny subspaces? ICLR 2025 How to Escape Sharp Minima with Random Perturbations ICML 2024 Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise ICML 2024 Adam with model exponential moving average is effective for nonconvex optimization NIPS 2024 Linear attention is (maybe) all you need (to understand Transformer optimization) ICLR 2024 The Crucial Role of Normalization in Sharpness-Aware Minimization NIPS 2023 A Unified Approach to Controlling Implicit Regularization via Mirror Descent JMLR 2023 Model Predictive Control via On-Policy Imitation Learning L4DC 2023 Learning threshold neurons via edge of stability NIPS 2023 Transformers learn to implement preconditioned gradient descent for in-context learning NIPS 2023 Understanding the unstable convergence of gradient descent ICML 2022 Reproducibility in Optimization: Theoretical Framework and Limits NIPS 2022 Mirror Descent Maximizes Generalized Margin and Can Be Implemented Efficiently NIPS 2022 Agnostic Learnability of Halfspaces via Logistic Loss ICML 2022 Optimal dimension dependence of the Metropolis-Adjusted Langevin Algorithm COLT 2021 Efficient constrained sampling via the mirror-Langevin algorithm NIPS 2021 From Nesterov’s Estimate Sequence to Riemannian Acceleration COLT 2020 SGD with shuffling: optimal rates without component convexity and large epoch requirements NIPS 2020 Binary Rating Estimation with Graph Side Information NIPS 2018