Vaishnavh Nagarajan
17 papers · 2017–2025 · 6 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Taxonomy Completionist (19)
Conference Polyglot (6)
Academic Marathon (8)
Keyword Pioneer
Unstoppable (5)
Conference Pioneer
Century Club (17)
The Questioner
Conferences
ICLR (7)
NIPS (4)
AISTATS (2)
ICML (2)
ALT (1)
COLT (1)
Keywords
gradient descent (3)
generalization bound (2)
implicit bias (2)
sample complexity (1)
deep learning (1)
neural network optimization (1)
nearest neighbor (1)
uniform convergence (1)
agnostic learning (1)
equilibrium analysis (1)
lifelong learning (1)
adversarial perturbation (1)
decision tree (1)
generative adversarial network (1)
overparameterized network (1)
stochastic dynamics (1)
residual learning (1)
algorithm configuration (1)
safe exploration (1)
adversarial risk (1)
Papers
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
ICML 2025
Think before you speak: Training Language Models With Pause Tokens
ICLR 2024
The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory before In-context Learning
ICLR 2024
Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
ICLR 2024
The Pitfalls of Next-Token Prediction
ICML 2024
ResMem: Learn what you can and memorize the rest
NIPS 2023
On student-teacher deviations in distillation: does it pay to disobey?
NIPS 2023
Assessing Generalization of SGD via Disagreement
ICLR 2022
A Learning Theoretic Perspective on Local Explainability
ICLR 2021
Provably Safe PAC-MDP Exploration Using Analogies
AISTATS 2021
Understanding the failure modes of out-of-distribution generalization
ICLR 2021
Deterministic PAC-Bayesian generalization bounds for deep networks via generalizing noise-resilience
ICLR 2019
Uniform convergence may be unable to explain generalization in deep learning
NIPS 2019
Revisiting Adversarial Risk
AISTATS 2019
Gradient descent GAN optimization is locally stable
NIPS 2017
Learning-Theoretic Foundations of Algorithm Configuration for Combinatorial Partitioning Problems
COLT 2017
Lifelong Learning in Costly Feature Spaces
ALT 2017