Sashank Reddi
23 papers · 2015–2023 · 4 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (4)
Academic Marathon (8)
Keyword Pioneer
Interdisciplinary Bridge
Cross-Pollinator (12)
Renaissance Researcher (7)
Taxonomy Completionist (42)
Dynamic Duo (16)
Triple Crown
Deep Specialist (11)
Unstoppable (6)
Century Club (23)
Keyword Collector (94)
The Questioner (4)
Prolific Year (7)
Conferences
ICML (9)
NIPS (8)
AISTATS (3)
ICLR (3)
Research topics
Keywords
federated learning (4)
knowledge distillation (3)
adaptive gradient method (3)
representation learning (2)
information retrieval (2)
attention mechanism (2)
negative sampling (2)
embedding dimension (2)
stochastic gradient descent (2)
stochastic optimization (2)
client drift (2)
control variate (2)
natural language processing (1)
bayesian inference (1)
graph laplacian (1)
transformer architecture (1)
binary classification (1)
composite optimization (1)
non-convex optimization (1)
convergence analysis (1)
Papers
What is the Inductive Bias of Flatness Regularization? A Study of Deep Matrix Factorization Models
NIPS 2023
In defense of dual-encoders for neural ranking
ICML 2022
Robust Training of Neural Networks Using Scale Invariant Architectures
ICML 2022
Private Adaptive Optimization with Side information
ICML 2022
Breaking the centralized barrier for cross-device federated learning
NIPS 2021
RankDistil: Knowledge Distillation for Ranking
AISTATS 2021
Federated Composite Optimization
ICML 2021
Disentangling Sampling and Labeling Bias for Learning in Large-output Spaces
ICML 2021
A statistical perspective on distillation
ICML 2021
Efficient Training of Retrieval Models using Negative Cache
NIPS 2021
Learning to Learn by Zeroth-Order Oracle
ICLR 2020
O(n) Connections are Expressive Enough: Universal Approximability of Sparse Transformers
NIPS 2020
Why are Adaptive Methods Good for Attention Models?
NIPS 2020
Are Transformers universal approximators of sequence-to-sequence functions?
ICLR 2020
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
ICLR 2020
Low-Rank Bottleneck in Multi-head Attention Models
ICML 2020
SCAFFOLD: Stochastic Controlled Averaging for Federated Learning
ICML 2020
Escaping Saddle Points with Adaptive Gradient Methods
ICML 2019
Multilabel reductions: what is my loss optimising?
NIPS 2019
Breaking the Glass Ceiling for Embedding-Based Classifiers for Large Output Spaces
NIPS 2019
Adaptive Methods for Nonconvex Optimization
NIPS 2018
A Generic Approach for Escaping Saddle points
AISTATS 2018
On the High Dimensional Power of a Linear-Time Two Sample Test under Mean-shift Alternatives
AISTATS 2015