Research Explorer
Optimization
1638 directly classified papers
Papers per year
2006: 5
2007: 2
2008: 4
2009: 2
2010: 2
2011: 3
2012: 8
2013: 25
2014: 19
2015: 22
2016: 31
2017: 42
2018: 68
2019: 104
2020: 148
2021: 174
2022: 178
2023: 209
2024: 345
2025: 244
2026: 3
Papers
Accelerating Relative Entropy Coding with Space Partitioning
NIPS 2024
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
COLING 2024
CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending
ACL 2024
IR-CM: The Fast and General-purpose Image Restoration Method Based on Consistency Model
NIPS 2024
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
NIPS 2024
FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection
ACL 2024
MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence
NIPS 2024
Boosted Conformal Prediction Intervals
NIPS 2024
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
NIPS 2024
Optimized Speculative Sampling for GPU Hardware Accelerators
EMNLP 2024
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
NIPS 2024
Iterate Averaging in the Quest for Best Test Error
JMLR 2024
SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization
NIPS 2024
Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling
NIPS 2024
Understanding Entropic Regularization in GANs
JMLR 2024
On the Hyperparameters in Stochastic Gradient Descent with Momentum
JMLR 2024
Faster Randomized Methods for Orthogonality Constrained Problems
JMLR 2024
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
JMLR 2024
PGMax: Factor Graphs for Discrete Probabilistic Graphical Models and Loopy Belief Propagation in JAX
JMLR 2024
Countering the Communication Bottleneck in Federated Learning: A Highly Efficient Zero-Order Optimization Technique
JMLR 2024
Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective
ACL 2024
Integrating GNN and Neural ODEs for Estimating Non-Reciprocal Two-Body Interactions in Mixed-Species Collective Motion
NIPS 2024
ADOPT: Modified Adam Can Converge with Any $\beta_2$ with the Optimal Rate
NIPS 2024
SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training
ACL 2024
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
ACL 2024