← Optimization & Theory

Deep Learning › Optimization & Theory ›

Optimization

1638 directly classified papers

Papers per year

Papers

Accelerating Relative Entropy Coding with Space Partitioning NIPS 2024

Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives COLING 2024

CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending ACL 2024

IR-CM: The Fast and General-purpose Image Restoration Method Based on Consistency Model NIPS 2024

SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention NIPS 2024

FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection ACL 2024

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence NIPS 2024

Boosted Conformal Prediction Intervals NIPS 2024

Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate NIPS 2024

Optimized Speculative Sampling for GPU Hardware Accelerators EMNLP 2024

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization NIPS 2024

Iterate Averaging in the Quest for Best Test Error JMLR 2024

SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization NIPS 2024

Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling NIPS 2024

Understanding Entropic Regularization in GANs JMLR 2024

On the Hyperparameters in Stochastic Gradient Descent with Momentum JMLR 2024

Faster Randomized Methods for Orthogonality Constrained Problems JMLR 2024

White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? JMLR 2024

PGMax: Factor Graphs for Discrete Probabilistic Graphical Models and Loopy Belief Propagation in JAX JMLR 2024

Countering the Communication Bottleneck in Federated Learning: A Highly Efficient Zero-Order Optimization Technique JMLR 2024

Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective ACL 2024

Integrating GNN and Neural ODEs for Estimating Non-Reciprocal Two-Body Interactions in Mixed-Species Collective Motion NIPS 2024

ADOPT: Modified Adam Can Converge with Any $\beta_2$ with the Optimal Rate NIPS 2024

SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training ACL 2024

CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers ACL 2024