Optimization

14207 directly classified papers

Papers

Dynamic Scaling of Unit Tests for Code Reward Modeling ACL 2025

P2 Law: Scaling Law for Post-Training After Model Pruning ACL 2025

Fast Contiguous Somatic Hypermutations for Single-Objective Optimisation and Multi-Objective Optimisation Via Decomposition AAAI 2025

Delay as Payoff in MAB AAAI 2025

Learning to Reason from Feedback at Test-Time ACL 2025

YuLan-Mini: Pushing the Limits of Open Data-efficient Language Model ACL 2025

Multi-Label Ranking Loss Minimization for Matrix Completion AAAI 2025

PIC: Unlocking Long-Form Text Generation Capabilities of Large Language Models via Position ID Compression ACL 2025

Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention ACL 2025

Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent AAAI 2025

Online and Streaming Algorithms for Constrained k-Submodular Maximization AAAI 2025

Online MDP with Prototypes Information: A Robust Adaptive Approach AAAI 2025

An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning ACL 2025

Dynamic and Generalizable Process Reward Modeling ACL 2025

PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models ACL 2025

Taming LLMs with Gradient Grouping ACL 2025

Pre-training Distillation for Large Language Models: A Design Space Exploration ACL 2025

Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization ACL 2025

Improving Deep Learning Speed and Performance Through Synaptic Neural Balance AAAI 2025

Adversarial Training for Probabilistic Robustness ICCV 2025

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework ACL 2025

Gradient-Based Nonlinear Rehearsal Learning with Multivariate Alterations AAAI 2025

Universal Online Convex Optimization Meets Second-order Bounds JMLR 2025

Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization ICLR 2025

RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation ACL 2025