Research Explorer
Optimization
14207 directly classified papers
Papers per year
2001: 10
2002: 9
2003: 16
2004: 6
2005: 16
2006: 58
2007: 67
2008: 72
2009: 84
2010: 106
2011: 132
2012: 164
2013: 333
2014: 295
2015: 310
2016: 380
2017: 509
2018: 669
2019: 1072
2020: 1217
2021: 1489
2022: 1470
2023: 1746
2024: 1819
2025: 1567
2026: 591
Papers
Dynamic Scaling of Unit Tests for Code Reward Modeling
ACL 2025
P2 Law: Scaling Law for Post-Training After Model Pruning
ACL 2025
Fast Contiguous Somatic Hypermutations for Single-Objective Optimisation and Multi-Objective Optimisation Via Decomposition
AAAI 2025
Delay as Payoff in MAB
AAAI 2025
Learning to Reason from Feedback at Test-Time
ACL 2025
YuLan-Mini: Pushing the Limits of Open Data-efficient Language Model
ACL 2025
Multi-Label Ranking Loss Minimization for Matrix Completion
AAAI 2025
PIC: Unlocking Long-Form Text Generation Capabilities of Large Language Models via Position ID Compression
ACL 2025
Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
ACL 2025
Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent
AAAI 2025
Online and Streaming Algorithms for Constrained k-Submodular Maximization
AAAI 2025
Online MDP with Prototypes Information: A Robust Adaptive Approach
AAAI 2025
An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning
ACL 2025
Dynamic and Generalizable Process Reward Modeling
ACL 2025
PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models
ACL 2025
Taming LLMs with Gradient Grouping
ACL 2025
Pre-training Distillation for Large Language Models: A Design Space Exploration
ACL 2025
Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization
ACL 2025
Improving Deep Learning Speed and Performance Through Synaptic Neural Balance
AAAI 2025
Adversarial Training for Probabilistic Robustness
ICCV 2025
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
ACL 2025
Gradient-Based Nonlinear Rehearsal Learning with Multivariate Alterations
AAAI 2025
Universal Online Convex Optimization Meets Second-order Bounds
JMLR 2025
Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization
ICLR 2025
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation
ACL 2025