Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3648 directly classified papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
Mamba YOLO: A Simple Baseline for Object Detection with State Space Model
AAAI 2025
Neural Operators Can Play Dynamic Stackelberg Games
JMLR 2025
Revisiting Gradient Normalization and Clipping for Nonconvex SGD under Heavy-Tailed Noise: Necessity, Sufficiency, and Acceleration
JMLR 2025
Robust and Adaptive AI Models for Medication Usage Forecasting Using ICD-9/10 Code (Student Abstract)
AAAI 2025
Reducing Divergence in Batch Normalization for Domain Adaptation
AAAI 2025
Continual Gradient Low-Rank Projection Fine-Tuning for LLMs
ACL 2025
Smart-Searcher: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
EMNLP 2025
Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction
EMNLP 2025
Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism
EMNLP 2025
Layer Duplication in LLMs
EMNLP 2025
Learning Physics Informed Neural ODEs with Partial Measurements
AAAI 2025
Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability
EMNLP 2025
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
CVPR 2025
Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult
JMLR 2025
Global Convergence of Adjoint-Optimized Neural PDEs
JMLR 2025
MossNet: Mixture of State-Space Experts is a Multi-Head Attention
AACL 2025
Optimizing RLHF Training for Large Language Models with Stage Fusion
NSDI 2025
Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces
AACL 2025
Enhancing Chain-of-Thought Reasoning with Critical Representation Fine-tuning
ACL 2025
YuLan-Mini: Pushing the Limits of Open Data-efficient Language Model
ACL 2025
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling
ACL 2025
TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge
ACL 2025
LESA: Learnable LLM Layer Scaling-Up
ACL 2025
Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up
ACL 2025
YNU-HPCC at SemEval-2025 Task 6: Using BERT Model with R-drop for Promise Verification
SEMEVAL 2025
<
1
…
12
13
14
…
146
>