← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3648 directly classified papers

Papers per year

Papers

Mamba YOLO: A Simple Baseline for Object Detection with State Space Model AAAI 2025

Neural Operators Can Play Dynamic Stackelberg Games JMLR 2025

Revisiting Gradient Normalization and Clipping for Nonconvex SGD under Heavy-Tailed Noise: Necessity, Sufficiency, and Acceleration JMLR 2025

Robust and Adaptive AI Models for Medication Usage Forecasting Using ICD-9/10 Code (Student Abstract) AAAI 2025

Reducing Divergence in Batch Normalization for Domain Adaptation AAAI 2025

Continual Gradient Low-Rank Projection Fine-Tuning for LLMs ACL 2025

Smart-Searcher: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning EMNLP 2025

Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction EMNLP 2025

Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism EMNLP 2025

Layer Duplication in LLMs EMNLP 2025

Learning Physics Informed Neural ODEs with Partial Measurements AAAI 2025

Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability EMNLP 2025

Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key CVPR 2025

Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult JMLR 2025

Global Convergence of Adjoint-Optimized Neural PDEs JMLR 2025

MossNet: Mixture of State-Space Experts is a Multi-Head Attention AACL 2025

Optimizing RLHF Training for Large Language Models with Stage Fusion NSDI 2025

Curved Worlds, Clear Boundaries: Generalizing Speech Deepfake Detection using Hyperbolic and Spherical Geometry Spaces AACL 2025

Enhancing Chain-of-Thought Reasoning with Critical Representation Fine-tuning ACL 2025

YuLan-Mini: Pushing the Limits of Open Data-efficient Language Model ACL 2025

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling ACL 2025

TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge ACL 2025

LESA: Learnable LLM Layer Scaling-Up ACL 2025

Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up ACL 2025

YNU-HPCC at SemEval-2025 Task 6: Using BERT Model with R-drop for Promise Verification SEMEVAL 2025