conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Optimization & Theory
Machine Learning
›
Optimization & Theory
›
Neural Network Optimization
3,648 papers
Papers per year
2001: 1
2003: 1
2005: 2
2006: 3
2007: 6
2008: 1
2009: 7
2010: 5
2011: 7
2012: 9
2013: 17
2014: 18
2015: 40
2016: 76
2017: 113
2018: 214
2019: 324
2020: 414
2021: 489
2022: 445
2023: 524
2024: 469
2025: 386
2026: 77
Papers
On-the-fly Denoising for Data Augmentation in Natural Language Understanding
EACL 2024
Correcting Language Model Outputs by Editing Salient Layers
EACL 2024
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
EMNLP 2024
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
EMNLP 2024
LongEmbed: Extending Embedding Models for Long Context Retrieval
EMNLP 2024
Model Balancing Helps Low-data Training and Fine-tuning
EMNLP 2024
From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning
EMNLP 2024
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling
EMNLP 2024
PSC: Extending Context Window of Large Language Models via Phase Shift Calibration
EMNLP 2024
Focused Large Language Models are Stable Many-Shot Learners
EMNLP 2024
Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation
EMNLP 2024
Fast Forwarding Low-Rank Training
EMNLP 2024
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting
EMNLP 2024
Stable Language Model Pre-training by Reducing Embedding Variability
EMNLP 2024
The Mystery of the Pathological Path-star Task for Language Models
EMNLP 2024
A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
EMNLP 2024
SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers
EMNLP 2024
Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment
EMNLP 2024
SRF: Enhancing Document-Level Relation Extraction with a Novel Secondary Reasoning Framework
EMNLP 2024
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training
EMNLP 2024
Quantum Recurrent Architectures for Text Classification
EMNLP 2024
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy
EMNLP 2024
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models
EMNLP 2024
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
EMNLP 2024
Memory-Efficient Fine-Tuning of Transformers via Token Selection
EMNLP 2024
<
1
…
32
33
34
…
146
>