conftrace_

← Optimization & Theory

Machine Learning › Optimization & Theory ›

Neural Network Optimization

3,648 papers

Papers per year

Papers

Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning EMNLP 2023

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models EMNLP 2023

SPT: Learning to Selectively Insert Prompts for Better Prompt Tuning EMNLP 2023

PAC-tuning: Fine-tuning Pre-trained Language Models with PAC-driven Perturbed Gradient Descent EMNLP 2023

Focus Your Attention (with Adaptive IIR Filters) EMNLP 2023

PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer EMNLP 2023

TLM: Token-Level Masking for Transformers EMNLP 2023

ATHENA: Mathematical Reasoning with Thought Expansion EMNLP 2023

Pretraining Without Attention EMNLP 2023

Approximating Two-Layer Feedforward Networks for Efficient Transformers EMNLP 2023

How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts EMNLP 2023

Exploring the Sensitivity of LLMs’ Decision-Making Capabilities: Insights from Prompt Variations and Hyperparameters EMNLP 2023

Segmented Recurrent Transformer: An Efficient Sequence-to-Sequence Model EMNLP 2023

A Spectral Viewpoint on Continual Relation Extraction EMNLP 2023

Incorporating Syntactic Knowledge into Pre-trained Language Model using Optimization for Overcoming Catastrophic Forgetting EMNLP 2023

TokenDrop + BucketSampler: Towards Efficient Padding-free Fine-tuning of Language Models EMNLP 2023

On Enhancing Fine-Tuning for Pre-trained Language Models EMNLP 2023

Results of WMT23 Metrics Shared Task: Metrics Might Be Guilty but References Are Not Innocent EMNLP 2023

Trained MT Metrics Learn to Cope with Machine-translated References EMNLP 2023

One Wide Feedforward Is All You Need EMNLP 2023

Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training ICCV 2023

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradient Accumulation ICCV 2023

Robust Object Modeling for Visual Tracking ICCV 2023

Fast Adversarial Training with Smooth Convergence ICCV 2023

Trajectory Unified Transformer for Pedestrian Trajectory Prediction ICCV 2023