efficient training

51 papers

Co-occurring keywords

model compression (3283), efficient computing (779), large language model (12755), neural network optimization (1293), transfer learning (5442), language model (4573), memory optimization (114), vision transformer (1091), attention mechanism (3975), data augmentation (3037)

Papers

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones ICCV 2023

Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training AAAI 2023

Self-Supervised Dataset Pruning for Efficient Training in Audio Anti-spoofing INTERSPEECH 2023

Data-Efficient French Language Modeling with CamemBERTa ACL 2023

No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models NIPS 2023

Automated Progressive Learning for Efficient Training of Vision Transformers CVPR 2022

NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework ICML 2022

Efficient yet Competitive Speech Translation: FBK@IWSLT2022 ACL 2022

Tempo: Accelerating Transformer-Based Model Training through Memory Footprint Reduction NIPS 2022

Efficient Training of Neural Transducer for Speech Recognition INTERSPEECH 2022

Early-Bird GCNs: Graph-Network Co-optimization towards More Efficient GCN Training and Inference via Drawing Early-Bird Lottery Tickets AAAI 2022

An Efficient Training Approach for Very Large Scale Face Recognition CVPR 2022

Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training NIPS 2022

Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation ACL 2021

EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets IJCNLP 2021

How far can we get with one GPU in 100 hours? CoAStaL at MultiIndicMT Shared Task ACL 2021

GRAD-MATCH: Gradient Matching based Data Subset Selection for Efficient Deep Model Training ICML 2021

Blocking-based Neighbor Sampling for Large-scale Graph Neural Networks IJCAI 2021

Improving Privacy Guarantee and Efficiency of Latent Dirichlet Allocation Model Training Under Differential Privacy EMNLP 2021

Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping NIPS 2020

High-contrast “gaudy” images improve the training of deep neural network models of visual cortex NIPS 2020

L2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks CVPR 2020

FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training NIPS 2020

Dynamic Sentence Sampling for Efficient Training of Neural Machine Translation ACL 2018

Bag of Tricks for Efficient Text Classification EACL 2017