conftrace
_
Papers
Trends
Conferences
Explore
More
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Learning Types
Deep Learning
›
Learning Types
›
Pre-Training
23 papers
Papers per year
2021: 2
2
2022: 6
6
2023: 3
3
2024: 1
1
2026: 11
11
Papers
MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings
ACL 2026
TiKMiX: Efficient Semi-Dynamic Data Mixture via Data Influence for LLM Pre-training
ACL 2026
Language Acquisition Device in Large Language Models
ACL 2026
Perplexity-Aware Data Scaling Law: Perplexity Landscapes Predict Performance for Continual Pre-training
ACL 2026
KoCo: Conditioning Language Model Pre-training on Knowledge Coordinates
ACL 2026
Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge Retention in Language Model Pre-Training
ACL 2026
Demystifying Data Organization for Enhanced LLM Training
ACL 2026
The Role of Mixed-Language Documents for Multilingual Large Language Model Pretraining
ACL 2026
SciPedia: Unlocking the Value of Scientific Data for Pre-training
ACL 2026
Is a Document Educational or Just Wikipedia-Style? — Pitfalls of Classifier-Based Quality Filtering
ACL 2026
Practical Guidelines for Model Merging in LLMs Pre-Training
ACL 2026
Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training
EMNLP 2024
Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt
CVPR 2023
Exploring Graph Pre-training for Aspect-based Sentiment Analysis
EMNLP 2023
CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction
EMNLP 2023
GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-supervised Learning and Explicit Policy Injection
AAAI 2022
DEEP: DEnoising Entity Pre-training for Neural Machine Translation
ACL 2022
Scheduled Multi-task Learning for Neural Chat Translation
ACL 2022
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding
ACL 2022
ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples
EMNLP 2022
STAR: SQL Guided Pre-Training for Context-dependent Text-to-SQL Parsing
EMNLP 2022
Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models
ACL 2021
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
ACL 2021
<
1
>