Research Explorer
Data Augmentation
3622 directly classified papers
Papers per year
2002: 2
2006: 1
2008: 2
2009: 1
2011: 3
2012: 3
2013: 9
2014: 8
2015: 7
2016: 35
2017: 45
2018: 108
2019: 239
2020: 329
2021: 477
2022: 518
2023: 607
2024: 561
2025: 546
2026: 121
Papers
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
ACL 2024
Towards Cost-effective Multi-style Conversations: A Pilot Study in Task-oriented Dialogue Generation
COLING 2024
Deterministic Reversible Data Augmentation for Neural Machine Translation
ACL 2024
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
ACL 2024
Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning
ACL 2024
Refining and Synthesis: A Simple yet Effective Data Augmentation Framework for Cross-Domain Aspect-based Sentiment Analysis
ACL 2024
Improving Grammatical Error Correction via Contextual Data Augmentation
ACL 2024
Generating Harder Cross-document Event Coreference Resolution Datasets using Metaphoric Paraphrasing
ACL 2024
Step-by-Step: Controlling Arbitrary Style in Text with Large Language Models
COLING 2024
Empowering Large Language Models for Textual Data Augmentation
ACL 2024
Language Model Priors and Data Augmentation Strategies for Low-resource Machine Translation: A Case Study Using Finnish to Northern Sámi
ACL 2024
UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language Understanding
COLING 2024
Text Filtering Classifiers for Medium-Resource Languages
COLING 2024
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems
ACL 2024
NoteChat: A Dataset of Synthetic Patient-Physician Conversations Conditioned on Clinical Notes
ACL 2024
WikiSplit++: Easy Data Refinement for Split and Rephrase
COLING 2024
Consistency Training by Synthetic Question Generation for Conversational Question Answering
ACL 2024
Consistent Document-level Relation Extraction via Counterfactuals
EMNLP 2024
All You Need is Attention: Lightweight Attention-based Data Augmentation for Text Classification
EMNLP 2024
Learning “look-ahead” nonlocal traffic dynamics in a ring road
L4DC 2024
Interpretable data-driven model predictive control of building energy systems using SHAP
L4DC 2024
An investigation of time reversal symmetry in reinforcement learning
L4DC 2024
SpreadNaLa: A Naturalistic Code Generation Evaluation Dataset of Spreadsheet Formulas
COLING 2024
What should Baby Models read? Exploring Sample-Efficient Data Composition on Model Performance
CoNLL 2024
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
AISTATS 2024