Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Application Areas
Machine Learning
›
Application Areas
›
Data Augmentation
3622 directly classified papers
Papers per year
2002: 2
2006: 1
2008: 2
2009: 1
2011: 3
2012: 3
2013: 9
2014: 8
2015: 7
2016: 35
2017: 45
2018: 108
2019: 239
2020: 329
2021: 477
2022: 518
2023: 607
2024: 561
2025: 546
2026: 121
Papers
Targeted Augmentation for Low-Resource Event Extraction
NAACL 2024
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
ACL 2024
Simple and effective data augmentation for compositional generalization
NAACL 2024
NLP_STR_teamS at SemEval-2024 Task1: Semantic Textual Relatedness based on MASK Prediction and BERT Model
NAACL 2024
EPIC: Effective Prompting for Imbalanced-Class Data Synthesis in Tabular Data Classification via Large Language Models
NIPS 2024
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs
ACL 2024
Synonym relations affect object detection learned on vision-language data
NAACL 2024
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions
ACL 2024
Assemblage: Automatic Binary Dataset Construction for Machine Learning
NIPS 2024
Synthetic Data Generation for Low-resource Grammatical Error Correction with Tagged Corruption Models
NAACL 2024
When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels
NAACL 2024
Revealing the Two Sides of Data Augmentation: An Asymmetric Distillation-based Win-Win Solution for Open-Set Recognition
IJCAI 2024
CCSum: A Large-Scale and High-Quality Dataset for Abstractive News Summarization
NAACL 2024
Brandeis at VarDial 2024 DSL-ML Shared Task: Multilingual Models, Simple Baselines and Data Augmentation
NAACL 2024
Edu-ConvoKit: An Open-Source Library for Education Conversation Data
NAACL 2024
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
ACL 2024
Oasis: Data Curation and Assessment System for Pretraining of Large Language Models
IJCAI 2024
Tweak to Trust: Assessing the Reliability of Summarization Metrics in Contact Centers via Perturbed Summaries
NAACL 2024
ATLAS: A System for PDF-centric Human Interaction Data Collection
NAACL 2024
Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification
CVPR 2024
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis
NIPS 2024
Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs
EMNLP 2024
ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation
ACL 2024
Making LLMs as Fine-Grained Relation Extraction Data Augmentor
IJCAI 2024
DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness
NAACL 2024
<
1
…
31
32
33
…
145
>