Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Application Areas
Machine Learning
›
Application Areas
›
Data Augmentation
3622 directly classified papers
Papers per year
2002: 2
2006: 1
2008: 2
2009: 1
2011: 3
2012: 3
2013: 9
2014: 8
2015: 7
2016: 35
2017: 45
2018: 108
2019: 239
2020: 329
2021: 477
2022: 518
2023: 607
2024: 561
2025: 546
2026: 121
Papers
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
AISTATS 2024
Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
ACL 2024
Improving Cross-Lingual CSR Classification Using Pretrained Transformers with Variable Selection Networks and Data Augmentation
COLING 2024
Better Synthetic Data by Retrieving and Transforming Existing Datasets
ACL 2024
Computational Methods for the Analysis of Complementizer Variability in Language and Literature: The Case of Hebrew “she-” and “ki”
EMNLP 2024
Ctyun AI at BioLaySumm: Enhancing Lay Summaries of Biomedical Articles Through Large Language Models and Data Augmentation
ACL 2024
Revisiting Interpolation Augmentation for Speech-to-Text Generation
ACL 2024
Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation
COLING 2024
Evaluating the Potential of Language-family-specific Generative Models for Low-resource Data Augmentation: A Faroese Case Study
COLING 2024
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution
NAACL 2024
Claim-Centric and Sentiment Guided Graph Attention Network for Rumour Detection
COLING 2024
DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models
CVPR 2024
Quantifying the Impact of Disfluency on Spoken Content Summarization
COLING 2024
The Role of Data Curation in Image Captioning
EACL 2024
MAmmoTH2: Scaling Instructions from the Web
NIPS 2024
Saliency-Aware Interpolative Augmentation for Multimodal Financial Prediction
COLING 2024
Resource Acquisition for Understudied Languages: Extracting Wordlists from Dictionaries for Computer-assisted Language Comparison
COLING 2024
Counterfactual User Sequence Synthesis Augmented with Continuous Time Dynamic Preference Modeling for Sequential POI Recommendation
IJCAI 2024
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
NIPS 2024
Managing Fine-grained Metadata for Text Bases in Extremely Low Resource Languages: The Cases of Two Regional Languages of France
COLING 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
ACL 2024
EgoGen: An Egocentric Synthetic Data Generator
CVPR 2024
Topic-Controllable Summarization: Topic-Aware Evaluation and Transformer Methods
COLING 2024
Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges
ACL 2024
GENDEX: Generative Data Augmentation Strategy Leveraging External Data for Abstractive Dialogue Summarization
ACL 2024
<
1
…
48
49
50
…
145
>