Research Explorer
Data Augmentation
3622 directly classified papers
Papers per year
2002: 2
2006: 1
2008: 2
2009: 1
2011: 3
2012: 3
2013: 9
2014: 8
2015: 7
2016: 35
2017: 45
2018: 108
2019: 239
2020: 329
2021: 477
2022: 518
2023: 607
2024: 561
2025: 546
2026: 121
Papers
Diagram-Driven Course Questions Generation
EMNLP 2025
ALADAN at IWSLT25 Low-resource Arabic Dialectal Speech Translation Task
ACL 2025
Decoupled Diffusion Sparks Adaptive Scene Generation
ICCV 2025
An empirical study of validating synthetic data for formula generation
NAACL 2025
MAIN: Mutual Alignment Is Necessary for instruction tuning
EMNLP 2025
LLM-based Adversarial Dataset Augmentation for Automatic Media Bias Detection
NAACL 2025
Penalizing Boundary Activation for Object Completeness in Diffusion Models
ICCV 2025
Adversarial Data Augmentation for Single Domain Generalization via Lyapunov Exponent-Guided Optimization
ICCV 2025
Language Models as Continuous Self-Evolving Data Engineers
EMNLP 2025
Overcoming Data Scarcity in Named Entity Recognition: Synthetic Data Generation with Large Language Models
ACL 2025
Dynamic Jointly Batch Selection for Data Efficient Machine Translation Fine-Tuning
EMNLP 2025
PolyNorm: Few-Shot LLM-Based Text Normalization for Text-to-Speech
EMNLP 2025
Enhancing Low-Resource Text Classification with LLM-Generated Corpora : A Case Study on Olfactory Reference Extraction
IJCNLP 2025
Thapar Titan/s : Fine-Tuning Pretrained Language Models with Contextual Augmentation for Mistake Identification in Tutor–Student Dialogues
ACL 2025
Split-and-Combine: Enhancing Style Augmentation for Single Domain Generalization
ICCV 2025
TutorMind at BEA 2025 Shared Task: Leveraging Fine-Tuned LLMs and Data Augmentation for Mistake Identification
ACL 2025
Borrowing Eyes for the Blind Spot: Overcoming Data Scarcity in Malicious Video Detection via Cross-Domain Retrieval Augmentation
ICCV 2025
GRAID: Synthetic Data Generation with Geometric Constraints and Multi-Agentic Reflection for Harmful Content Detection
EMNLP 2025
CrowdAgent: Multi-Agent Managed Multi-Source Annotation System
EMNLP 2025
PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions
ICCV 2025
MetninOzU at BioLaySumm2025: Text Summarization with Reverse Data Augmentation and Injecting Salient Sentences
ACL 2025
SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains
EMNLP 2025
Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
AAAI 2025
UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets
EMNLP 2025
xiacui at SemEval-2025 Task 11: Addressing Data Imbalance in Transformer-Based Multi-Label Emotion Detection with Weighted Loss
SEMEVAL 2025