Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Application Areas
Machine Learning
›
Application Areas
›
Data Augmentation
3622 directly classified papers
Papers per year
2002: 2
2006: 1
2008: 2
2009: 1
2011: 3
2012: 3
2013: 9
2014: 8
2015: 7
2016: 35
2017: 45
2018: 108
2019: 239
2020: 329
2021: 477
2022: 518
2023: 607
2024: 561
2025: 546
2026: 121
Papers
Boosting Sentiment Analysis in Persian through a GAN-Based Synthetic Data Augmentation Method
COLING 2025
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining
WACV 2025
EDGE: Efficient Data Selection for LLM Agents via Guideline Effectiveness
IJCAI 2025
SCAN: Bootstrapping Contrastive Pre-training for Data Efficiency
ICCV 2025
Augmented and Softened Matching for Unsupervised Visible-Infrared Person Re-Identification
ICCV 2025
ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis
ICCV 2025
KGCL: Knowledge-Enhanced Graph Contrastive Learning for Retrosynthesis Prediction Based on Molecular Graph Editing
IJCAI 2025
GAUDA: Generative Adaptive Uncertainty-Guided Diffusion-Based Augmentation for Surgical Segmentation
WACV 2025
Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models
WACV 2025
XPose: Towards Extreme Low Light Hand Pose Estimation
WACV 2025
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs
ACL 2025
Mat-Instructions: A Large-Scale Inorganic Material Instruction Dataset for Large Language Models
IJCAI 2025
Enhancing Unsupervised Sentence Embeddings via Knowledge-Driven Data Augmentation and Gaussian-Decayed Contrastive Learning
ACL 2025
SynDroneVision: A Synthetic Dataset for Image-Based Drone Detection
WACV 2025
From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration
CVPR 2025
Filter Images First, Generate Instructions Later: Pre-Instruction Data Selection for Visual Instruction Tuning
CVPR 2025
MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion
ACL 2025
SegBuilder: A Semi-Automatic Annotation Tool for Segmentation
WACV 2025
KCS: Diversify Multi-hop Question Generation with Knowledge Composition Sampling
EMNLP 2025
PriFold: Biological Priors Improve RNA Secondary Structure Predictions
AAAI 2025
Beyond Words: Augmenting Discriminative Richness via Diffusions in Unsupervised Prompt Learning
CVPR 2025
Synthetic Visual Genome
CVPR 2025
GRAID: Synthetic Data Generation with Geometric Constraints and Multi-Agentic Reflection for Harmful Content Detection
EMNLP 2025
Corrupted but Not Broken: Understanding and Mitigating the Negative Impacts of Corrupted Data in Visual Instruction Tuning
EMNLP 2025
TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability
EMNLP 2025
<
1
…
16
17
18
…
145
>