Research Explorer
Data Augmentation
3622 directly classified papers
Papers per year
2002: 2
2006: 1
2008: 2
2009: 1
2011: 3
2012: 3
2013: 9
2014: 8
2015: 7
2016: 35
2017: 45
2018: 108
2019: 239
2020: 329
2021: 477
2022: 518
2023: 607
2024: 561
2025: 546
2026: 121
Papers
A Taxonomy for Data Contamination in Large Language Models
ACL 2024
UTRad-NLP at #SMM4H 2024: Why LLM-Generated Texts Fail to Improve Text Classification Models
ACL 2024
Croissant: A Metadata Format for ML-Ready Datasets
NIPS 2024
Investigating the productivity of Passamaquoddy medials: A computational approach
EACL 2024
The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data
INTERSPEECH 2024
EEVEE: An Easy Annotation Tool for Natural Language Processing
EACL 2024
Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection
ACL 2024
Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning
ACL 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
ACL 2024
A Comprehensive Augmentation Framework for Anomaly Detection
AAAI 2024
Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges
ACL 2024
A New Dataset for Tonal and Segmental Dialectometry from the Yue- and Pinghua-Speaking Area
EACL 2024
Gene-Gene Relationship Modeling Based on Genetic Evidence for Single-Cell RNA-Seq Data Imputation
NIPS 2024
GENDEX: Generative Data Augmentation Strategy Leveraging External Data for Abstractive Dialogue Summarization
ACL 2024
Better Synthetic Data by Retrieving and Transforming Existing Datasets
ACL 2024
KUL@SMM4H2024: Optimizing Text Classification with Quality-Assured Augmentation Strategies
ACL 2024
KGAST: From Knowledge Graphs to Annotated Synthetic Texts
ACL 2024
Generating Harder Cross-document Event Coreference Resolution Datasets using Metaphoric Paraphrasing
ACL 2024
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators
ACL 2024
Consistency Training by Synthetic Question Generation for Conversational Question Answering
ACL 2024
Generative Data Augmentation using LLMs improves Distributional Robustness in Question Answering
EACL 2024
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
ACL 2024
Guidance-Based Prompt Data Augmentation in Specialized Domains for Named Entity Recognition
ACL 2024
Quilt: Robust Data Segment Selection against Concept Drifts
AAAI 2024
Limitations of Face Image Generation
AAAI 2024