Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
data augmentation
3037 papers
Also known as
DA-MCMC
DADA
DA
TTA
ADA
EDA
Co-occurring keywords
text classification (6776)
large language model (12755)
transfer learning (5442)
semi-supervised learning (2331)
few-shot learning (3390)
domain adaptation (4578)
machine translation (2472)
image classification (1943)
language model (4573)
low-resource language (2234)
Papers
MANTA: A Scalable Pipeline for Transmuting Massive Web Corpora into Instruction Datasets
EMNLP 2025
ANVITA : A Multi-pronged Approach for Enhancing Machine Translation of Extremely Low-Resource Indian Languages
EMNLP 2025
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis
NAACL 2025
Systematic Knowledge Injection into Large Language Models via Diverse Augmentation for Domain-Specific RAG
NAACL 2025
Realistic Noise Synthesis with Diffusion Models
AAAI 2025
Benchmark Creation for Aspect-Based Sentiment Analysis in Low-Resource Odia Language and Evaluation through Fine-Tuning of Multilingual Models
COLING 2025
FiNE: Filtering and Improving Noisy Data Elaborately with Large Language Models
NAACL 2025
CUET_Novice@DravidianLangTech 2025: A Bi-GRU Approach for Multiclass Political Sentiment Analysis of Tamil Twitter (X) Comments
NAACL 2025
VIGFace: Virtual Identity Generation for Privacy-Free Face Recognition Dataset
ICCV 2025
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning
NAACL 2025
LLM-based Adversarial Dataset Augmentation for Automatic Media Bias Detection
NAACL 2025
Serving the Underserved: Leveraging BARTBahnar Language Model for Bahnaric-Vietnamese Translation
NAACL 2025
Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation
EMNLP 2025
Enhancing NER Performance in Low-Resource Pakistani Languages using Cross-Lingual Data Augmentation
NAACL 2025
Augmenting Sign Language Translation Datasets with Large Language Models
IJCNLP 2025
Representation Space Augmentation for Effective Self-Supervised Learning on Tabular Data
AAAI 2025
GASE: Generatively Augmented Sentence Encoding
EMNLP 2025
ZeLa: Advancing Zero-Shot Multilingual Semantic Parsing with Large Language Models and Chain-of-Thought Strategies
COLING 2024
Well Begun Is Half Done: An Implicitly Augmented Generative Framework with Distribution Modification for Hierarchical Text Classification
COLING 2024
Duration Dynamics: Fin-Turbo’s Rapid Route to ESG Impact Insight
COLING 2024
Towards Robust Evidence-Aware Fake News Detection via Improving Semantic Perception
COLING 2024
UniPSDA: Unsupervised Pseudo Semantic Data Augmentation for Zero-Shot Cross-Lingual Natural Language Understanding
COLING 2024
Advancing CSR Theme and Topic Classification: LLMs and Training Enhancement Insights
COLING 2024
Text Filtering Classifiers for Medium-Resource Languages
COLING 2024
STAGE: Simple Text Data Augmentation by Graph Exploration
COLING 2024