SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 24262450 of 8378 papers

TitleStatusHype
Diffusion-based Data Augmentation for Nuclei Image SegmentationCode1
Intent Contrastive Learning with Cross Subsequences for Sequential RecommendationCode1
Text generation for dataset augmentation in security classification tasksCode1
PromptMix: A Class Boundary Augmentation Method for Large Language Model DistillationCode1
Toward Generative Data Augmentation for Traffic Classification0
Filling the Missing: Exploring Generative AI for Enhanced Federated Learning over Heterogeneous Mobile Edge Devices0
Data-Free Knowledge Distillation Using Adversarially Perturbed OpenGL Shader Images0
DIG-MILP: a Deep Instance Generator for Mixed-Integer Linear Programming with Feasibility GuaranteeCode1
A Quality-based Syntactic Template Retriever for Syntactically-controlled Paraphrase GenerationCode0
GraphGPT: Graph Instruction Tuning for Large Language ModelsCode2
A Car Model Identification System for Streamlining the Automobile Sales Process0
A Distributed Approach to Meteorological Predictions: Addressing Data Imbalance in Precipitation Prediction Models through Federated Learning and GANs0
Unsupervised Candidate Answer Extraction through Differentiable Masker-Reconstructor Model0
Data Augmentations for Improved (Large) Language Model Generalization0
OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution ShiftCode1
EmoDiarize: Speaker Diarization and Emotion Identification from Speech Signals using Convolutional Neural Networks0
DiffBoost: Enhancing Medical Image Segmentation via Text-Guided Diffusion ModelCode1
AUC-mixup: Deep AUC Maximization with Mixup0
MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error CorrectionCode1
CLARA: Multilingual Contrastive Learning for Audio Representation AcquisitionCode1
Enhancing Spoofing Speech Detection Using Rhythm Information0
DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification0
Panoptic Out-of-Distribution Segmentation0
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation0
Self-supervision meets kernel graph neural models: From architecture to augmentations0
Show:102550
← PrevPage 98 of 336Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified