SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 15511600 of 8378 papers

TitleStatusHype
Data Augmentation for Intent Classification with Off-the-shelf Large Language ModelsCode1
Data Augmentation for Abstractive Query-Focused Multi-Document SummarizationCode1
Rethinking Pre-training and Self-trainingCode1
CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentationCode1
Rethinking the Effect of Data Augmentation in Adversarial Contrastive LearningCode1
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of ViewCode1
Cross-Domain Adaptive Teacher for Object DetectionCode1
Cloud and Cloud Shadow Segmentation for Remote Sensing Imagery via Filtered Jaccard Loss Function and Parametric AugmentationCode1
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training StagesCode1
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling TaskCode1
RFBoost: Understanding and Boosting Deep WiFi Sensing via Physical Data AugmentationCode1
RGB no more: Minimally-decoded JPEG Vision TransformersCode1
RigorLLM: Resilient Guardrails for Large Language Models against Undesired ContentCode1
Data Augmentation for Cross-Domain Named Entity RecognitionCode1
CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate SpeechCode1
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic EnvironmentsCode1
ContrastCAD: Contrastive Learning-based Representation Learning for Computer-Aided Design ModelsCode1
Data Augmentation for Deep Candlestick LearnerCode1
Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice ConversionCode1
Robust Hybrid Learning With Expert AugmentationCode1
RobustNav: Towards Benchmarking Robustness in Embodied NavigationCode1
Robustness Testing of Language Understanding in Task-Oriented DialogCode1
RockNER: A Simple Method to Create Adversarial Examples for Evaluating the Robustness of Named Entity Recognition ModelsCode1
CNN-generated images are surprisingly easy to spot... for nowCode1
Data augmentation for learning predictive models on EEG: a systematic comparisonCode1
Cross-modality Data Augmentation for End-to-End Sign Language TranslationCode1
ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRICode1
RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via DiffusionCode1
Data Augmentation using Pre-trained Transformer ModelsCode1
Rotation-Invariant Transformer for Point Cloud MatchingCode1
A Fourier-based Framework for Domain GeneralizationCode1
Roto-Translation Covariant Convolutional Networks for Medical Image AnalysisCode1
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and GenerationCode1
Cross-modulated Few-shot Image Generation for Colorectal Tissue ClassificationCode1
Deep Data Augmentation for Weed Recognition Enhancement: A Diffusion Probabilistic Model and Transfer Learning Based ApproachCode1
CodeIt: Self-Improving Language Models with Prioritized Hindsight ReplayCode1
SageMix: Saliency-Guided Mixup for Point CloudsCode1
Direct Differentiable Augmentation SearchCode1
SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better RegularizationCode1
Sample-adaptive Augmentation for Point Cloud Recognition Against Real-world CorruptionsCode1
A Simple Graph Contrastive Learning Framework for Short Text ClassificationCode1
Injecting Numerical Reasoning Skills into Language ModelsCode1
Scale-aware Automatic Augmentation for Object DetectionCode1
Scale-wise Convolution for Image RestorationCode1
Regularizing Deep Networks with Semantic Data AugmentationCode1
A Unified Gradient Regularization Family for Adversarial Examples0
A Multi-level Supervised Contrastive Learning Framework for Low-Resource Natural Language Inference0
A Unified Framework for Generative Data Augmentation: A Comprehensive Survey0
Addressing degeneracies in latent interpolation for diffusion models0
Accurate pedestrian localization in overhead depth images via Height-Augmented HOG0
Show:102550
← PrevPage 32 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified