SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 79017950 of 8378 papers

TitleStatusHype
Systematically designing better instance counting models on cell images with Neural Arithmetic Logic UnitsCode0
NIV-SSD: Neighbor IoU-Voting Single-Stage Object Detector From Point CloudCode0
Do Invariances in Deep Neural Networks Align with Human Perception?Code0
An Efficient and Layout-Independent Automatic License Plate Recognition System Based on the YOLO detectorCode0
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages StudyCode0
AENet: Learning Deep Audio Features for Video AnalysisCode0
NLPineers@ NLU of Devanagari Script Languages 2025: Hate Speech Detection using Ensembling of BERT-based modelsCode0
ConvBoost: Boosting ConvNets for Sensor-based Activity RecognitionCode0
Data Augmentation through Expert-guided Symmetry Detection to Improve Performance in Offline Reinforcement LearningCode0
Explanation-based Data Augmentation for Image ClassificationCode0
SCORE: Self-supervised Correspondence Fine-tuning for Improved Content RepresentationsCode0
Explaining Full-disk Deep Learning Model for Solar Flare Prediction using Attribution MethodsCode0
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact VerificationCode0
Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion ModelsCode0
An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment ClassificationCode0
SDA: Simple Discrete Augmentation for Contrastive Sentence Representation LearningCode0
Controllable User Dialogue Act Augmentation for Dialogue State TrackingCode0
Noise Stability Optimization for Finding Flat Minima: A Hessian-based Regularization ApproachCode0
Synthesizing Traffic Datasets using Graph Neural NetworksCode0
ANDA: A Novel Data Augmentation Technique Applied to Salient Object DetectionCode0
Explainable Deep Learning-based Solar Flare Prediction with post hoc Attention for Operational ForecastingCode0
Advising OpenMP Parallelization via a Graph-Based Approach with TransformersCode0
Anchor Data AugmentationCode0
Explainability of Deep Neural Networks for Brain Tumor DetectionCode0
Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection TasksCode0
Accept Synthetic Objects as Real: End-to-End Training of Attentive Deep Visuomotor Policies for Manipulation in ClutterCode0
Contrastive Learning with Prompt-derived Virtual Semantic Prototypes for Unsupervised Sentence EmbeddingCode0
Augmentor: An Image Augmentation Library for Machine LearningCode0
Second language Korean Universal Dependency treebank v1.2: Focus on data augmentation and annotation scheme refinementCode0
Contrastive Learning with Consistent RepresentationsCode0
Variable Skipping for Autoregressive Range Density EstimationCode0
Watch out Venomous Snake Species: A Solution to SnakeCLEF2023Code0
Contrastive Learning for Character Detection in Ancient Greek PapyriCode0
Consistency Regularization for Domain Generalization with Logit Attribution MatchingCode0
When Neural Networks Fail to Generalize? A Model Sensitivity PerspectiveCode0
Synthetic Data Augmentation for Enhancing Harmful Algal Bloom Detection with Machine LearningCode0
Understanding robustness and generalization of artificial neural networks through Fourier masksCode0
Nonparametric Bayesian Lomax delegate racing for survival analysis with competing risksCode0
A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram EmbeddingsCode0
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based AugmentationsCode0
Variational Autoencoder with Disentanglement Priors for Low-Resource Task-Specific Natural Language GenerationCode0
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity RepresentationCode0
GraphLearner: Graph Node Clustering with Fully Learnable AugmentationCode0
Segmentation Consistency Training: Out-of-Distribution Generalization for Medical Image SegmentationCode0
Nordic Vehicle Dataset (NVD): Performance of vehicle detectors using newly captured NVD from UAV in different snowy weather conditionsCode0
Normal-bundle BootstrapCode0
Segmentation of Hemorrhagic Areas in Human Brain from CT Scan ImagesCode0
Towards More Equitable Question Answering Systems: How Much More Data Do You Need?Code0
Towards More Sample Efficiency in Reinforcement Learning with Data AugmentationCode0
Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data AugmentationCode0
Show:102550
← PrevPage 159 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified