SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 39514000 of 8378 papers

TitleStatusHype
Unsupervised Data Augmentation for Aspect Based Sentiment Analysis0
An Augmented Benchmark Dataset for Geometric Question Answering through Dual Parallel Text Encoding0
ParaZh-22M: A Large-Scale Chinese Parabank via Machine Translation0
Addressing Limitations of Encoder-Decoder Based Approach to Text-to-SQL0
基于相似度进行句子选择的机器阅读理解数据增强(Machine reading comprehension data Augmentation for sentence selection based on similarity)0
Improving Event Temporal Relation Classification via Auxiliary Label-Aware Contrastive Learning0
MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation0
Automated segmentation of microvessels in intravascular OCT images using deep learning0
GraDA: Graph Generative Data Augmentation for Commonsense ReasoningCode0
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement LearningCode0
Using Knowledge Distillation to improve interpretable models in a retail banking context0
Domain Generalization -- A Causal Perspective0
Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods0
Augmentation BackdoorsCode0
Automatic Data Augmentation via Invariance-Constrained LearningCode0
Contrastive Unsupervised Learning of World Model with Invariant Causal Features0
Prompt-guided Scene Generation for 3D Zero-Shot Learning0
Named Entity Recognition in Industrial Tables using Tabular Language Models0
Strong Instance Segmentation Pipeline for MMSports ChallengeCode1
Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition0
Weighted Contrastive HashingCode0
Data Augmentation using Feature Generation for Volumetric Medical Images0
3D Rendering Framework for Data Augmentation in Optical Character Recognition0
Ani-GIFs: A benchmark dataset for domain generalization of action recognition from GIFs0
TaskMix: Data Augmentation for Meta-Learning of Spoken Intent Understanding0
On the Impact of Speech Recognition Errors in Passage Retrieval for Spoken Question AnsweringCode0
Contrastive learning for unsupervised medical image clustering and reconstruction0
A Simple Strategy to Provable Invariance via Orbit Mapping0
Towards Bridging the Space Domain Gap for Satellite Pose Estimation using Event Sensing0
SR-GCL: Session-Based Recommendation with Global Context Enhanced Augmentation in Contrastive Learning0
AcroFOD: An Adaptive Method for Cross-domain Few-shot Object DetectionCode1
StyleTime: Style Transfer for Synthetic Time Series Generation0
Semantically Consistent Data Augmentation for Neural Machine Translation via Conditional Masked Language ModelCode0
Scope of Pre-trained Language Models for Detecting Conflicting Health Information0
Automated detection of Alzheimer disease using MRI images and deep neural networks- A review0
NamedMask: Distilling Segmenters from Complementary Foundation ModelsCode1
DARTSRepair: Core-failure-set Guided DARTS for Network Robustness to Common Corruptions0
Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future DirectionsCode2
Improving Generalizability of Graph Anomaly Detection Models via Data AugmentationCode1
SynthA1c: Towards Clinically Interpretable Patient Representations for Diabetes Risk StratificationCode0
High-resolution synthesis of high-density breast mammograms: Application to improved fairness in deep learning based mass detectionCode1
Vega-MT: The JD Explore Academy Translation System for WMT22Code1
Improving GANs with A Dynamic Discriminator0
Exploring Inconsistent Knowledge Distillation for Object Detection with Data AugmentationCode0
Fairness in Face Presentation Attack DetectionCode0
The Geometry of Self-supervised Learning Models and its Impact on Transfer Learning0
Intrinsically Motivated Reinforcement Learning based Recommendation with Counterfactual Data Augmentation0
Can segmentation models be trained with fully synthetically generated data?0
Confidence-Guided Data Augmentation for Improved Semi-Supervised Training0
KaliCalib: A Framework for Basketball Court Registration0
Show:102550
← PrevPage 80 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified