SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 35513600 of 8378 papers

TitleStatusHype
DISCO: Distilling Counterfactuals with Large Language ModelsCode1
Rumour detection using graph neural network and oversampling in benchmark Twitter dataset0
An Augmentation Strategy for Visually Rich Documents0
On the Applicability of Synthetic Data for Re-IdentificationCode0
On-the-fly Denoising for Data Augmentation in Natural Language UnderstandingCode0
Data Augmentation on Graphs: A Technical SurveyCode1
Discrete Point-wise Attack Is Not Enough: Generalized Manifold Adversarial Attack for Face RecognitionCode1
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based AugmentationsCode0
StyleFlow: Disentangle Latent Representations via Normalizing Flow for Unsupervised Text Style Transfer0
Predicting Ejection Fraction from Chest X-rays Using Computer Vision for Diagnosing Heart Failure0
Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised LearningCode1
APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning0
Synthetic Data Augmentation Using GAN For Improved Automated Visual Inspection0
Sentence-level Feedback Generation for English Language Learners: Does Data Augmentation Help?0
PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment0
Balanced Split: A new train-test data splitting strategy for imbalanced datasetsCode0
AugTriever: Unsupervised Dense Retrieval and Domain Adaptation by Scalable Data AugmentationCode0
Human Image Generation: A Comprehensive Survey0
Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization0
Uniform Sequence Better: Time Interval Aware Data Augmentation for Sequential RecommendationCode1
ColorSense: A Study on Color Vision in Machine Visual Recognition0
Check-worthy Claim Detection across Topics for Automated Fact-checking0
Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image ClassificationCode0
The effects of gender bias in word embeddings on depression prediction0
Multi-VALUE: A Framework for Cross-Dialectal English NLP0
SBSS: Stacking-Based Semantic Segmentation Framework for Very High Resolution Remote Sensing Image0
The Effects of Character-Level Data Augmentation on Style-Based Dating of Historical ManuscriptsCode0
Urban Scene Semantic Segmentation with Low-Cost Coarse Annotation0
DeepJoin: Joinable Table Discovery with Pre-trained Language Models0
A Critical Appraisal of Data Augmentation Methods for Imaging-Based Medical Diagnosis Applications0
Calibration-Free Driver Drowsiness Classification based on Manifold-Level AugmentationCode0
Build-a-Bot: Teaching Conversational AI Using a Transformer-Based Intent Recognition and Question Answering Architecture0
Generative Robust Classification0
MA-GCL: Model Augmentation Tricks for Graph Contrastive LearningCode1
Robust Policy Optimization in Deep Reinforcement LearningCode0
SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation0
Population Template-Based Brain Graph Augmentation for Improving One-Shot Learning Classification0
A Machine Learning Enhanced Approach for Automated Sunquake Detection in Acoustic Emission Maps0
CropCat: Data Augmentation for Smoothing the Feature Distribution of EEG Signals0
Improving Depression estimation from facial videos with face alignment, training optimization and scheduling0
Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis0
Zero-Shot Accent Conversion using Pseudo Siamese Disentanglement Network0
Robust and Explainable Identification of Logical Fallacies in Natural Language ArgumentsCode1
RPN: A Word Vector Level Data Augmentation Algorithm in Deep Learning for Language UnderstandingCode0
On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch BaselineCode1
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation LearningCode1
Teaching What You Should Teach: A Data-Based Distillation Method0
End-to-End Speech Translation of Arabic to English Broadcast News0
Cap2Aug: Caption guided Image to Image data Augmentation0
Towards Scale Balanced 6-DoF Grasp Detection in Cluttered ScenesCode1
Show:102550
← PrevPage 72 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified