SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 18511900 of 8378 papers

TitleStatusHype
Learning the Difference that Makes a Difference with Counterfactually-Augmented DataCode0
Improving the Robustness of Question Answering Systems to Question ParaphrasingCode0
Audiogmenter: a MATLAB Toolbox for Audio Data AugmentationCode0
Learning to Compose Domain-Specific Transformations for Data AugmentationCode0
Improving Systematic Generalization Through Modularity and AugmentationCode0
Improving Socratic Question Generation using Data Augmentation and Preference OptimizationCode0
Learning to Recombine and Resample Data for Compositional GeneralizationCode0
Improving singing voice separation with the Wave-U-Net using Minimum Hyperspherical EnergyCode0
A Lightweight Privacy-Preserving Scheme Using Label-based Pixel Block Mixing for Image Classification in Deep LearningCode0
CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation ProcessingCode0
Aggression Identification Using Deep Learning and Data AugmentationCode0
Learning Tree-Structured Composition of Data AugmentationCode0
Improving Skeleton-based Action Recognition with Interactive Object InformationCode0
A Byte Sequence is Worth an Image: CNN for File Fragment Classification Using Bit Shift and n-Gram EmbeddingsCode0
Cross-dataset COVID-19 Transfer Learning with Cough Detection, Cough Segmentation, and Data AugmentationCode0
Scaling Up Single Image Dehazing Algorithm by Cross-Data Vision Alignment for Richer Representation Learning and BeyondCode0
A Survey of Data Synthesis ApproachesCode0
Improving satellite imagery segmentation using multiple Sentinel-2 revisitsCode0
Cross-Domain Face Synthesis using a Controllable GANCode0
Constructing Multiple Tasks for Augmentation: Improving Neural Image Classification With K-means FeaturesCode0
Improving SSVEP BCI Spellers With Data Augmentation and Language ModelsCode0
IMSurReal Too: IMS in the Surface Realization Shared Task 2020Code0
Constructing Contrastive samples via Summarization for Text Classification with limited annotationsCode0
Improving Reading Comprehension Question Generation with Data Augmentation and Overgenerate-and-rankCode0
Leveraging Content and Context Cues for Low-Light Image EnhancementCode0
Leveraging Data Augmentation for Process Information ExtractionCode0
Consistency Training by Synthetic Question Generation for Conversational Question AnsweringCode0
Improving Novelty Detection using the Reconstructions of Nearest NeighboursCode0
Improving Robustness by Augmenting Training Sentences with Predicate-Argument StructuresCode0
Consistency of augmentation graph and network approximability in contrastive learningCode0
Improving Neural Machine Translation Robustness via Data Augmentation: Beyond Back-TranslationCode0
Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic ParsingCode0
Improving LSTM-CTC based ASR performance in domains with limited training dataCode0
Combining Data Generation and Active Learning for Low-Resource Question AnsweringCode0
Improving Neural Networks for Time Series Forecasting using Data Augmentation and AutoMLCode0
Cross-Lingual Text Classification of Transliterated Hindi and MalayalamCode0
Improving robustness to corruptions with multiplicative weight perturbationsCode0
Improving Generalization for Multimodal Fake News DetectionCode0
Conjugate Bayesian Two-step Change Point Detection for Hawkes ProcessCode0
A Geometry-Sensitive Approach for Photographic Style ClassificationCode0
Augmentation BackdoorsCode0
Improving Grammatical Error Correction via Contextual Data AugmentationCode0
A little goes a long way: Improving toxic language classification despite data scarcityCode0
Cross-modal tumor segmentation using generative blending augmentation and self trainingCode0
15,500 Seconds: Lean UAV Classification Leveraging PEFT and Pre-Trained NetworksCode0
Improving In-Context Learning with Reasoning DistillationCode0
Improving Robustness via Tilted Exponential Layer: A Communication-Theoretic PerspectiveCode0
Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data AugmentationCode0
Improving Robustness by Enhancing Weak SubnetsCode0
A Generative Model of Symmetry TransformationsCode0
Show:102550
← PrevPage 38 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified