SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 13511400 of 8378 papers

TitleStatusHype
Unsupervised Sketch-to-Photo SynthesisCode1
Learning joint segmentation of tissues and brain lesions from task-specific hetero-modal domain-shifted datasetsCode1
Adversarial Dual-Student with Differentiable Spatial Warping for Semi-Supervised Semantic SegmentationCode1
Learning Normal Flow Directly From Event NeighborhoodsCode1
Learning Representations of Bi-level Knowledge Graphs for Reasoning beyond Link PredictionCode1
Learning Robust Representations via Multi-View Information BottleneckCode1
Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose EstimationCode1
Learning Temporally Invariant and Localizable Features via Data Augmentation for Video RecognitionCode1
Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced DataCode1
Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-OptCode1
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language NavigationCode1
Learn to Augment: Joint Data Augmentation and Network Optimization for Text RecognitionCode1
Lesion-based Contrastive Learning for Diabetic Retinopathy Grading from Fundus ImagesCode1
C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot FillingCode1
CT4Rec: Simple yet Effective Consistency Training for Sequential RecommendationCode1
A parallel corpus of Python functions and documentation strings for automated code documentation and code generationCode1
AutoCLINT: The Winning Method in AutoCV Challenge 2019Code1
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation LearningCode1
Lie Point Symmetry Data Augmentation for Neural PDE SolversCode1
AutoBalance: Optimized Loss Functions for Imbalanced DataCode1
CADTransformer: Panoptic Symbol Spotting Transformer for CAD DrawingsCode1
CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue SystemCode1
CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue SystemCode1
APBench: A Unified Benchmark for Availability Poisoning Attacks and DefensesCode1
CoSDA-ML: Multi-Lingual Code-Switching Data Augmentation for Zero-Shot Cross-Lingual NLPCode1
Counterfactual Data Augmentation for Neural Machine TranslationCode1
Cross-Domain Feature Augmentation for Domain GeneralizationCode1
Adversarial Feature Hallucination Networks for Few-Shot LearningCode1
Local Additivity Based Data Augmentation for Semi-supervised NERCode1
Calibrating Wireless Ray Tracing for Digital Twinning using Local Phase Error EstimatesCode1
Logic-Guided Data Augmentation and Regularization for Consistent Question AnsweringCode1
Long-tail Augmented Graph Contrastive Learning for RecommendationCode1
A Contrastive Cross-Channel Data Augmentation Framework for Aspect-based Sentiment AnalysisCode1
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization PerspectiveCode1
A pipeline for fair comparison of graph neural networks in node classification tasksCode1
Lung Segmentation from Chest X-rays using Variational Data ImputationCode1
CodeIt: Self-Improving Language Models with Prioritized Hindsight ReplayCode1
MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and RecoveryCode1
MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without RetrainingCode1
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data AugmentationCode1
A 3D generative model of pathological multi-modal MR images and segmentationsCode1
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic EnvironmentsCode1
Mask Conditional Synthetic Satellite ImageryCode1
Masked Autoencoders are Robust Data AugmentorsCode1
Appearance and Structure Aware Robust Deep Visual Graph Matching: Attack, Defense and BeyondCode1
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement LearningCode1
MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction FusionCode1
Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and RobustnessCode1
MedAugment: Universal Automatic Data Augmentation Plug-in for Medical Image AnalysisCode1
Cooperative Training and Latent Space Data Augmentation for Robust Medical Image SegmentationCode1
Show:102550
← PrevPage 28 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified