SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 69517000 of 8378 papers

TitleStatusHype
Acoustic scene classification using auditory datasetsCode0
ARHNet: Adaptive Region Harmonization for Lesion-aware Augmentation to Improve Segmentation PerformanceCode0
Improving Grammatical Error Correction via Contextual Data AugmentationCode0
Improving Generalization for Multimodal Fake News DetectionCode0
PSAT: Pediatric Segmentation Approaches via Adult Augmentations and Transfer LearningCode0
Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector QuantizationCode0
The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspectiveCode0
The Knowref Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora ResolutionCode0
Pseudo-Label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object DetectionCode0
L^2CL: Embarrassingly Simple Layer-to-Layer Contrastive Learning for Graph Collaborative FilteringCode0
LA3: Efficient Label-Aware AutoAugmentCode0
A Practical Method for Generating String CounterfactualsCode0
Label Augmentation Method for Medical Landmark Detection in Hip Radiograph ImagesCode0
Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language ModelsCode0
Improving Dialogue State Tracking with Turn-based Loss Function and Sequential Data AugmentationCode0
Pseudo Multi-Source Domain Generalization: Bridging the Gap Between Single and Multi-Source Domain GeneralizationCode0
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree TransformationCode0
Theoretically Motivated Data Augmentation and Regularization for Portfolio ConstructionCode0
Adversarial Defense via Data Dependent Activation Function and Total Variation MinimizationCode0
Disentangling Policy from Offline Task Representation Learning via Adversarial Data AugmentationCode0
A little goes a long way: Improving toxic language classification despite data scarcityCode0
Labels Generated by Large Language Model Helps Measuring People's Empathy in VitroCode0
Discriminative Neural Clustering for Speaker DiarisationCode0
Transformer-Based Dual-Optical Attention Fusion Crowd Head Point Counting and Localization NetworkCode0
Improving deep learning in arrhythmia Detection: The application of modular quality and quantity controllers in data augmentationCode0
Discriminative feature generation for classification of imbalanced dataCode0
LaiDA: Linguistics-aware In-context Learning with Data Augmentation for Metaphor Components IdentificationCode0
PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical RecordsCode0
Improving Robustness by Enhancing Weak SubnetsCode0
PULSAR: Pre-training with Extracted Healthcare Terms for Summarising Patients' Problems and Data Augmentation with Black-box Large Language ModelsCode0
Discrete Wavelet Transform for Generative Adversarial Network to Identify Drivers Using Gyroscope and Accelerometer SensorsCode0
Small Object Detection via Pixel Level Balancing With Applications to Blood Cell DetectionCode0
Adversarial Bayesian Augmentation for Single-Source Domain GeneralizationCode0
SMARAGD: Learning SMatch for Accurate and Rapid Approximate Graph DistanceCode0
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question AnsweringCode0
Aligning Actions and Walking to LLM-Generated Textual DescriptionsCode0
Language Semantic Graph Guided Data-Efficient LearningCode0
PushPull-Net: Inhibition-driven ResNet robust to image corruptionsCode0
Bridging the domain gap in cross-lingual document classificationCode0
Large Language Model Augmented Narrative Driven RecommendationsCode0
Transformers as Neural Augmentors: Class Conditional Sentence Generation via Variational BayesCode0
Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data AugmentationCode0
Disconnect to Connect: A Data Augmentation Method for Improving Topology Accuracy in Image SegmentationCode0
Improving Compositional Generalization in Math Word Problem SolvingCode0
Pyramid Adversarial Training Improves ViT PerformanceCode0
PyramidBox++: High Performance Detector for Finding Tiny FaceCode0
SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language ModelsCode0
Large Margin Deep Networks for ClassificationCode0
Temporal-Coded Deep Spiking Neural Network with Easy Training and Robust PerformanceCode0
Smelting Gold and Silver for Improved Multilingual AMR-to-Text GenerationCode0
Show:102550
← PrevPage 140 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified