SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 39514000 of 8378 papers

TitleStatusHype
Data Augmentation for Mathematical Objects0
Data Augmentation for Machine Translation via Dependency Subtree SwappingCode0
Single Domain Generalization via Normalised Cross-correlation Based Convolutions0
The Whole Pathological Slide Classification via Weakly Supervised Learning0
Diagnosis, Feedback, Adaptation: A Human-in-the-Loop Framework for Test-Time Policy Adaptation0
Data Augmentation in Training CNNs: Injecting Noise to Images0
Improved POS tagging for spontaneous, clinical speech using data augmentation0
RoPDA: Robust Prompt-based Data Augmentation for Low-Resource Named Entity Recognition0
Q-YOLOP: Quantization-aware You Only Look Once for Panoptic Driving Perception0
Optimal Transport Posterior Alignment for Cross-lingual Semantic ParsingCode0
Robust Learning-Based Incipient Slip Detection using the PapillArray Optical Tactile Sensor for Improved Robotic Gripping0
GEANN: Scalable Graph Augmentations for Multi-Horizon Time Series Forecasting0
Blocks2World: Controlling Realistic Scenes with Editable Primitives0
TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning0
ADASSM: Adversarial Data Augmentation in Statistical Shape Models From Images0
LEA: Improving Sentence Similarity Robustness to Typos Using Lexical Attention BiasCode0
Using Data Augmentations and VTLN to Reduce Bias in Dutch End-to-End Speech Recognition Systems0
Beyond Known Reality: Exploiting Counterfactual Explanations for Medical ResearchCode0
PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical RecordsCode0
Learning to Augment: Hallucinating Data for Domain Generalized Segmentation0
SleepEGAN: A GAN-enhanced Ensemble Deep Learning Model for Imbalanced Classification of Sleep Stages0
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning0
Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases0
ENGAGE: Explanation Guided Data Augmentation for Graph Representation LearningCode0
Dynamical Graph Echo State Networks with Snapshot Merging for Dissemination Process Classification0
SUGAR: Spherical Ultrafast Graph Attention Framework for Cortical Surface Registration0
CNN-BiLSTM model for English Handwriting Recognition: Comprehensive Evaluation on the IAM Dataset0
ARHNet: Adaptive Region Harmonization for Lesion-aware Augmentation to Improve Segmentation PerformanceCode0
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency0
Long-Tailed Continual Learning For Visual Food Recognition0
Decoding Taste Information in Human Brain: A Temporal and Spatial Reconstruction Data Augmentation Method Coupled with Taste EEG0
Unsupervised Coordinate-Based Video Denoising0
Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing0
Counterfactual Collaborative Reasoning0
EyeBAG: Accurate Control of Eye Blink and Gaze Based on Data Augmentation Leveraging Style Mixing0
DeepTagger: Knowledge Enhanced Named Entity Recognition for Web-Based Ads Queries0
Navigating Noise: A Study of How Noise Influences Generalisation and Calibration of Neural NetworksCode0
GuidedMixup: An Efficient Mixup Strategy Guided by Saliency MapsCode0
Generate Anything Anywhere in Any Scene0
DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations0
Fused Gromov-Wasserstein Graph Mixup for Graph-level ClassificationsCode0
Improving Primate Sounds Classification using Binary Presorting for Deep Learning0
Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination methods0
Pseudo-Bag Mixup Augmentation for Multiple Instance Learning-Based Whole Slide Image Classification0
Cross Spectral Image Reconstruction Using a Deep Guided Neural NetworkCode0
Multi-perspective Information Fusion Res2Net with RandomSpecmix for Fake Speech Detection0
Enhancing Representation Learning on High-Dimensional, Small-Size Tabular Data: A Divide and Conquer Method with Ensembled VAEs0
Using Large Language Models to Provide Explanatory Feedback to Human Tutors0
On the Usefulness of Synthetic Tabular Data Generation0
TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection0
Show:102550
← PrevPage 80 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified