SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 17511800 of 8378 papers

TitleStatusHype
JUICER: Data-Efficient Imitation Learning for Robotic AssemblyCode1
Mitigating analytical variability in fMRI results with style transfer0
If It's Not Enough, Make It So: Reducing Authentic Data Demand in Face Recognition through Synthetic Faces0
Improving Topic Relevance Model by Mix-structured Summarization and LLM-based Data Augmentation0
TSA on AutoPilot: Self-tuning Self-supervised Time Series Anomaly DetectionCode0
Low-resource neural machine translation with morphological modelingCode0
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness0
Generative-Contrastive Heterogeneous Graph Neural NetworkCode0
Semantic Augmentation in Images using Language0
Deep Neural Networks with 3D Point Clouds for Empirical Friction Measurements in Hydrodynamic Flood ModelsCode0
ContrastCAD: Contrastive Learning-based Representation Learning for Computer-Aided Design ModelsCode1
A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference ResolutionCode0
Towards Enhanced Analysis of Lung Cancer Lesions in EBUS-TBNA -- A Semi-Supervised Video Object Detection Method0
AAdaM at SemEval-2024 Task 1: Augmentation and Adaptation for Multilingual Semantic Textual RelatednessCode0
CAAP: Class-Dependent Automatic Data Augmentation Based On Adaptive Policies For Time Series0
Source-Aware Training Enables Knowledge Attribution in Language ModelsCode1
Harnessing The Power of Attention For Patch-Based Biomedical Image Classification0
Position-Aware Parameter Efficient Fine-Tuning Approach for Reducing Positional Bias in LLMs0
CoUDA: Coherence Evaluation via Unified Data AugmentationCode0
Addressing Both Statistical and Causal Gender Fairness in NLP ModelsCode0
Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation0
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLPCode0
A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs0
Shortcuts Arising from Contrast: Effective and Covert Clean-Label Attacks in Prompt-Based Learning0
Colorful Cutout: Enhancing Image Data Augmentation with Curriculum LearningCode0
Adverb Is the Key: Simple Text Data Augmentation with Adverb DeletionCode0
A Data-Driven Predictive Analysis on Cyber Security Threats with Key Risk Factors0
Boosting Cardiac Color Doppler Frame Rates with Deep Learning0
Enhance Image Classification via Inter-Class Image Mixup with Diffusion ModelCode1
Towards Multimodal Video Paragraph Captioning Models Robust to Missing ModalityCode0
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems0
Deep Fusion: Capturing Dependencies in Contrastive Learning via Transformer Projection Heads0
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event DetectionCode2
GeNet: A Graph Neural Network-based Anti-noise Task-Oriented Semantic Communication ParadigmCode1
Scaling Laws For Dense RetrievalCode0
Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data0
A vascular synthetic model for improved aneurysm segmentation and detection via Deep Neural Networks0
Choreographing the Digital Canvas: A Machine Learning Approach to Artistic Performance0
Segment Any Medical Model ExtendedCode3
Semi-Supervised Image Captioning Considering Wasserstein Graph Matching0
OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation0
Illuminating Blind Spots of Language Models with Targeted Agent-in-the-Loop Synthetic Data0
The Solution for the CVPR 2023 1st foundation model challenge-Track20
Calib3D: Calibrating Model Preferences for Reliable 3D Scene UnderstandingCode2
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA0
SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation0
Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator0
EG-ConMix: An Intrusion Detection Method based on Graph Contrastive Learning0
Are NeRFs ready for autonomous driving? Towards closing the real-to-simulation gap0
Towards Channel-Resilient CSI-Based RF Fingerprinting using Deep Learning0
Show:102550
← PrevPage 36 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified