SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 101150 of 8378 papers

TitleStatusHype
NoisyRollout: Reinforcing Visual Reasoning with Data AugmentationCode2
Efficient Training of Language Models to Fill in the MiddleCode2
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
Fast-BEV: A Fast and Strong Bird's-Eye View Perception BaselineCode2
Effective Data Augmentation With Diffusion ModelsCode2
PolarMix: A General Data Augmentation Technique for LiDAR Point CloudsCode2
EarthLoc: Astronaut Photography Localization by Indexing Earth from SpaceCode2
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math ReasoningCode2
Random Erasing Data AugmentationCode2
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images and A BenchmarkCode2
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification TasksCode2
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future DirectionsCode2
Revisiting Adversarial Training under Long-Tailed DistributionsCode2
RL-ADN: A High-Performance Deep Reinforcement Learning Environment for Optimal Energy Storage Systems Dispatch in Active Distribution NetworksCode2
A Survey on Diffusion Models for Recommender SystemsCode2
Delving into the Trajectory Long-tail Distribution for Muti-object TrackingCode2
MolScribe: Robust Molecular Structure Recognition with Image-To-Graph GenerationCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual BackbonesCode2
Saturn: Sample-efficient Generative Molecular Design using Memory ManipulationCode2
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View PerceptionCode2
Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future DirectionsCode2
Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series ClassificationCode2
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language ModelsCode2
DD-Ranking: Rethinking the Evaluation of Dataset DistillationCode2
SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy DemonstrationsCode2
Deep learning for time series classificationCode2
Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology imagesCode2
Conditional Diffusion Models for Semantic 3D Brain MRI SynthesisCode2
Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation frameworkCode2
Deep PCB To COCO ConvertorCode2
BOP Challenge 2020 on 6D Object LocalizationCode2
BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data TrainingCode2
BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with TransformerCode2
CodeS: Towards Building Open-source Language Models for Text-to-SQLCode2
Composed Multi-modal Retrieval: A Survey of Approaches and ApplicationsCode2
BirdNET: A deep learning solution for avian diversity monitoringCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
BiRP: Learning Robot Generalized Bimanual Coordination using Relative Parameterization Method on Human DemonstrationCode2
Decoupling Representation Learning from Reinforcement LearningCode2
Calib3D: Calibrating Model Preferences for Reliable 3D Scene UnderstandingCode2
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data GenerationCode2
Authorship Obfuscation in Multilingual Machine-Generated Text DetectionCode2
Automated Self-Supervised Learning for RecommendationCode2
Addressing Concept Shift in Online Time Series Forecasting: Detect-then-AdaptCode2
Depth Field Networks for Generalizable Multi-view Scene RepresentationCode2
Augraphy: A Data Augmentation Library for Document ImagesCode2
A Versatile Framework for Multi-scene Person Re-identificationCode2
EdgeYOLO: An Edge-Real-Time Object DetectorCode2
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?Code2
Show:102550
← PrevPage 3 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified