SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 150 of 8378 papers

TitleStatusHype
YOLOv10: Real-Time End-to-End Object DetectionCode11
Depth Anything: Unleashing the Power of Large-Scale Unlabeled DataCode9
RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection TransformerCode7
RouteLLM: Learning to Route LLMs with Preference DataCode7
Symmetry Considerations for Learning Task Symmetric Robot PoliciesCode7
Dynamic Evaluation of Large Language Models by Meta Probing AgentsCode7
OmniV2V: Versatile Video Generation and Editing via Dynamic Content ManipulationCode5
DEIM: DETR with Improved Matching for Fast ConvergenceCode5
MixTex: Unambiguous Recognition Should Not Rely Solely on Real DataCode5
A Survey on Knowledge Distillation of Large Language ModelsCode5
AugLy: Data Augmentations for RobustnessCode5
Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 MatchingCode4
Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge SystemCode4
InternLM-Math: Open Math Large Language Models Toward Verifiable ReasoningCode4
RecBole 2.0: Towards a More Up-to-Date Recommendation LibraryCode4
DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band AudioCode4
A Framework For Contrastive Self-Supervised Learning And Designing A New ApproachCode4
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any CameraCode3
Data Generation for Hardware-Friendly Post-Training QuantizationCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
Data Augmentation for Sequential Recommendation: A SurveyCode3
EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone TrainingCode3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical ReasoningCode3
DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint DetectorCode3
Segment Any Medical Model ExtendedCode3
OpenGraph: Towards Open Graph Foundation ModelsCode3
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-TuningCode3
Improved motif-scaffolding with SE(3) flow matchingCode3
Anatomy-informed Data Augmentation for Enhanced Prostate Cancer DetectionCode3
Generative Data Augmentation using LLMs improves Distributional Robustness in Question AnsweringCode3
Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative RepresentationsCode3
PyABSA: A Modularized Framework for Reproducible Aspect-based Sentiment AnalysisCode3
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling StrategiesCode3
EfficientNetV2: Smaller Models and Faster TrainingCode3
Robust and Accurate Object Detection via Adversarial LearningCode3
YOLOv4: Optimal Speed and Accuracy of Object DetectionCode3
Pythia v0.1: the Winning Entry to the VQA Challenge 2018Code3
AutoAugment: Learning Augmentation Policies from DataCode3
DD-Ranking: Rethinking the Evaluation of Dataset DistillationCode2
GuardReasoner-VL: Safeguarding VLMs via Reinforced ReasoningCode2
SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data AugmentationCode2
SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy DemonstrationsCode2
NoisyRollout: Reinforcing Visual Reasoning with Data AugmentationCode2
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images and A BenchmarkCode2
External Knowledge Injection for CLIP-Based Class-Incremental LearningCode2
Composed Multi-modal Retrieval: A Survey of Approaches and ApplicationsCode2
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future DirectionsCode2
RoboBERT: An End-to-end Multimodal Robotic Manipulation ModelCode2
R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual LocalizationCode2
Show:102550
← PrevPage 1 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified