SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 151200 of 8378 papers

TitleStatusHype
CodeS: Towards Building Open-source Language Models for Text-to-SQLCode2
ARoFace: Alignment Robustness to Improve Low-Quality Face RecognitionCode2
Generative Adversarial Network in Medical Imaging: A ReviewCode2
BOP Challenge 2020 on 6D Object LocalizationCode2
BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data TrainingCode2
BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with TransformerCode2
Improved Multi-Task Brain Tumour Segmentation with Synthetic Data AugmentationCode2
Composed Multi-modal Retrieval: A Survey of Approaches and ApplicationsCode2
A Survey on Data Augmentation in Large Model EraCode2
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification TasksCode2
A Survey of Data Augmentation Approaches for NLPCode2
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical ReasoningCode2
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 LanguagesCode2
LLM2LLM: Boosting LLMs with Novel Iterative Data EnhancementCode2
MindBridge: A Cross-Subject Brain Decoding FrameworkCode2
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event DetectionCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, GeometryCode2
Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial ExamplesCode2
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature AnalysisCode1
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningCode1
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment AnalysisCode1
An Analysis of Simple Data Augmentation for Named Entity RecognitionCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter OptimizationCode1
Bi-level Learning of Task-Specific Decoders for Joint Registration and One-Shot Medical Image SegmentationCode1
Analyzing Overfitting under Class Imbalance in Neural Networks for Image SegmentationCode1
BEVDiffLoc: End-to-End LiDAR Global Localization in BEV View based on Diffusion ModelCode1
Analysis of skin lesion images with deep learningCode1
3D Common Corruptions and Data AugmentationCode1
Anatomical Data Augmentation via Fluid-based Image RegistrationCode1
Exploring Discontinuity for Video Frame InterpolationCode1
An Accurate Car Counting in Aerial Images Based on Convolutional Neural NetworksCode1
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint ModelsCode1
Better plain ViT baselines for ImageNet-1kCode1
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency DomainCode1
AMR-DA: Data Augmentation by Abstract Meaning RepresentationCode1
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification ContextCode1
Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuningCode1
Bayesian inference for logistic models using Polya-Gamma latent variablesCode1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
BDANet: Multiscale Convolutional Neural Network with Cross-directional Attention for Building Damage Assessment from Satellite ImagesCode1
Bag of Tricks for Long-Tailed Visual Recognition with Deep Convolutional Neural NetworksCode1
Bayesian Adversarial Human Motion SynthesisCode1
Behavior Injection: Preparing Language Models for Reinforcement LearningCode1
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust FinetuningCode1
A Locality-based Neural Solver for Optical Motion CaptureCode1
A Winning Hand: Compressing Deep Networks Can Improve Out-Of-Distribution RobustnessCode1
A Cartesian Encoding Graph Neural Network for Crystal Structures Property Prediction: Application to Thermal Ellipsoid EstimationCode1
AlignMixup: Improving Representations By Interpolating Aligned FeaturesCode1
Show:102550
← PrevPage 4 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified