SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 23512400 of 8378 papers

TitleStatusHype
HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian AidCode0
HULAT at SemEval-2023 Task 10: Data augmentation for pre-trained transformers applied to the detection of sexism in social mediaCode0
HULAT at SemEval-2023 Task 9: Data augmentation for pre-trained transformers applied to Multilingual Tweet Intimacy AnalysisCode0
Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided ApproachCode0
Bootstrap Advantage Estimation for Policy Optimization in Reinforcement LearningCode0
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?Code0
Iceberg: Enhancing HLS Modeling with Synthetic DataCode0
An ordinal CNN approach for the assessment of neurological damage in Parkinson's disease patientsCode0
How Well Do Multi-hop Reading Comprehension Models Understand Date Information?Code0
Boosting Semi-Supervised 3D Object Detection with Semi-SamplingCode0
Boosting Semi-Supervised 2D Human Pose Estimation by Revisiting Data Augmentation and Consistency TrainingCode0
How to Solve Contextual Goal-Oriented Problems with Offline Datasets?Code0
How Should Markup Tags Be Translated?Code0
1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position SelectorCode0
How to track your dragon: A Multi-Attentional Framework for real-time RGB-D 6-DOF Object Pose TrackingCode0
HSDA: High-frequency Shuffle Data Augmentation for Bird's-Eye-View Map SegmentationCode0
How Explanations Leak the Decision Logic: Stealing Graph Neural Networks via Explanation AlignmentCode0
Boosting Novel Category Discovery Over Domains with Soft Contrastive Learning and All in One ClassifierCode0
How Good Are Synthetic Medical Images? An Empirical Study with Lung UltrasoundCode0
Annotating FrameNet via Structure-Conditioned Language GenerationCode0
How Do We Fail? Stress Testing Perception in Autonomous VehiclesCode0
Holistic Exploration on Universal Decompositional Semantic Parsing: Architecture, Data Augmentation, and LLM ParadigmCode0
HitNet: a neural network with capsules embedded in a Hit-or-Miss layer, extended with hybrid data augmentation and ghost capsulesCode0
HIT-SCIR at MMNLU-22: Consistency Regularization for Multilingual Spoken Language UnderstandingCode0
Hotels-50K: A Global Hotel Recognition DatasetCode0
How Robust is 3D Human Pose Estimation to Occlusion?Code0
Learning Stage-wise GANs for Whistle Extraction in Time-Frequency SpectrogramsCode0
Boosting High Resolution Image Classification with Scaling-up TransformersCode0
An Investigation of Time Reversal Symmetry in Reinforcement LearningCode0
Boosting Distress Support Dialogue Responses with Motivational Interviewing StrategyCode0
Adverb Is the Key: Simple Text Data Augmentation with Adverb DeletionCode0
Boosting Disfluency Detection with Large Language Model as Disfluency GeneratorCode0
An Inflectional Database for GitksanCode0
Hierarchical Transformer Model for Scientific Named Entity RecognitionCode0
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data AugmentationCode0
High-dimensional Bayesian Tobit regression for censored response with Horseshoe priorCode0
Hide-and-Seek: A Data Augmentation Technique for Weakly-Supervised Localization and BeyondCode0
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better GeneralizabilityCode0
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better GeneralizationCode0
HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity ReasoningCode0
Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority GenerationCode0
Heavy Lasso: sparse penalized regression under heavy-tailed noise via data-augmented soft-thresholdingCode0
Heterogeneous Multi-Task Gaussian Cox ProcessesCode0
Harnessing Out-Of-Distribution Examples via Augmenting Content and StyleCode0
Harnessing Collective Structure Knowledge in Data Augmentation for Graph Neural NetworksCode0
HCDG: A Hierarchical Consistency Framework for Domain Generalization on Medical Image SegmentationCode0
Person Re-identification: Implicitly Defining the Receptive Fields of Deep Learning Classification FrameworksCode0
Head and Neck Tumor Segmentation of MRI from Pre- and Mid-radiotherapy with Pre-training, Data Augmentation and Dual Flow UNetCode0
BLT: Balancing Long-Tailed Datasets with Adversarially-Perturbed ImagesCode0
An Image Clustering Auto-Encoder Based on Predefined Evenly-Distributed Class Centroids and MMD DistanceCode0
Show:102550
← PrevPage 48 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified