SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 80018050 of 8378 papers

TitleStatusHype
T5 for Hate Speech, Augmented Data and EnsembleCode0
Evaluating data augmentation for financial time series classificationCode0
Consistency of augmentation graph and network approximability in contrastive learningCode0
Conjugate Bayesian Two-step Change Point Detection for Hawkes ProcessCode0
Conditional Infilling GANs for Data Augmentation in Mammogram ClassificationCode0
Adversarial Word Dilution as Text Data Augmentation in Low-Resource RegimeCode0
OLISIA: a Cascade System for Spoken Dialogue State TrackingCode0
Equivariant Contrastive Learning for Sequential RecommendationCode0
Conditional Generation from Unconditional Diffusion Models using Denoiser RepresentationsCode0
Equivariance versus Augmentation for Spherical ImagesCode0
OMR: Occlusion-Aware Memory-Based Refinement for Video Lane DetectionCode0
EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth from Light Field ImagesCode0
On Automatic Data Augmentation for 3D Point Cloud ClassificationCode0
TabAug: Data Driven Augmentation for Enhanced Table Structure RecognitionCode0
Conditional Distribution Learning on GraphsCode0
Conditional BERT Contextual AugmentationCode0
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring TasksCode0
Table-based Fact Verification with Salience-aware LearningCode0
Ensembles provably learn equivariance through data augmentationCode0
Towards Robustifying NLI Models Against Lexical Dataset BiasesCode0
TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context SubsettingCode0
Ensemble Augmentation for Deep Neural Networks Using 1-D Time Series Vibration DataCode0
Graph Component Contrastive Learning for Concept Relatedness EstimationCode0
Augmented Balanced Image Dataset Generator Using AugStatic LibraryCode0
Compositionality as Lexical SymmetryCode0
Enriching GNNs with Text Contextual Representations for Detecting Disinformation Campaigns on Social MediaCode0
One Node One Model: Featuring the Missing-Half for Graph ClusteringCode0
Comparative Knowledge DistillationCode0
Enriched Long-term Recurrent Convolutional Network for Facial Micro-Expression RecognitionCode0
DICE: Device-level Integrated Circuits Encoder with Graph Contrastive PretrainingCode0
Enriched BERT Embeddings for Scholarly Publication ClassificationCode0
One-Shot Segmentation of Novel White Matter Tracts via Extensive Data AugmentationCode0
Enhancing Visual Perception in Novel Environments via Incremental Data Augmentation Based on Style TransferCode0
Enhancing TinyBERT for Financial Sentiment Analysis Using GPT-Augmented FinBERT DistillationCode0
Community-Based Hierarchical Positive-Unlabeled (PU) Model Fusion for Chronic Disease PredictionCode0
Enhancing Sequence-to-Sequence Neural Lemmatization with External ResourcesCode0
Combining Denoising Autoencoders with Contrastive Learning to fine-tune Transformer ModelsCode0
Enhancing Robustness of AI Offensive Code Generators via Data AugmentationCode0
Enhancing Psychotherapy Counseling: A Data Augmentation Pipeline Leveraging Large Language Models for Counseling ConversationsCode0
Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph NetworksCode0
On the Effectiveness of Hybrid Pooling in Mixup-Based Graph Learning for Language ProcessingCode0
Combining Contrastive and Supervised Learning for Video Super-Resolution DetectionCode0
Colorful Cutout: Enhancing Image Data Augmentation with Curriculum LearningCode0
Tackling Data Bias in Painting Classification with Style TransferCode0
Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniquesCode0
ColloQL: Robust Text-to-SQL Over Search QueriesCode0
Self-supervised Learning for Panoptic Segmentation of Multiple Fruit Flower SpeciesCode0
U-NetPlus: A Modified Encoder-Decoder U-Net Architecture for Semantic and Instance Segmentation of Surgical InstrumentCode0
ColloQL: Robust Cross-Domain Text-to-SQL Over Search QueriesCode0
Tackling Irony Detection using Ensemble ClassifiersCode0
Show:102550
← PrevPage 161 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified