SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 78517900 of 8378 papers

TitleStatusHype
NeSy is alive and well: A LLM-driven symbolic approach for better code comment data generation and classificationCode0
NetAurHPD: Network Auralization Hyperlink Prediction Model to Identify Metabolic Pathways from Metabolomics DataCode0
Dynamic Test-Time Augmentation via Differentiable FunctionsCode0
T-GAE: Transferable Graph Autoencoder for Network AlignmentCode0
Scalable Implicit Graphon LearningCode0
Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class ImbalanceCode0
Networks with pixels embedding: a method to improve noise resistance in images classificationCode0
UW-FinSent at SemEval-2017 Task 5: Sentiment Analysis on Financial News Headlines using Training Dataset AugmentationCode0
Transformers on Multilingual Clause-Level MorphologyCode0
Syntax-based data augmentation for Hungarian-English machine translationCode0
Syntax-driven Data Augmentation for Named Entity RecognitionCode0
Exploring the Robustness of In-Context Learning with Noisy LabelsCode0
Zero-shot Code-Mixed Offensive Span Identification through Rationale ExtractionCode0
Neural Data Augmentation via Example ExtrapolationCode0
VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation ExtractionCode0
Simple Data Augmentation Techniques for Chinese Disease NormalizationCode0
Counterexample-Guided Data AugmentationCode0
Neural Language Model Based Training Data Augmentation for Weakly Supervised Early Rumor DetectionCode0
Understanding Compositional Data Augmentation in Typologically Diverse Morphological InflectionCode0
Exploring Scene Affinity for Semi-Supervised LiDAR Semantic SegmentationCode0
CoUDA: Coherence Evaluation via Unified Data AugmentationCode0
Augment to Interpret: Unsupervised and Inherently Interpretable Graph EmbeddingsCode0
Neural Network Architecture for Database Augmentation Using Shared FeaturesCode0
Cost-efficient Crowdsourcing for Span-based Sequence Labeling: Worker Selection and Data AugmentationCode0
Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language ModelsCode0
Neural Networks Regularization Through Representation LearningCode0
An Efficient LSTM Neural Network-Based Framework for Vessel Location ForecastingCode0
CORE: A Retrieve-then-Edit Framework for Counterfactual Data GenerationCode0
Neural Operator-Based Proxy for Reservoir Simulations Considering Varying Well Settings, Locations, and Permeability FieldsCode0
Scaling Laws For Dense RetrievalCode0
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation PoliciesCode0
CoPrUS: Consistency Preserving Utterance Synthesis towards more realistic benchmark dialoguesCode0
Scaling up Discourse Quality Annotation for Political ScienceCode0
Exploring Human-Like Thinking in Search Simulations with Large Language ModelsCode0
WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial NetworksCode0
ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and DissertationsCode0
Towards Generalising Neural Topical RepresentationsCode0
CoopSubNet: Cooperating Subnetwork for Data-Driven Regularization of Deep Networks under Limited Training BudgetsCode0
Convolutional Recurrent Neural Networks for Electrocardiogram ClassificationCode0
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative ExamplesCode0
Synthesizing and Identifying Noise Levels in Autonomous Vehicle Camera Radar DatasetsCode0
Neural TMDlayer: Modeling Instantaneous flow of features via SDE GeneratorsCode0
Exploring Data Augmentation for Code Generation TasksCode0
Context-Aware Image Matting for Simultaneous Foreground and Alpha EstimationCode0
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data AugmentationCode0
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited QuestionsCode0
Conversation Graph: Data Augmentation, Training and Evaluation for Non-Deterministic Dialogue ManagementCode0
SCISSOR: Mitigating Semantic Bias through Cluster-Aware Siamese Networks for Robust ClassificationCode0
Towards Generalizable SER: Soft Labeling and Data Augmentation for Modeling Temporal Emotion Shifts in Large-Scale Multilingual SpeechCode0
A Deep Learning Model for Chilean Bills ClassificationCode0
Show:102550
← PrevPage 158 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified