SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 23012350 of 8378 papers

TitleStatusHype
Improving the performance of weak supervision searches using data augmentation0
CantorNet: A Sandbox for Testing Geometrical and Topological Complexity Measures0
Reverse Thinking Makes LLMs Stronger Reasoners0
Topology-Preserving Scaling in Data Augmentation0
UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation0
Data Augmentation with Diffusion Models for Colon Polyp Localization on the Low Data Regime: How much real data is enough?0
Enhancing weed detection performance by means of GenAI-based image augmentation0
Dual-Level Boost Network for Long-Tail Prohibited Items Detection in X-ray Security Inspection0
Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification0
Thai Financial Domain Adaptation of THaLLE -- Technical Report0
Task Progressive Curriculum Learning for Robust Visual Question Answering0
Scaling nnU-Net for CBCT Segmentation0
Breast Tumor Classification Using EfficientNet Deep Learning ModelCode0
Semantic Data Augmentation for Long-tailed Facial Expression Recognition0
SynDiff-AD: Improving Semantic Segmentation and End-to-End Autonomous Driving with Synthetic Data from Latent Diffusion Models0
J-CaPA : Joint Channel and Pyramid Attention Improves Medical Image Segmentation0
Unsupervised Event Outlier Detection in Continuous Time0
Enhancing Few-Shot Learning with Integrated Data and GAN Model Approaches0
RoCoDA: Counterfactual Data Augmentation for Data-Efficient Robot Learning from Demonstrations0
A Novel Data Augmentation Tool for Enhancing Machine Learning Classification: A New Application of the Higher Order Dynamic Mode Decomposition for Improved Cardiac Disease Identification0
Machine-agnostic Automated Lumbar MRI Segmentation using a Cascaded Model Based on Generative Neurons0
Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computers0
Reconciling Semantic Controllability and Diversity for Remote Sensing Image Synthesis with Hybrid Semantic Embedding0
Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural NetworkCode0
Evaluating LLM Prompts for Data Augmentation in Multi-label Classification of Ecological Texts0
MVANet: Multi-Stage Video Attention Network for Sound Event Localization and Detection with Source Distance EstimationCode0
Improving Routability Prediction via NAS Using a Smooth One-shot Augmented Predictor0
An Experimental Study on Data Augmentation Techniques for Named Entity Recognition on Low-Resource Domains0
Next-Generation Phishing: How LLM Agents Empower Cyber Attackers0
SynEHRgy: Synthesizing Mixed-Type Structured Electronic Health Records using Decoder-Only Transformers0
Simulating Liquidity: Agent-Based Modeling of Illiquid Markets for Fractional Ownership0
Conditional Distribution Learning on GraphsCode0
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM0
On the Way to LLM Personalization: Learning to Remember User Conversations0
Whisper Finetuning on Nepali Language0
Residual Vision Transformer (ResViT) Based Self-Supervised Learning Model for Brain Tumor Classification0
Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production0
Can Open-source LLMs Enhance Data Synthesis for Toxic Detection?: An Experimental Study0
Online Item Cold-Start Recommendation with Popularity-Aware Meta-LearningCode0
Wafer Map Defect Classification Using Autoencoder-Based Data Augmentation and Convolutional Neural Network0
Variational Bayesian Bow tie Neural Networks with ShrinkageCode0
LTCXNet: Advancing Chest X-Ray Analysis with Solutions for Long-Tailed Multi-Label Classification and Fairness Challenges0
Improvement in Facial Emotion Recognition using Synthetic Data Generated by Diffusion ModelCode0
Enhancing PTSD Outcome Prediction with Ensemble Models in Disaster Contexts0
Counterfactual Learning-Driven Representation Disentanglement for Search-Enhanced Recommendation0
Are nuclear masks all you need for improved out-of-domain generalisation? A closer look at cancer classification in histopathologyCode0
Bayesian estimation of finite mixtures of Tobit models0
Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case StudyCode0
Optimal Transport-Based Displacement Interpolation with Data Augmentation for Reduced Order Modeling of Nonlinear Dynamical Systems0
A Heterogeneous Graph Neural Network Fusing Functional and Structural Connectivity for MCI Diagnosis0
Show:102550
← PrevPage 47 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified