SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 201250 of 8378 papers

TitleStatusHype
MVCNet: Multi-View Contrastive Network for Motor Imagery ClassificationCode1
Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on ManchuCode1
ReLearn: Unlearning via Learning for Large Language ModelsCode1
MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI ClassificationCode1
Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 ChallengeCode1
SpaceGNN: Multi-Space Graph Neural Network for Node Anomaly Detection with Extremely Limited LabelsCode1
A Cartesian Encoding Graph Neural Network for Crystal Structures Property Prediction: Application to Thermal Ellipsoid EstimationCode1
Image, Text, and Speech Data Augmentation using Multimodal LLMs for Deep Learning: A SurveyCode1
CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentationCode1
MixRec: Individual and Collective Mixing Empowers Data Augmentation for Recommender SystemsCode1
A Survey of World Models for Autonomous DrivingCode1
A Simple Graph Contrastive Learning Framework for Short Text ClassificationCode1
DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific InformationCode1
Context-Aware Deep Learning for Multi Modal Depression DetectionCode1
DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect GenerationCode1
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance AnalysisCode1
DS^2-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment AnalysisCode1
MixRec: Heterogeneous Graph Collaborative FilteringCode1
PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object DetectionCode1
AD-LLM: Benchmarking Large Language Models for Anomaly DetectionCode1
Learning Normal Flow Directly From Event NeighborhoodsCode1
ST-FiT: Inductive Spatial-Temporal Forecasting with Limited Training DataCode1
FM2S: Towards Spatially-Correlated Noise Modeling in Zero-Shot Fluorescence Microscopy Image DenoisingCode1
Augmenting Sequential Recommendation with Balanced Relevance and DiversityCode1
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMsCode1
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image SegmentationCode1
Training and Evaluating Language Models with Template-based Data GenerationCode1
Open-Amp: Synthetic Data Framework for Audio Effect Foundation ModelsCode1
FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-trainingCode1
Generalizable Person Re-identification via Balancing Alignment and UniformityCode1
DeepCRF: Deep Learning-Enhanced CSI-Based RF Fingerprinting for Channel-Resilient WiFi Device IdentificationCode1
Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion InversionCode1
DiffBatt: A Diffusion Model for Battery Degradation Prediction and SynthesisCode1
DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch InferenceCode1
Mitigating Unauthorized Speech Synthesis for Voice ProtectionCode1
Shape Transformation Driven by Active Contour for Class-Imbalanced Semi-Supervised Medical Image SegmentationCode1
Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided DiffusionCode1
Minority-Focused Text-to-Image Generation via Prompt OptimizationCode1
RFBoost: Understanding and Boosting Deep WiFi Sensing via Physical Data AugmentationCode1
Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile glovesCode1
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard ModelsCode1
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic DataCode1
Data Extrapolation for Text-to-image Generation on Small DatasetsCode1
Exploring Empty Spaces: Human-in-the-Loop Data AugmentationCode1
RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic ObservationsCode1
SWIM: Short-Window CNN Integrated with Mamba for EEG-Based Auditory Spatial Attention DecodingCode1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance ScalingCode1
TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based ModelsCode1
Deep-Wide Learning Assistance for Insect Pest ClassificationCode1
Robust image representations with counterfactual contrastive learningCode1
Show:102550
← PrevPage 5 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified