SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 501550 of 8378 papers

TitleStatusHype
CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image ClassificationCode1
ExaRanker-Open: Synthetic Explanation for IR using Open-Source LLMsCode1
CultureLLM: Incorporating Cultural Differences into Large Language ModelsCode1
AMR-DA: Data Augmentation by Abstract Meaning RepresentationCode1
Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data AugmentationCode1
Overcoming challenges in leveraging GANs for few-shot data augmentationCode1
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimesCode1
Exploring Corruption Robustness: Inductive Biases in Vision Transformers and MLP-MixersCode1
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint ModelsCode1
Exploring Empty Spaces: Human-in-the-Loop Data AugmentationCode1
CVAE-GAN: Fine-Grained Image Generation through Asymmetric TrainingCode1
A Study on Transferability of Deep Learning Models for Network Intrusion DetectionCode1
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text GuidanceCode1
Astroformer: More Data Might not be all you need for ClassificationCode1
Advancing Fine-Grained Classification by Structure and Subject Preserving AugmentationCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
CST5: Data Augmentation for Code-Switched Semantic ParsingCode1
Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion RecognitionCode1
Aspect-Controlled Neural Argument GenerationCode1
Cross-modality Data Augmentation for End-to-End Sign Language TranslationCode1
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversionCode1
A Survey of Label-Efficient Deep Learning for 3D Point CloudsCode1
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram DigitizationCode1
Cross-modulated Few-shot Image Generation for Colorectal Tissue ClassificationCode1
CUDA: Curriculum of Data Augmentation for Long-Tailed RecognitionCode1
CyCNN: A Rotation Invariant CNN using Polar Mapping and Cylindrical Convolution LayersCode1
A Simple Graph Contrastive Learning Framework for Short Text ClassificationCode1
3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labellingCode1
Cross-domain Compositing with Pretrained Diffusion ModelsCode1
A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data AugmentationCode1
An Open-source Tool for Hyperspectral Image Augmentation in TensorflowCode1
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and GenerationCode1
Cross-Domain Feature Augmentation for Domain GeneralizationCode1
Learning from Counterfactual Links for Link PredictionCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
COVID-19 detection from scarce chest x-ray image data using few-shot deep learning approachCode1
A Semi-supervised Learning Approach with Two Teachers to Improve Breakdown Identification in DialoguesCode1
Artificial Pupil Dilation for Data Augmentation in Iris Semantic SegmentationCode1
A Simple Recipe for Language-guided Domain Generalized SegmentationCode1
Enhancing Text-based Knowledge Graph Completion with Zero-Shot Large Language Models: A Focus on Semantic EnhancementCode1
Cross-Domain Adaptive Teacher for Object DetectionCode1
Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced DataCode1
CoSDA-ML: Multi-Lingual Code-Switching Data Augmentation for Zero-Shot Cross-Lingual NLPCode1
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual ExamplesCode1
A Robust Real-Time Automatic License Plate Recognition Based on the YOLO DetectorCode1
A systematic approach to deep learning-based nodule detection in chest radiographsCode1
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language NavigationCode1
3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian LocalizationCode1
AD-LLM: Benchmarking Large Language Models for Anomaly DetectionCode1
Arrhythmia Classification using CGAN-augmented ECG SignalsCode1
Show:102550
← PrevPage 11 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified