SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 201250 of 8378 papers

TitleStatusHype
Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence ModelingCode1
Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 ChallengeCode1
Break-It-Fix-It: Unsupervised Learning for Program RepairCode1
A Cartesian Encoding Graph Neural Network for Crystal Structures Property Prediction: Application to Thermal Ellipsoid EstimationCode1
An Accurate Car Counting in Aerial Images Based on Convolutional Neural NetworksCode1
Bootstrapping Relation Extractors using Syntactic Search by ExamplesCode1
BSUV-Net: A Fully-Convolutional Neural Network forBackground Subtraction of Unseen VideosCode1
Adaptive Graph Contrastive Learning for RecommendationCode1
Boosted Neural Decoders: Achieving Extreme Reliability of LDPC Codes for 6G NetworksCode1
Bi-level Learning of Task-Specific Decoders for Joint Registration and One-Shot Medical Image SegmentationCode1
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency DomainCode1
DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data AugmentationCode1
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningCode1
Exploring Discontinuity for Video Frame InterpolationCode1
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature AnalysisCode1
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint ModelsCode1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
Bootstrap Your Object Detector via Mixed TrainingCode1
Boundary thickness and robustness in learning modelsCode1
Analysis of skin lesion images with deep learningCode1
Analyzing Overfitting under Class Imbalance in Neural Networks for Image SegmentationCode1
An Analysis of Simple Data Augmentation for Named Entity RecognitionCode1
scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell DataCode1
AMR-DA: Data Augmentation by Abstract Meaning RepresentationCode1
An Efficient and Scalable Deep Learning Approach for Road Damage DetectionCode1
ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data FusionCode1
CADTransformer: Panoptic Symbol Spotting Transformer for CAD DrawingsCode1
Anchor-free Small-scale Multispectral Pedestrian DetectionCode1
An Effective and Robust Detector for Logo DetectionCode1
Eliminate Deviation with Deviation for Data Augmentation and a General Multi-modal Data Learning MethodCode1
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure SystemsCode1
An evaluation framework for synthetic data generation modelsCode1
Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment AnalysisCode1
An Empirical Study of CLIP for Text-based Person SearchCode1
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language ModelsCode1
An Empirical Survey of Data Augmentation for Time Series Classification with Neural NetworksCode1
CAR-MFL: Cross-Modal Augmentation by Retrieval for Multimodal Federated Learning with Missing ModalitiesCode1
Leveraging SO(3)-steerable convolutions for pose-robust semantic segmentation in 3D medical dataCode1
3D Common Corruptions and Data AugmentationCode1
CCGL: Contrastive Cascade Graph LearningCode1
CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color ConstancyCode1
CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image ClassificationCode1
An Investigation of End-to-End Models for Robust Speech RecognitionCode1
CipherDAug: Ciphertext based Data Augmentation for Neural Machine TranslationCode1
AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat ReportsCode1
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile ApplicationCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer SummarizationCode1
A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent AttentionCode1
BOOTPLACE: Bootstrapped Object Placement with Detection TransformersCode1
Show:102550
← PrevPage 5 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified