SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 53765400 of 8378 papers

TitleStatusHype
TextAttack: Lessons learned in designing Python frameworks for NLP0
Text Augmentation in a Multi-Task View0
TextAug: Test time Text Augmentation for Multimodal Person Re-identification0
Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer's Disease Detection0
Text clustering applied to data augmentation in legal contexts0
Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities0
Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models0
Text Data Augmentation: Towards better detection of spear-phishing emails0
Text Detection on Technical Drawings for the Digitization of Brown-field Processes0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Text Intimacy Analysis using Ensembles of Multilingual Transformers0
TextMosaic: A New Data Augmentation Method for Named Entity Recognition Using Document-Level Contexts0
Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili0
Textual Data Augmentation for Efficient Active Learning on Tiny Datasets0
Textual Data Augmentation for Patient Outcomes Prediction0
Texture Synthesis Guided Deep Hashing for Texture Image Retrieval0
Thai Financial Domain Adaptation of THaLLE -- Technical Report0
ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents0
That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using \#petpeeve Tweets0
THDA: Treasure Hunt Data Augmentation for Semantic Navigation0
The 2021 NIST Speaker Recognition Evaluation0
The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge0
The 2ST-UNet for Pneumothorax Segmentation in Chest X-Rays using ResNet34 as a Backbone for U-Net0
Show:102550
← PrevPage 216 of 336Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified