SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 53515400 of 8378 papers

TitleStatusHype
TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla0
Team DoNotDistribute at SemEval-2020 Task 11: Features, Finetuning, and Data Augmentation in Neural Models for Propaganda Detection in News Articles0
Team HYU ASML ROBOVOX SP Cup 2024 System Description0
Team JUST at the MADAR Shared Task on Arabic Fine-Grained Dialect Identification0
Team “NoConflict” at CASE 2021 Task 1: Pretraining for Sentence-Level Protest Event Detection0
Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track0
Technical report on Conversational Question Answering0
Technical Report on Shared Task in DialDoc210
Technical report on target classification in SAR track0
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise0
TeLL Me what you cant see0
Temporal-Clustering Invariance in Irregular Healthcare Time Series0
Temporal Variability and Multi-Viewed Self-Supervised Representations to Tackle the ASVspoof5 Deepfake Challenge0
Tencent Neural Machine Translation Systems for WMT180
Tencent Neural Machine Translation Systems for the WMT20 News Translation Task0
Tencent Translation System for the WMT21 News Translation Task0
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney0
TermMind: Alibaba’s WMT21 Machine Translation Using Terminologies Task Submission0
Tertiary Lymphoid Structures Generation through Graph-based Diffusion0
Tesla at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Transformer-based Models with Data Augmentation0
Test-Time Augmentation for 3D Point Cloud Classification and Segmentation0
Test-Time Augmentation Meets Variational Bayes0
Test-time image-to-image translation ensembling improves out-of-distribution generalization in histopathology0
Test-time Training for Hyperspectral Image Super-resolution0
Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model0
TextAttack: Lessons learned in designing Python frameworks for NLP0
Text Augmentation in a Multi-Task View0
TextAug: Test time Text Augmentation for Multimodal Person Re-identification0
Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer's Disease Detection0
Text clustering applied to data augmentation in legal contexts0
Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities0
Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models0
Text Data Augmentation: Towards better detection of spear-phishing emails0
Text Detection on Technical Drawings for the Digitization of Brown-field Processes0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Text Intimacy Analysis using Ensembles of Multilingual Transformers0
TextMosaic: A New Data Augmentation Method for Named Entity Recognition Using Document-Level Contexts0
Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili0
Textual Data Augmentation for Efficient Active Learning on Tiny Datasets0
Textual Data Augmentation for Patient Outcomes Prediction0
Texture Synthesis Guided Deep Hashing for Texture Image Retrieval0
Thai Financial Domain Adaptation of THaLLE -- Technical Report0
ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents0
That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using \#petpeeve Tweets0
THDA: Treasure Hunt Data Augmentation for Semantic Navigation0
The 2021 NIST Speaker Recognition Evaluation0
The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge0
The 2ST-UNet for Pneumothorax Segmentation in Chest X-Rays using ResNet34 as a Backbone for U-Net0
Show:102550
← PrevPage 108 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified