SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 46014650 of 8378 papers

TitleStatusHype
Tencent Translation System for the WMT21 News Translation Task0
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney0
TermMind: Alibaba’s WMT21 Machine Translation Using Terminologies Task Submission0
Tertiary Lymphoid Structures Generation through Graph-based Diffusion0
Tesla at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Transformer-based Models with Data Augmentation0
Test-Time Augmentation for 3D Point Cloud Classification and Segmentation0
Test-Time Augmentation Meets Variational Bayes0
Test-time image-to-image translation ensembling improves out-of-distribution generalization in histopathology0
Test-time Training for Hyperspectral Image Super-resolution0
Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model0
TextAttack: Lessons learned in designing Python frameworks for NLP0
Text Augmentation in a Multi-Task View0
TextAug: Test time Text Augmentation for Multimodal Person Re-identification0
Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer's Disease Detection0
Text clustering applied to data augmentation in legal contexts0
Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities0
Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models0
Text Data Augmentation: Towards better detection of spear-phishing emails0
Text Detection on Technical Drawings for the Digitization of Brown-field Processes0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Text Intimacy Analysis using Ensembles of Multilingual Transformers0
TextMosaic: A New Data Augmentation Method for Named Entity Recognition Using Document-Level Contexts0
Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili0
Textual Data Augmentation for Efficient Active Learning on Tiny Datasets0
Textual Data Augmentation for Patient Outcomes Prediction0
Texture Synthesis Guided Deep Hashing for Texture Image Retrieval0
Thai Financial Domain Adaptation of THaLLE -- Technical Report0
ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents0
That's So Annoying!!!: A Lexical and Frame-Semantic Embedding Based Data Augmentation Approach to Automatic Categorization of Annoying Behaviors using \#petpeeve Tweets0
THDA: Treasure Hunt Data Augmentation for Semantic Navigation0
The 2021 NIST Speaker Recognition Evaluation0
The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge0
The 2ST-UNet for Pneumothorax Segmentation in Chest X-Rays using ResNet34 as a Backbone for U-Net0
The ADAPT Centre’s Neural MT Systems for the WAT 2020 Document-Level Translation Task0
The AI Mechanic: Acoustic Vehicle Characterization Neural Networks0
The ASRU 2019 Mandarin-English Code-Switching Speech Recognition Challenge: Open Datasets, Tracks, Methods and Results0
The Benefits of Mixup for Feature Learning0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
The Curious Case of Benign Memorization0
The data augmentation algorithm0
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion0
The Dual-use Dilemma in LLMs: Do Empowering Ethical Capacities Make a Degraded Utility?0
The Effectiveness of Data Augmentation for Detection of Gastrointestinal Diseases from Endoscopical Images0
The Effect of Data Augmentation on Classification of Atrial Fibrillation in Short Single-Lead ECG Signals Using Deep Neural Networks0
The effects of gender bias in word embeddings on depression prediction0
The Effects of Hallucinations in Synthetic Training Data for Relation Extraction0
The Effects of Regularization and Data Augmentation are Class Dependent0
The Efficacy of Semantics-Preserving Transformations in Self-Supervised Learning for Medical Ultrasound0
Show:102550
← PrevPage 93 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified