SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 63016350 of 8378 papers

TitleStatusHype
LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge0
LTCXNet: Advancing Chest X-Ray Analysis with Solutions for Long-Tailed Multi-Label Classification and Fairness Challenges0
Lumbar Spine Tumor Segmentation and Localization in T2 MRI Images Using AI0
Lungmix: A Mixup-Based Strategy for Generalization in Respiratory Sound Classification0
LungRN+NL: An Improved Adventitious Lung Sound Classification Using Non-Local Block ResNet Neural Network with Mixup Data Augmentation0
Lung Sound Classification Using Co-tuning and Stochastic Normalization0
LuxemBERT: Simple and Practical Data Augmentation in Language Model Pre-Training for Luxembourgish0
Lyb3b at SemEval-2018 Task 11: Machine Comprehension Task using Deep Learning Models0
M2R2: Missing-Modality Robust emotion Recognition framework with iterative data augmentation0
M3ST: Mix at Three Levels for Speech Translation0
Maastricht University’s Multilingual Speech Translation System for IWSLT 20210
MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion0
Machine-agnostic Automated Lumbar MRI Segmentation using a Cascaded Model Based on Generative Neurons0
Machine Learning Algorithms for Breast Cancer Detection in Mammography Images: A Comparative Study0
Machine-Learning-Assisted Photonic Device Development: A Multiscale Approach from Theory to Characterization0
Deep Learning Based Walking Tasks Classification in Older Adults using fNIRS0
Machine Learning based Post Processing Artifact Reduction in HEVC Intra Coding0
Machine Learning in Proton Exchange Membrane Water Electrolysis -- Part I: A Knowledge-Integrated Framework0
Machine Learning with Physics Knowledge for Prediction: A Survey0
Machine Reading Comprehension as Data Augmentation: A Case Study on Implicit Event Argument Extraction0
Large Language Models for Document-Level Event-Argument Data Augmentation for Challenging Role Types0
MAFIA: Multi-Adapter Fused Inclusive LanguAge Models0
Reformulation for Pretraining Data Augmentation0
MAGE: Multi-Head Attention Guided Embeddings for Low Resource Sentiment Classification0
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness0
MAIRA-1: A specialised large multimodal model for radiology report generation0
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation0
Changing the Training Data Distribution to Reduce Simplicity Bias Improves In-distribution Generalization0
Making Invisible Visible: Data-Driven Seismic Inversion with Spatio-temporally Constrained Data Augmentation0
Making LLMs Worth Every Penny: Resource-Limited Text Classification in Banking0
MalMixer: Few-Shot Malware Classification with Retrieval-Augmented Semi-Supervised Learning0
MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation0
MALTO at SemEval-2024 Task 6: Leveraging Synthetic Data for LLM Hallucination Detection0
Manitest: Are classifiers really invariant?0
MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis0
Mapping fNIRS to fMRI with Neural Data Augmentation and Machine Learning Models0
Markov Game Video Augmentation for Action Segmentation0
Marvolo: Programmatic Data Augmentation for Practical ML-Driven Malware Detection0
Mask2Lesion: Mask-Constrained Adversarial Skin Lesion Image Synthesis0
Mask-based Data Augmentation for Semi-supervised Semantic Segmentation0
Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling0
Masked Clinical Modelling: A Framework for Synthetic and Augmented Survival Data Generation0
Masked Conditional Diffusion Model for Enhancing Deepfake Detection0
Masked Conditional Diffusion Models for Image Analysis with Application to Radiographic Diagnosis of Infant Abuse0
Masked Contrastive Representation Learning0
Masked Face Recognition with Latent Part Detection0
Metaphorical Paraphrase Generation: Feeding Metaphorical Language Models with Literal Texts0
Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning0
Mask-guided Data Augmentation for Multiparametric MRI Generation with a Rare Hepatocellular Carcinoma0
Mask-Guided Portrait Editing with Conditional GANs0
Show:102550
← PrevPage 127 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified