SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 351375 of 8378 papers

TitleStatusHype
7T MRI Synthesization from 3T AcquisitionsCode1
Dataset Condensation for Time Series Classification via Dual Domain MatchingCode1
Repeated Padding for Sequential RecommendationCode1
CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization PerspectiveCode1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via PromptsCode1
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code GeneratorsCode1
FedHCDR: Federated Cross-Domain Recommendation with Hypergraph Signal DecouplingCode1
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language ModelsCode1
Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image ClassificationCode1
WHU-Synthetic: A Synthetic Perception Dataset for 3-D Multitask Model ResearchCode1
3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labellingCode1
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual ExamplesCode1
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data AugmentationCode1
Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation StrategiesCode1
Parametric Augmentation for Time Series Contrastive LearningCode1
ExaRanker-Open: Synthetic Explanation for IR using Open-Source LLMsCode1
CultureLLM: Incorporating Cultural Differences into Large Language ModelsCode1
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning DatasetCode1
SPARQL Generation: an analysis on fine-tuning OpenLLaMA for Question Answering over a Life Science Knowledge GraphCode1
CodeIt: Self-Improving Language Models with Prioritized Hindsight ReplayCode1
Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced SegmentationCode1
TimeSiam: A Pre-Training Framework for Siamese Time-Series ModelingCode1
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapesCode1
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of ViewCode1
Show:102550
← PrevPage 15 of 336Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified