SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 20512100 of 8378 papers

TitleStatusHype
Interpretable Solutions for Breast Cancer Diagnosis with Grammatical Evolution and Data Augmentation0
Integrating Large Language Models into Recommendation via Mutual Augmentation and Adaptive Aggregation0
Diffusion-based Data Augmentation for Object Counting Problems0
Language Modelling Approaches to Adaptive Machine Translation0
Machine Learning in Proton Exchange Membrane Water Electrolysis -- Part I: A Knowledge-Integrated Framework0
Can GPT-3.5 Generate and Code Discharge Summaries?Code0
Catch-Up Mix: Catch-Up Class for Struggling Filters in CNN0
NIV-SSD: Neighbor IoU-Voting Single-Stage Object Detector From Point CloudCode0
On Building Myopic MPC Policies using Supervised Learning0
IndiText Boost: Text Augmentation for Low Resource India Languages0
Towards Better Inclusivity: A Diverse Tweet Corpus of English VarietiesCode0
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of ViewCode1
Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic RoomsCode2
Data Augmentation for Traffic Classification0
Exploring Color Invariance through Image-Level Ensemble LearningCode2
SAGE-HB: Swift Adaptation and Generalization in Massive MIMO Hybrid Beamforming0
Depth Anything: Unleashing the Power of Large-Scale Unlabeled DataCode9
Interplay of Semantic Communication and Knowledge Learning0
Learning High-Quality and General-Purpose Phrase RepresentationsCode1
Analyzing and Mitigating Bias for Vulnerable Classes: Towards Balanced Representation in Dataset0
ContextMix: A context-aware data augmentation method for industrial visual inspection systemsCode0
Simple and effective data augmentation for compositional generalization0
Few-shot learning for COVID-19 Chest X-Ray Classification with Imbalanced Data: An Inter vs. Intra Domain StudyCode0
Boosting Few-Shot Segmentation via Instance-Aware Data Augmentation and Local Consensus Guided Cross Attention0
Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph ClassificationCode0
Self-supervised New Activity Detection in Sensor-based Smart Environments0
SymTC: A Symbiotic Transformer-CNN Net for Instance Segmentation of Lumbar Spine MRICode1
On the Effect of Data-Augmentation on Local Embedding Properties in the Contrastive Learning of Music Audio Representations0
Trapped in texture bias? A large scale comparison of deep instance segmentationCode1
Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search0
Efficient Training of Generalizable Visuomotor Policies via Control-Aware Augmentation0
Augmenting Ground-Level PM2.5 Prediction via Kriging-Based Pseudo-Label Generation0
A Deep Hierarchical Feature Sparse Framework for Occluded Person Re-Identification0
Authorship Obfuscation in Multilingual Machine-Generated Text DetectionCode2
Enhanced Few-Shot Class-Incremental Learning via Ensemble Models0
Contrastive Learning with Negative Sampling Correction0
UniVision: A Unified Framework for Vision-Centric 3D PerceptionCode0
Large Language Models Can Learn Temporal ReasoningCode2
Maximum-Entropy Adversarial Audio Augmentation for Keyword Spotting0
Local Gamma Augmentation for Ischemic Stroke Lesion Segmentation on MRI0
Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentationCode0
Adaptive Data Augmentation for Aspect Sentiment Quad PredictionCode0
Enhancing Personality Recognition in Dialogue by Data Augmentation and Heterogeneous Conversational Graph NetworksCode0
Chain of History: Learning and Forecasting with LLMs for Temporal Knowledge Graph Completion0
Evaluating Data Augmentation Techniques for Coffee Leaf Disease Classification0
Knowledge Translation: A New Pathway for Model CompressionCode0
Learning Generalizable Models via Disentangling Spurious and Enhancing Potential Correlations0
Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object DetectionCode0
Content-Conditioned Generation of Stylized Free hand Sketches0
Phase-shifted remote photoplethysmography for estimating heart rate and blood pressure from facial videoCode1
Show:102550
← PrevPage 42 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified