SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 30013050 of 8378 papers

TitleStatusHype
With a Little Push, NLI Models can Robustly and Efficiently Predict FaithfulnessCode0
GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks0
An Empirical Comparison of LM-based Question and Answer Generation Methods0
Diversify Your Vision Datasets with Automatic Diffusion-Based AugmentationCode1
You Don't Have to Be Perfect to Be Amazing: Unveil the Utility of Synthetic Images0
PDE+: Enhancing Generalization via PDE with Adaptive Distributional DiffusionCode1
Dynamic Data Augmentation via MCTS for Prostate MRI SegmentationCode0
VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large ScaleCode1
Cross-lingual Data Augmentation for Document-grounded Dialog Systems in Low Resource Languages0
OverPrompt: Enhancing ChatGPT through Efficient In-Context LearningCode0
HARD: Hard Augmentations for Robust Distillation0
Training on Thin Air: Improve Image Classification with Generated DataCode1
Prompting Large Language Models for Counterfactual Generation: An Empirical Study0
ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents0
Getting Sick After Seeing a Doctor? Diagnosing and Mitigating Knowledge Conflicts in Event Temporal ReasoningCode0
SmartTrim: Adaptive Tokens and Attention Pruning for Efficient Vision-Language ModelsCode0
Iteratively Improving Speech Recognition and Voice Conversion0
Sorted Convolutional Network for Achieving Continuous Rotational Invariance0
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited QuestionsCode0
Siamese Masked Autoencoders0
Conversational Recommendation as Retrieval: A Simple, Strong Baseline0
Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation0
Understanding Compositional Data Augmentation in Typologically Diverse Morphological InflectionCode0
LLM-powered Data Augmentation for Enhanced Cross-lingual PerformanceCode0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Subspace-Configurable NetworksCode0
Self-Evolution Learning for Mixup: Enhance Data Augmentation on Few-Shot Text Classification Tasks0
ConvBoost: Boosting ConvNets for Sensor-based Activity RecognitionCode0
ColMix -- A Simple Data Augmentation Framework to Improve Object Detector Performance and Robustness in Aerial Images0
Statistical Guarantees of Group-Invariant GANs0
Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals0
Tied-Augment: Controlling Representation Similarity Improves Data AugmentationCode1
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation0
Tokenized Graph Transformer with Neighborhood Augmentation for Node Classification in Large Graphs0
Real-Aug: Realistic Scene Synthesis for LiDAR Augmentation in 3D Object Detection0
Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs CollaborationCode1
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study0
Phased Data Augmentation for Training a Likelihood-Based Generative Model with Limited Data0
PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMsCode1
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding0
Understanding the Effect of Data Augmentation on Knowledge Distillation0
Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical ReasoningCode1
SIDAR: Synthetic Image Dataset for Alignment & RestorationCode0
Boosting Crop Classification by Hierarchically Fusing Satellite, Rotational, and Contextual Data0
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation0
Enhancing Few-shot NER with Prompt Ordering based Data Augmentation0
Data Augmentation for Diverse Voice Conversion in Noisy Environments0
Cross-modality Data Augmentation for End-to-End Sign Language TranslationCode1
Adaptive Graph Contrastive Learning for RecommendationCode1
RobustFair: Adversarial Evaluation through Fairness Confusion Directed Gradient SearchCode0
Show:102550
← PrevPage 61 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified