SOTAVerified

Data Augmentation

Data augmentation comprises techniques that expand a training dataset by applying modifications to its existing examples. It not only grows the dataset but also increases its diversity. When training machine learning models, data augmentation acts as a regularizer and helps prevent overfitting.

Data augmentation has proven useful in domains such as computer vision and NLP. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include token swapping, deletion, and random insertion, among others.
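The NLP operations mentioned above can be sketched in a few lines of plain Python, alongside a horizontal-flip example for images. This is a minimal illustration, not any particular library's API; the function names are our own:

```python
import random

def random_swap(tokens, n=1, rng=random):
    """Swap two randomly chosen token positions, n times."""
    tokens = tokens[:]
    for _ in range(n):
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens

def random_deletion(tokens, p=0.2, rng=random):
    """Drop each token with probability p, keeping at least one."""
    kept = [t for t in tokens if rng.random() > p]
    return kept or [rng.choice(tokens)]

def random_insertion(tokens, n=1, rng=random):
    """Insert n copies of randomly chosen existing tokens at random positions."""
    tokens = tokens[:]
    for _ in range(n):
        tokens.insert(rng.randrange(len(tokens) + 1), rng.choice(tokens))
    return tokens

def horizontal_flip(image):
    """Mirror a 2-D list of pixel values left-to-right."""
    return [row[::-1] for row in image]

sentence = "data augmentation increases dataset diversity".split()
print(random_swap(sentence))
print(random_deletion(sentence))
print(random_insertion(sentence))
print(horizontal_flip([[1, 2, 3], [4, 5, 6]]))
```

Each call produces a perturbed copy of the input, so applying these functions repeatedly to a corpus yields many distinct training examples from the same source data.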

( Image credit: Albumentations )

Papers

Showing 651–700 of 8378 papers

Title | Status | Hype
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning | Code | 1
On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline | Code | 1
Robust and Explainable Identification of Logical Fallacies in Natural Language Arguments | Code | 1
Towards Scale Balanced 6-DoF Grasp Detection in Cluttered Scenes | Code | 1
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion | Code | 1
ObjectStitch: Generative Object Compositing | Code | 1
CL4CTR: A Contrastive Learning Framework for CTR Prediction | Code | 1
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles | Code | 1
RGB no more: Minimally-decoded JPEG Vision Transformers | Code | 1
Rethinking Data Augmentation for Single-source Domain Generalization in Medical Image Segmentation | Code | 1
Towards Good Practices for Missing Modality Robust Action Recognition | Code | 1
Pose-disentangled Contrastive Learning for Self-supervised Facial Representation | Code | 1
Mitigating and Evaluating Static Bias of Action Representations in the Background and the Foreground | Code | 1
Mitigating Data Sparsity for Short Text Topic Modeling by Topic-Semantic Contrastive Learning | Code | 1
Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket | Code | 1
Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence Modeling | Code | 1
RoentGen: Vision-Language Foundation Model for Chest X-ray Generation | Code | 1
ModelDiff: A Framework for Comparing Learning Algorithms | Code | 1
Multimodal Data Augmentation for Visual-Infrared Person ReID with Corrupted Data | Code | 1
Fed-TDA: Federated Tabular Data Augmentation on Non-IID Data | Code | 1
EVNet: An Explainable Deep Network for Dimension Reduction | Code | 1
RIC-CNN: Rotation-Invariant Coordinate Convolutional Neural Network | Code | 1
Background-Mixed Augmentation for Weakly Supervised Change Detection | Code | 1
SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes | Code | 1
GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation | Code | 1
A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition | Code | 1
CST5: Data Augmentation for Code-Switched Semantic Parsing | Code | 1
MDFlow: Unsupervised Optical Flow Learning by Reliable Mutual Knowledge Distillation | Code | 1
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering | Code | 1
IRNet: Iterative Refinement Network for Noisy Partial Label Learning | Code | 1
Soft Augmentation for Image Classification | Code | 1
GOOD-D: On Unsupervised Graph Out-Of-Distribution Detection | Code | 1
Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift | Code | 1
Effective Audio Classification Network Based on Paired Inverse Pyramid Structure and Dense MLP Block | Code | 1
Conditional Generative Models for Simulation of EMG During Naturalistic Movements | Code | 1
Self-supervised Character-to-Character Distillation for Text Recognition | Code | 1
Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality | Code | 1
Rethinking and Improving Robustness of Convolutional Neural Networks: a Shapley Value-based Approach in Frequency Domain | Code | 1
Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning Tasks | Code | 1
Differentiable Data Augmentation for Contrastive Sentence Representation Learning | Code | 1
Speaker Representation Learning via Contrastive Loss with Maximal Speaker Separability | Code | 1
RoChBert: Towards Robust BERT Fine-tuning for Chinese | Code | 1
U-Net-based Models for Skin Lesion Segmentation: More Attention and Augmentation | Code | 1
Domain Adaptive Object Detection for Autonomous Driving under Foggy Weather | Code | 1
ADLight: A Universal Approach of Traffic Signal Control with Augmented Data Using Reinforcement Learning | Code | 1
Contrastive Representation Learning for Gaze Estimation | Code | 1
Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data Augmentation | Code | 1
Neural Eigenfunctions Are Structured Representation Learners | Code | 1
Exploring Representation-Level Augmentation for Code Search | Code | 1
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control | Code | 1
Page 14 of 168

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | DeiT-B (+MixPro) | Accuracy (%) | 82.9 | | Unverified
2 | ResNet-200 (DeepAA) | Accuracy (%) | 81.32 | | Unverified
3 | DeiT-S (+MixPro) | Accuracy (%) | 81.3 | | Unverified
4 | ResNet-200 (Fast AA) | Accuracy (%) | 80.6 | | Unverified
5 | ResNet-200 (UA) | Accuracy (%) | 80.4 | | Unverified
6 | ResNet-200 (AA) | Accuracy (%) | 80 | | Unverified
7 | ResNet-50 (DeepAA) | Accuracy (%) | 78.3 | | Unverified
8 | ResNet-50 (TA wide) | Accuracy (%) | 78.07 | | Unverified
9 | ResNet-50 (LoRot-E) | Accuracy (%) | 77.72 | | Unverified
10 | ResNet-50 (LoRot-I) | Accuracy (%) | 77.71 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | WideResNet-40-2 (Faster AA) | Percentage error | 3.7 | | Unverified
2 | Shake-Shake (26 2×32d) (Faster AA) | Percentage error | 2.7 | | Unverified
3 | WideResNet-28-10 (Faster AA) | Percentage error | 2.6 | | Unverified
4 | Shake-Shake (26 2×112d) (Faster AA) | Percentage error | 2 | | Unverified
5 | Shake-Shake (26 2×96d) (Faster AA) | Percentage error | 2 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | DiffAug | Classification Accuracy | 92.7 | | Unverified
2 | PaCMAP | Classification Accuracy | 85.3 | | Unverified
3 | hNNE | Classification Accuracy | 77.4 | | Unverified
4 | TopoAE | Classification Accuracy | 74.6 | | Unverified