Data Augmentation

Data augmentation is a set of techniques that enlarge a dataset by applying modifications to existing examples, producing new examples from the original ones. Besides growing the dataset, it also increases its diversity. When training machine learning models, data augmentation acts as a regularizer and helps reduce overfitting.

Data augmentation techniques have proven useful in domains such as NLP and computer vision. In computer vision, common transformations include cropping, flipping, and rotation. In NLP, techniques include swapping, deletion, and random insertion, among others.
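The transformations named above can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: images are assumed to be nested lists of pixel values, text is assumed to be a list of tokens, and all function names (`random_crop`, `hflip`, `rotate90`, `random_swap`, `random_deletion`, `random_insertion`) and the `vocab` parameter are hypothetical. In practice an array or augmentation library would normally be used instead.

```python
import random

# --- Image-style augmentations on a 2D grid (nested lists; an assumption) ---

def random_crop(img, size, rng):
    """Return a size x size window taken at a random position."""
    top = rng.randrange(len(img) - size + 1)
    left = rng.randrange(len(img[0]) - size + 1)
    return [row[left:left + size] for row in img[top:top + size]]

def hflip(img):
    """Mirror the grid left to right."""
    return [row[::-1] for row in img]

def rotate90(img):
    """Rotate the grid 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]

# --- Text-style augmentations on a token list ---

def random_swap(tokens, rng):
    """Swap two randomly chosen positions."""
    tokens = list(tokens)
    if len(tokens) >= 2:
        i, j = rng.sample(range(len(tokens)), 2)
        tokens[i], tokens[j] = tokens[j], tokens[i]
    return tokens

def random_deletion(tokens, rng, p=0.1):
    """Drop each token with probability p, keeping at least one token."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept or [rng.choice(tokens)]

def random_insertion(tokens, rng, vocab):
    """Insert one word from a caller-supplied vocab at a random position.
    Drawing from a fixed vocab is a simplification chosen for this sketch."""
    tokens = list(tokens)
    tokens.insert(rng.randrange(len(tokens) + 1), rng.choice(vocab))
    return tokens

rng = random.Random(0)  # seeded so runs are repeatable
image = [[r * 4 + c for c in range(4)] for r in range(4)]
sentence = "the cat sat on the mat".split()

augmented_image = rotate90(hflip(random_crop(image, 3, rng)))
augmented_text = random_insertion(random_swap(sentence, rng), rng, ["small", "old"])
```

Each call produces a new example while leaving the original untouched, so the same source example can be augmented many times with different random draws.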

Papers

Showing 4851–4900 of 8378 papers

Title	Date	Tasks	Status
BioInfo@UAVR@SMM4H’22: Classification and Extraction of Adverse Event mentions in Tweets using Transformer Models	Oct 1, 2022	Data Augmentation	Unverified
Position Offset Label Prediction for Grammatical Error Correction	Oct 1, 2022	Data Augmentation, Decoder	Unverified
MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation	Oct 1, 2022	Data Augmentation, Language Modeling	Unverified
Document-level Event Factuality Identification via Machine Reading Comprehension Frameworks with Transfer Learning	Oct 1, 2022	Data Augmentation, Machine Reading Comprehension	Unverified
Rethinking Data Augmentation in Text-to-text Paradigm	Oct 1, 2022	Data Augmentation	Unverified
Data Augmentation for Improving the Prediction of Validity and Novelty of Argumentative Conclusions	Oct 1, 2022	Data Augmentation	Unverified
Dynamic Nonlinear Mixup with Distance-based Sample Selection	Oct 1, 2022	Data Augmentation	Unverified
Table-based Fact Verification with Self-labeled Keypoint Alignment	Oct 1, 2022	Attribute, Contrastive Learning	Unverified
Summarizing Patients’ Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models	Oct 1, 2022	Data Augmentation, Diagnostic	Unverified
Coordination Generation via Synchronized Text-Infilling	Oct 1, 2022	Data Augmentation, Sentence	Unverified
Augmented Bio-SBERT: Improving Performance for Pairwise Sentence Tasks in Bio-medical Domain	Oct 1, 2022	Data Augmentation, Sentence	Unverified
The Only Chance to Understand: Machine Translation of the Severely Endangered Low-resource Languages of Eurasia	Oct 1, 2022	Data Augmentation, Language Modeling	Unverified
Addressing Limitations of Encoder-Decoder Based Approach to Text-to-SQL	Oct 1, 2022	Data Augmentation, Decoder	Unverified
Probing the Robustness of Pre-trained Language Models for Entity Matching	Oct 1, 2022	Data Augmentation, Domain Generalization	Code Available
Improving Event Temporal Relation Classification via Auxiliary Label-Aware Contrastive Learning	Oct 1, 2022	Contrastive Learning, Data Augmentation	Unverified
Evaluating and Mitigating Inherent Linguistic Bias of African American English through Inference	Oct 1, 2022	Data Augmentation, Diversity	Unverified
KUL@SMM4H’22: Template Augmented Adaptive Pre-training for Tweet Classification	Oct 1, 2022	Data Augmentation, Language Modeling	Unverified
CAISA@SMM4H’22: Robust Cross-Lingual Detection of Disease Mentions on Social Media with Adversarial Methods	Oct 1, 2022	Data Augmentation	Unverified
ParaZh-22M: A Large-Scale Chinese Parabank via Machine Translation	Oct 1, 2022	Data Augmentation, Machine Translation	Unverified
Towards Robust Neural Retrieval with Source Domain Synthetic Pre-Finetuning	Oct 1, 2022	Data Augmentation, Domain Generalization	Unverified
BRCC and SentiBahasaRojak: The First Bahasa Rojak Corpus for Pretraining and Sentiment Analysis Dataset	Oct 1, 2022	Data Augmentation, Sentiment Analysis	Unverified
Towards Summarizing Healthcare Questions in Low-Resource Setting	Oct 1, 2022	Data Augmentation, Diversity	Unverified
Effective Data Augmentation for Sentence Classification Using One VAE per Class	Oct 1, 2022	Binary Classification, Data Augmentation	Unverified
Pseudo-Label Generation and Various Data Augmentation for Semi-Supervised Hyperspectral Object Detection	Oct 1, 2022	Data Augmentation, object-detection	Code Available
Domain Generalization -- A Causal Perspective	Sep 30, 2022	Data Augmentation, Domain Generalization	Unverified
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning	Sep 30, 2022	Data Augmentation, Image Generation	Code Available
Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods	Sep 30, 2022	Computational Efficiency, Data Augmentation	Unverified
Using Knowledge Distillation to improve interpretable models in a retail banking context	Sep 30, 2022	Data Augmentation, Knowledge Distillation	Unverified
Augmentation Backdoors	Sep 29, 2022	Data Augmentation	Code Available
Prompt-guided Scene Generation for 3D Zero-Shot Learning	Sep 29, 2022	Contrastive Learning, Data Augmentation	Unverified
Automatic Data Augmentation via Invariance-Constrained Learning	Sep 29, 2022	Data Augmentation, Image Classification	Code Available
Named Entity Recognition in Industrial Tables using Tabular Language Models	Sep 29, 2022	Data Augmentation, Inductive Bias	Unverified
Contrastive Unsupervised Learning of World Model with Invariant Causal Features	Sep 29, 2022	Data Augmentation, Depth Estimation	Unverified
Weighted Contrastive Hashing	Sep 28, 2022	Contrastive Learning, Data Augmentation	Code Available
Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition	Sep 28, 2022	Data Augmentation, License Plate Recognition	Unverified
Data Augmentation using Feature Generation for Volumetric Medical Images	Sep 28, 2022	Classification, Data Augmentation	Unverified
3D Rendering Framework for Data Augmentation in Optical Character Recognition	Sep 27, 2022	Data Augmentation, Optical Character Recognition	Unverified
TaskMix: Data Augmentation for Meta-Learning of Spoken Intent Understanding	Sep 26, 2022	Data Augmentation, Diversity	Unverified
Ani-GIFs: A benchmark dataset for domain generalization of action recognition from GIFs	Sep 26, 2022	Action Recognition, Animated GIF Generation	Unverified
On the Impact of Speech Recognition Errors in Passage Retrieval for Spoken Question Answering	Sep 26, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Contrastive learning for unsupervised medical image clustering and reconstruction	Sep 24, 2022	Clustering, Contrastive Learning	Unverified
A Simple Strategy to Provable Invariance via Orbit Mapping	Sep 24, 2022	3D Point Cloud Classification, Computational Efficiency	Unverified
Towards Bridging the Space Domain Gap for Satellite Pose Estimation using Event Sensing	Sep 24, 2022	Data Augmentation, Domain Adaptation	Unverified
Semantically Consistent Data Augmentation for Neural Machine Translation via Conditional Masked Language Model	Sep 22, 2022	Data Augmentation, Diversity	Code Available
Automated detection of Alzheimer disease using MRI images and deep neural networks- A review	Sep 22, 2022	Data Augmentation, Deep Learning	Unverified
StyleTime: Style Transfer for Synthetic Time Series Generation	Sep 22, 2022	Data Augmentation, Style Transfer	Unverified
Scope of Pre-trained Language Models for Detecting Conflicting Health Information	Sep 22, 2022	Data Augmentation	Unverified
SR-GCL: Session-Based Recommendation with Global Context Enhanced Augmentation in Contrastive Learning	Sep 22, 2022	Contrastive Learning, Data Augmentation	Unverified
DARTSRepair: Core-failure-set Guided DARTS for Network Robustness to Common Corruptions	Sep 21, 2022	Data Augmentation	Unverified
Exploring Inconsistent Knowledge Distillation for Object Detection with Data Augmentation	Sep 20, 2022	Data Augmentation, Knowledge Distillation	Code Available

Benchmark Results

ImageNet

#	Model	Metric	Claimed	Verified	Status
1	DeiT-B (+MixPro)	Accuracy (%)	82.9	—	Unverified
2	ResNet-200 (DeepAA)	Accuracy (%)	81.32	—	Unverified
3	DeiT-S (+MixPro)	Accuracy (%)	81.3	—	Unverified
4	ResNet-200 (Fast AA)	Accuracy (%)	80.6	—	Unverified
5	ResNet-200 (UA)	Accuracy (%)	80.4	—	Unverified
6	ResNet-200 (AA)	Accuracy (%)	80	—	Unverified
7	ResNet-50 (DeepAA)	Accuracy (%)	78.3	—	Unverified
8	ResNet-50 (TA wide)	Accuracy (%)	78.07	—	Unverified
9	ResNet-50 (LoRot-E)	Accuracy (%)	77.72	—	Unverified
10	ResNet-50 (LoRot-I)	Accuracy (%)	77.71	—	Unverified

CIFAR-10

#	Model	Metric	Claimed	Verified	Status
1	WideResNet-40-2 (Faster AA)	Percentage error	3.7	—	Unverified
2	Shake-Shake (26 2×32d) (Faster AA)	Percentage error	2.7	—	Unverified
3	WideResNet-28-10 (Faster AA)	Percentage error	2.6	—	Unverified
4	Shake-Shake (26 2×112d) (Faster AA)	Percentage error	2	—	Unverified
5	Shake-Shake (26 2×96d) (Faster AA)	Percentage error	2	—	Unverified

GA1457

#	Model	Metric	Claimed	Verified	Status
1	DiffAug	Classification Accuracy	92.7	—	Unverified
2	PaCMAP	Classification Accuracy	85.3	—	Unverified
3	hNNE	Classification Accuracy	77.4	—	Unverified
4	TopoAE	Classification Accuracy	74.6	—	Unverified