SOTAVerified

Data Augmentation

Data augmentation involves techniques used for increasing the amount of data, based on different modifications, to expand the amount of examples in the original dataset. Data augmentation not only helps to grow the dataset but it also increases the diversity of the dataset. When training machine learning models, data augmentation acts as a regularizer and helps to avoid overfitting.

Data augmentation techniques have been found useful in domains like NLP and computer vision. In computer vision, transformations like cropping, flipping, and rotation are used. In NLP, data augmentation techniques can include swapping, deletion, random insertion, among others.

Further readings:

( Image credit: Albumentations )

Papers

Showing 51100 of 8378 papers

TitleStatusHype
HSIGene: A Foundation Model For Hyperspectral Image GenerationCode2
FreeVC: Towards High-Quality Text-Free One-Shot Voice ConversionCode2
InPars: Data Augmentation for Information Retrieval using Large Language ModelsCode2
Fixing the train-test resolution discrepancy: FixEfficientNetCode2
Generative Adversarial Network in Medical Imaging: A ReviewCode2
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical ReasoningCode2
LightGCL: Simple Yet Effective Graph Contrastive Learning for RecommendationCode2
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 LanguagesCode2
MindBridge: A Cross-Subject Brain Decoding FrameworkCode2
Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event DetectionCode2
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth EstimationCode2
Morphological Symmetries in RoboticsCode2
Identity Decoupling for Multi-Subject Personalization of Text-to-Image ModelsCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
Mustango: Toward Controllable Text-to-Music GenerationCode2
Enhancing the Utility of Privacy-Preserving Cancer Classification using Synthetic DataCode2
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)Code2
Exploring Color Invariance through Image-Level Ensemble LearningCode2
Effective Data Augmentation With Diffusion ModelsCode2
GuardReasoner-VL: Safeguarding VLMs via Reinforced ReasoningCode2
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification TasksCode2
External Knowledge Injection for CLIP-Based Class-Incremental LearningCode2
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future DirectionsCode2
Delving into the Trajectory Long-tail Distribution for Muti-object TrackingCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Deep learning for time series classificationCode2
Deep Visual Geo-localization BenchmarkCode2
Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future DirectionsCode2
Fast-BEV: A Fast and Strong Bird's-Eye View Perception BaselineCode2
Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology imagesCode2
Composed Multi-modal Retrieval: A Survey of Approaches and ApplicationsCode2
Deep PCB To COCO ConvertorCode2
CodeS: Towards Building Open-source Language Models for Text-to-SQLCode2
Conditional Diffusion Models for Semantic 3D Brain MRI SynthesisCode2
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data GenerationCode2
Depth Field Networks for Generalizable Multi-view Scene RepresentationCode2
Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation frameworkCode2
EarthLoc: Astronaut Photography Localization by Indexing Earth from SpaceCode2
BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with TransformerCode2
EdgeYOLO: An Edge-Real-Time Object DetectorCode2
BOP Challenge 2020 on 6D Object LocalizationCode2
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual BackbonesCode2
Efficient Training of Language Models to Fill in the MiddleCode2
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
Calib3D: Calibrating Model Preferences for Reliable 3D Scene UnderstandingCode2
Fixing the train-test resolution discrepancyCode2
BiRP: Learning Robot Generalized Bimanual Coordination using Relative Parameterization Method on Human DemonstrationCode2
GAN-Supervised Dense Visual AlignmentCode2
Addressing Concept Shift in Online Time Series Forecasting: Detect-then-AdaptCode2
BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data TrainingCode2
Show:102550
← PrevPage 2 of 168Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeiT-B (+MixPro)Accuracy (%)82.9Unverified
2ResNet-200 (DeepAA)Accuracy (%)81.32Unverified
3DeiT-S (+MixPro)Accuracy (%)81.3Unverified
4ResNet-200 (Fast AA)Accuracy (%)80.6Unverified
5ResNet-200 (UA)Accuracy (%)80.4Unverified
6ResNet-200 (AA)Accuracy (%)80Unverified
7ResNet-50 (DeepAA)Accuracy (%)78.3Unverified
8ResNet-50 (TA wide)Accuracy (%)78.07Unverified
9ResNet-50 (LoRot-E)Accuracy (%)77.72Unverified
10ResNet-50 (LoRot-I)Accuracy (%)77.71Unverified
#ModelMetricClaimedVerifiedStatus
1WideResNet-40-2 (Faster AA)Percentage error3.7Unverified
2Shake-Shake (26 2×32d) (Faster AA)Percentage error2.7Unverified
3WideResNet-28-10 (Faster AA)Percentage error2.6Unverified
4Shake-Shake (26 2×96d) (Faster AA)Percentage error2Unverified
5Shake-Shake (26 2×112d) (Faster AA)Percentage error2Unverified
#ModelMetricClaimedVerifiedStatus
1DiffAugClassification Accuracy92.7Unverified
2PaCMAPClassification Accuracy85.3Unverified
3hNNEClassification Accuracy77.4Unverified
4TopoAEClassification Accuracy74.6Unverified