SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 251300 of 822 papers

TitleStatusHype
CONVERSER: Few-Shot Conversational Dense Retrieval with Synthetic Data GenerationCode0
PATE-GAN: Generating Synthetic Data with Differential Privacy GuaranteesCode0
Advancing Post-OCR Correction: A Comparative Study of Synthetic DataCode0
Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck PerspectiveCode0
PAXQA: Generating Cross-lingual Question Answering Examples at Training ScaleCode0
Scaling While Privacy Preserving: A Comprehensive Synthetic Tabular Data Generation and Evaluation in Learning AnalyticsCode0
Content Disentanglement for Semantically Consistent Synthetic-to-Real Domain AdaptationCode0
Optimization-Free Universal Watermark Forgery with Regenerative Diffusion ModelsCode0
Online Data Augmentation for Forecasting with Deep LearningCode0
Optimizing Synthetic Data for Enhanced Pancreatic Tumor SegmentationCode0
Conditioning on Time is All You Need for Synthetic Survival Data GenerationCode0
Neural Descriptors: Self-Supervised Learning of Robust Local Surface Descriptors Using Polynomial PatchesCode0
Neural spell-checker: Beyond words with synthetic data generationCode0
Adaptation of Back-translation to Automatic Post-Editing for Synthetic Data GenerationCode0
A text-to-tabular approach to generate synthetic patient data using LLMsCode0
Few-shot_LLM_Synthetic_Data_with_Distribution_MatchingCode0
LLM-based Automated Theorem Proving Hinges on Scalable Synthetic Data GenerationCode0
MC-GEN:Multi-level Clustering for Private Synthetic Data GenerationCode0
MMM and MMMSynth: Clustering of heterogeneous tabular data, and synthetic data generationCode0
Combining propensity score methods with variational autoencoders for generating synthetic data in presence of latent sub-groupsCode0
Comparative Study of Differentially Private Synthetic Data Algorithms from the NIST PSCR Differential Privacy Synthetic Data ChallengeCode0
A Systematic Evaluation of Generative Models on Tabular Transportation DataCode0
Exploring the Limits of Synthetic Creation of Solar EUV Images via Image-to-Image TranslationCode0
Joint Selection: Adaptively Incorporating Public Information for Private Synthetic DataCode0
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed EnvironmentsCode0
Combining Data Generation and Active Learning for Low-Resource Question AnsweringCode0
Improving text-conditioned latent diffusion for cancer pathologyCode0
A Little Human Data Goes A Long WayCode0
Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information RetrievalCode0
Improved Generation of Synthetic Imaging Data Using Feature-Aligned DiffusionCode0
HP-GAN: Probabilistic 3D human motion prediction via GANCode0
A Survey on Deep Learning for Skin Lesion SegmentationCode0
HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge DistillationCode0
A Statistical Approach for Synthetic EEG Data GenerationCode0
Improving Generalization of Synthetically Trained Sonar Image Descriptors for Underwater Place RecognitionCode0
GUIDE-VAE: Advancing Data Generation with User Information and Pattern DictionariesCode0
Enhancing Metabolic Syndrome Prediction with Hybrid Data Balancing and CounterfactualsCode0
Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation ExtractionCode0
Hide-and-Seek Privacy ChallengeCode0
A Linear Reconstruction Approach for Attribute Inference Attacks against Synthetic DataCode0
Enhancing human action recognition with GAN-based data augmentationCode0
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System ResponsesCode0
Enhancing Domain-Specific Retrieval-Augmented Generation: Synthetic Data Generation and Evaluation using Reasoning ModelsCode0
How Good Are Synthetic Requirements ? Evaluating LLM-Generated Datasets for AI4RECode0
Comparative study of models trained on synthetic data for Ukrainian grammatical error correctionCode0
Little Giants: Synthesizing High-Quality Embedding Data at ScaleCode0
IR2: Information Regularization for Information RetrievalCode0
LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval TaskCode0
Child Face Recognition at Scale: Synthetic Data Generation and Performance BenchmarkCode0
Flexible Generation of Preference Data for Recommendation AnalysisCode0
Show:102550
← PrevPage 6 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1corGANAUROC0.92Unverified
2GANAUROC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1kiNETGANEMD0.07Unverified
2CTGANEMD0.07Unverified