SOTAVerified

Synthetic Data Generation

The generation of tabular data by any means possible.

Papers

Showing 151200 of 822 papers

TitleStatusHype
Scalable and Ethical Insider Threat Detection through Data Synthesis and Analysis by LLMs0
Few-shot_LLM_Synthetic_Data_with_Distribution_MatchingCode0
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM GuardrailsCode1
MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation0
Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data SynthesisCode0
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry20
Automatic Prompt Optimization Techniques: Exploring the Potential for Synthetic Data Generation0
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and DatasetCode1
BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation0
CoddLLM: Empowering Large Language Models for Data Analytics0
XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and GlassesCode1
Synthetic Data Generation for Augmenting Small Samples0
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop DataCode1
Making Sense of Data in the Wild: Data Analysis Automation at Scale0
Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction0
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement0
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic DataCode4
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and RefinementCode1
Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation0
Synthetic Data Generation by Supervised Neural Gas Network for Physiological Emotion Recognition DataCode1
Data Enrichment Opportunities for Distribution Grid Cable Networks using Variational Autoencoders0
Sequential PatchCore: Anomaly Detection for Surface Inspection using Synthetic Impurities0
Generating Realistic Synthetic Head Rotation Data for Extended Reality using Deep Learning0
Quantum Down Sampling Filter for Variational Auto-encoderCode0
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though0
User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation0
Reading with Intent -- Neutralizing Intent0
Advancing the Understanding of Fine-Grained 3D Forest Structures using Digital Cousins and Simulation-to-Reality: Methods and Datasets0
SMIR: Efficient Synthetic Data Pipeline To Improve Multi-Image ReasoningCode0
License Plate Images Generation with Diffusion Models0
Can Synthetic Data be Fair and Private? A Comparative Study of Synthetic Data Generation and Fairness Algorithms0
Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data GenerationCode0
Time Series Language Model for Descriptive Caption Generation0
SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy0
TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data0
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task SynthesisCode3
Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation0
HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge DistillationCode0
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion PlannerCode1
Autonomous Crack Detection using Deep Learning on Synthetic Thermogram Datasets0
Leveraging Contrastive Learning for Semantic Segmentation with Consistent Labels Across Varying Appearances0
Stochastic Model of siRNA Endosomal Escape Mediated by Fusogenic Peptides in OVCAR-3Code0
Improving Equity in Health Modeling with GPT4-Turbo Generated Synthetic Data: A Comparative Study0
Differentially Private Federated Learning of Diffusion Models for Synthetic Tabular Data Generation0
Using matrix-product states for time-series machine learningCode1
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance AnalysisCode1
High-throughput digital twin framework for predicting neurite deterioration using MetaFormer attention0
A Systematic Examination of Preference Learning through the Lens of Instruction-Following0
Synthetic Data Generation for Anomaly Detection on Table GrapesCode0
Auto-Cypher: Improving LLMs on Cypher generation via LLM-supervised generation-verification framework0
Show:102550
← PrevPage 4 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1corGANAUROC0.92Unverified
2GANAUROC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1kiNETGANEMD0.07Unverified
2CTGANEMD0.07Unverified