SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Synthetic Data Generation
Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 101–125 of 822 papers
Title
Date
Tasks
Status
Hype
Score
BLEUBERI: BLEU is a surprisingly effective reward for instruction following
May 16, 2025
Instruction Following
Synthetic Data Generation
Code
Code Available
1
5
DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data Generation
Feb 26, 2020
Privacy Preserving
Sensitivity
Code
Code Available
1
5
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner
Dec 24, 2024
Autonomous Driving
Dataset Generation
Code
Code Available
1
5
AnthroNet: Conditional Generation of Humans via Anthropometrics
Sep 7, 2023
3D human pose and shape estimation
3D Human Reconstruction
Code
Code Available
1
5
Diffusion-based Conditional ECG Generation with Structured State Space Models
Jan 19, 2023
State Space Models
Synthetic Data Generation
Code
Code Available
1
5
Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction
Sep 1, 2021
Data Poisoning
Knowledge Distillation
Code
Code Available
1
5
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs
Feb 23, 2024
Benchmarking
slot-filling
Code
Code Available
1
5
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis
Dec 19, 2024
Data Augmentation
Synthetic Data Generation
Code
Code Available
1
5
Boosting Synthetic Data Generation with Effective Nonlinear Causal Discovery
Jan 18, 2023
Causal Discovery
software testing
Code
Code Available
1
5
RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection
Sep 5, 2022
Human-Object Interaction Detection
Relation
Code
Code Available
1
5
dpart: Differentially Private Autoregressive Tabular, a General Framework for Synthetic Data Generation
Jul 12, 2022
Synthetic Data Generation
Code
Code Available
1
5
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
May 25, 2023
Computed Tomography (CT)
Image Generation
Code
Code Available
1
5
dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation
May 31, 2025
Synthetic Data Generation
Tabular Data Generation
Code
Code Available
1
5
Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai
Nov 23, 2024
Diversity
Question Answering
Code
Code Available
1
5
Generating Multidimensional Clusters With Support Lines
Jan 24, 2023
Clustering
Synthetic Data Generation
Code
Code Available
1
5
EEG Synthetic Data Generation Using Probabilistic Diffusion Models
Mar 6, 2023
Brain Computer Interface
Data Augmentation
Code
Code Available
1
5
EC-GAN: Low-Sample Classification using Semi-Supervised Algorithms and GANs
Dec 26, 2020
Classification
Data Augmentation
Code
Code Available
1
5
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
Feb 4, 2025
3D Object Detection
Autonomous Driving
Code
Code Available
1
5
CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing Industry
Nov 25, 2022
GPU
object-detection
Code
Code Available
1
5
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Mar 7, 2025
Diversity
Fairness
Code
Code Available
1
5
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
Sep 20, 2023
Articles
Decoder
Code
Code Available
1
5
SoftAdapt: Techniques for Adaptive Loss Weighting of Neural Networks with Multi-Part Loss Functions
Dec 27, 2019
Image Reconstruction
Synthetic Data Generation
Code
Code Available
1
5
RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization
May 16, 2025
RAG
Synthetic Data Generation
Code
Code Available
1
5
Generalizing electrocardiogram delineation -- Training convolutional neural networks with synthetic data augmentation
Nov 25, 2021
Data Augmentation
Rhythm
Code
Code Available
1
5
Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs
Mar 15, 2021
Optical Character Recognition (OCR)
Synthetic Data Generation
Code
Code Available
1
5
Show:
10
25
50
← Prev
Page 5 of 33
Next →
All datasets
UCI Epileptic Seizure Recognition
UNSW-NB15
Benchmark Results
▼
UCI Epileptic Seizure Recognition
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
corGAN
AUROC
0.92
—
Unverified
2
GAN
AUROC
0.87
—
Unverified
▼
UNSW-NB15
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
kiNETGAN
EMD
0.07
—
Unverified
2
CTGAN
EMD
0.07
—
Unverified