SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Synthetic Data Generation
Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 26–50 of 822 papers
Title
Date
Tasks
Status
Hype
Score
End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music
May 20, 2024
Synthetic Data Generation
Code
Code Available
2
5
A Synthetic Dataset for Personal Attribute Inference
Jun 11, 2024
Attribute
Author Profiling
Code
Code Available
2
5
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs
May 12, 2025
AI Agent
Knowledge Distillation
Code
Code Available
2
5
TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series
May 19, 2023
Diversity
Synthetic Data Generation
Code
Code Available
2
5
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting
Jul 21, 2023
Imputation
Probabilistic Time Series Forecasting
Code
Code Available
2
5
Mellow: a small audio language model for reasoning
Mar 11, 2025
Audio captioning
Language Modeling
Code
Code Available
2
5
Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data
Jul 13, 2023
2D Human Pose Estimation
Pose Estimation
Code
Code Available
2
5
Improved Multi-Task Brain Tumour Segmentation with Synthetic Data Augmentation
Nov 7, 2024
Data Augmentation
Synthetic Data Generation
Code
Code Available
2
5
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval
Jul 10, 2023
GPU
Information Retrieval
Code
Code Available
2
5
BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion
Jun 29, 2023
Synthetic Data Generation
Code
Code Available
2
5
REaLTabFormer: Generating Realistic Relational and Tabular Data using Transformers
Feb 4, 2023
Synthetic Data Generation
Code
Code Available
2
5
UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization
Jan 11, 2024
Synthetic Data Generation
Visual Localization
Code
Code Available
2
5
Pedagogical Alignment of Large Language Models
Feb 7, 2024
Synthetic Data Generation
Code
Code Available
2
5
EC-GAN: Low-Sample Classification using Semi-Supervised Algorithms and GANs
Dec 26, 2020
Classification
Data Augmentation
Code
Code Available
1
5
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Mar 7, 2025
Diversity
Fairness
Code
Code Available
1
5
EEG Synthetic Data Generation Using Probabilistic Diffusion Models
Mar 6, 2023
Brain Computer Interface
Data Augmentation
Code
Code Available
1
5
dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation
May 31, 2025
Synthetic Data Generation
Tabular Data Generation
Code
Code Available
1
5
Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction
Sep 1, 2021
Data Poisoning
Knowledge Distillation
Code
Code Available
1
5
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
Feb 7, 2025
Reinforcement Learning (RL)
Synthetic Data Generation
Code
Code Available
1
5
Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains
Mar 16, 2023
Human Mesh Recovery
Synthetic Data Generation
Code
Code Available
1
5
Diffusion-based Conditional ECG Generation with Structured State Space Models
Jan 19, 2023
State Space Models
Synthetic Data Generation
Code
Code Available
1
5
DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data Generation
Feb 26, 2020
Privacy Preserving
Sensitivity
Code
Code Available
1
5
BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages
Nov 7, 2024
automatic-speech-translation
Synthetic Data Generation
Code
Code Available
1
5
Differentially Private Synthetic Medical Data Generation using Convolutional GANs
Dec 22, 2020
Deep Learning
image-classification
Code
Code Available
1
5
dpart: Differentially Private Autoregressive Tabular, a General Framework for Synthetic Data Generation
Jul 12, 2022
Synthetic Data Generation
Code
Code Available
1
5
Show:
10
25
50
← Prev
Page 2 of 33
Next →
All datasets
UCI Epileptic Seizure Recognition
UNSW-NB15
Benchmark Results
▼
UCI Epileptic Seizure Recognition
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
corGAN
AUROC
0.92
—
Unverified
2
GAN
AUROC
0.87
—
Unverified
▼
UNSW-NB15
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
kiNETGAN
EMD
0.07
—
Unverified
2
CTGAN
EMD
0.07
—
Unverified