SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Synthetic Data Generation
Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 126–150 of 822 papers
Title
Date
Tasks
Status
Hype
Using matrix-product states for time-series machine learning
Dec 20, 2024
Astronomy
Imputation
Code
Code Available
1
Characterization and Greedy Learning of Gaussian Structural Causal Models under Unknown Interventions
Nov 27, 2022
Synthetic Data Generation
Code
Code Available
1
DFNet: Enhance Absolute Pose Regression with Direct Feature Matching
Apr 1, 2022
Camera Pose Estimation
Camera Relocalization
Code
Code Available
1
DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data Generation
Feb 26, 2020
Privacy Preserving
Sensitivity
Code
Code Available
1
Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance
Aug 30, 2022
3D Face Modelling
Face Model
Code
Code Available
1
A Comprehensive Survey of Synthetic Tabular Data Generation
Apr 23, 2025
Privacy Preserving
Survey
Code
Code Available
1
Diffusion-HPC: Synthetic Data Generation for Human Mesh Recovery in Challenging Domains
Mar 16, 2023
Human Mesh Recovery
Synthetic Data Generation
Code
Code Available
1
dpart: Differentially Private Autoregressive Tabular, a General Framework for Synthetic Data Generation
Jul 12, 2022
Synthetic Data Generation
Code
Code Available
1
dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation
May 31, 2025
Synthetic Data Generation
Tabular Data Generation
Code
Code Available
1
CorGAN: Correlation-Capturing Convolutional Generative Adversarial Networks for Generating Synthetic Healthcare Records
Jan 25, 2020
Disease Prediction
General Classification
Code
Code Available
1
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
May 27, 2025
Image Retrieval
Retrieval
Code
Code Available
1
CLIPPER: Compression enables long-context synthetic data generation
Feb 20, 2025
Claim Verification
Synthetic Data Generation
Code
Code Available
1
Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes
Jan 29, 2024
Data Augmentation
Sound Event Localization and Detection
Code
Code Available
1
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
Jan 21, 2025
Synthetic Data Generation
World Knowledge
Code
Code Available
1
Exploring Transformer Text Generation for Medical Dataset Augmentation
May 1, 2020
Synthetic Data Generation
Text Generation
Code
Code Available
1
Differentially Private Synthetic Medical Data Generation using Convolutional GANs
Dec 22, 2020
Deep Learning
image-classification
Code
Code Available
1
EEG Synthetic Data Generation Using Probabilistic Diffusion Models
Mar 6, 2023
Brain Computer Interface
Data Augmentation
Code
Code Available
1
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
May 25, 2023
Computed Tomography (CT)
Image Generation
Code
Code Available
1
Copula-based synthetic data augmentation for machine-learning emulators
Dec 16, 2020
BIG-bench Machine Learning
Data Augmentation
Code
Code Available
1
BLEUBERI: BLEU is a surprisingly effective reward for instruction following
May 16, 2025
Instruction Following
Synthetic Data Generation
Code
Code Available
1
Generating Synthetic Handwritten Historical Documents With OCR Constrained GANs
Mar 15, 2021
Optical Character Recognition (OCR)
Synthetic Data Generation
Code
Code Available
1
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS
Apr 25, 2025
Clinical Language Translation
Machine Translation
Code
Code Available
1
GeoPointGAN: Synthetic Spatial Data with Local Label Differential Privacy
May 18, 2022
Management
Privacy Preserving
Code
Code Available
1
Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to Parcel Logistics
Oct 18, 2022
3D Object Detection
3D Reconstruction
Code
Code Available
1
Will we run out of data? Limits of LLM scaling based on human-generated data
Oct 26, 2022
Language Modeling
Language Modelling
Code
Code Available
1
Show:
10
25
50
← Prev
Page 6 of 33
Next →
All datasets
UCI Epileptic Seizure Recognition
UNSW-NB15
Benchmark Results
▼
UCI Epileptic Seizure Recognition
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
corGAN
AUROC
0.92
—
Unverified
2
GAN
AUROC
0.87
—
Unverified
▼
UNSW-NB15
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
kiNETGAN
EMD
0.07
—
Unverified
2
CTGAN
EMD
0.07
—
Unverified