Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Showing 126–150 of 822 papers
| Title | Date | Tasks | Status | Hype | Score |
| --- | --- | --- | --- | --- | --- |
| Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based Method | Aug 19, 2021 | Benchmarking, Synthetic Data Generation | Code Available | 1 | 5 |
| LEyes: A Lightweight Framework for Deep Learning-Based Eye Tracking using Synthetic Eye Images | Sep 12, 2023 | Gaze Estimation, Synthetic Data Generation | Code Available | 1 | 5 |
| MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS | Apr 25, 2025 | Clinical Language Translation, Machine Translation | Code Available | 1 | 5 |
| D3A-TS: Denoising-Driven Data Augmentation in Time Series | Dec 9, 2023 | Data Augmentation, Denoising | Code Available | 1 | 5 |
| RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization | May 16, 2025 | RAG, Synthetic Data Generation | Code Available | 1 | 5 |
| Leveraging Generative AI Models for Synthetic Data Generation in Healthcare: Balancing Research and Privacy | May 9, 2023 | Synthetic Data Generation | Code Available | 1 | 5 |
| MTSS-GAN: Multivariate Time Series Simulation Generative Adversarial Networks | Jun 26, 2020 | Generative Adversarial Network, Image Generation | Code Available | 1 | 5 |
| ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval | May 27, 2025 | Image Retrieval, Retrieval | Code Available | 1 | 5 |
| Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance | Aug 30, 2022 | 3D Face Modelling, Face Model | Code Available | 1 | 5 |
| Improved Training of Wasserstein GANs | Mar 31, 2017 | Conditional Image Generation, Image Generation | Code Available | 1 | 5 |
| Copula-based synthetic data augmentation for machine-learning emulators | Dec 16, 2020 | BIG-bench Machine Learning, Data Augmentation | Code Available | 1 | 5 |
| CLIPPER: Compression enables long-context synthetic data generation | Feb 20, 2025 | Claim Verification, Synthetic Data Generation | Code Available | 1 | 5 |
| CorGAN: Correlation-Capturing Convolutional Generative Adversarial Networks for Generating Synthetic Healthcare Records | Jan 25, 2020 | Disease Prediction, General Classification | Code Available | 1 | 5 |
| Learning Compact Metrics for MT | Oct 12, 2021 | Cross-Lingual Transfer, Language Modeling | Code Available | 1 | 5 |
| MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures | Mar 20, 2025 | Synthetic Data Generation | Code Available | 1 | 5 |
| Characterization and Greedy Learning of Gaussian Structural Causal Models under Unknown Interventions | Nov 27, 2022 | Synthetic Data Generation | Code Available | 1 | 5 |
| Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Jan 21, 2025 | Synthetic Data Generation, World Knowledge | Code Available | 1 | 5 |
| Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data | Aug 26, 2020 | Decoder, Music Genre Transfer | Code Available | 1 | 5 |
| DP-MERF: Differentially Private Mean Embeddings with Random Features for Practical Privacy-Preserving Data Generation | Feb 26, 2020 | Privacy Preserving, Sensitivity | Code Available | 1 | 5 |
| A Comprehensive Survey of Synthetic Tabular Data Generation | Apr 23, 2025 | Privacy Preserving, Survey | Code Available | 1 | 5 |
| Learning from synthetic data generated with GRADE | May 7, 2023 | Pose Estimation, Synthetic Data Generation | Code Available | 1 | 5 |
| Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability | Jun 2, 2025 | Descriptive, Synthetic Data Generation | Code Available | 1 | 5 |
| Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Jan 1, 2023 | Data Augmentation, Data-free Knowledge Distillation | Code Available | 1 | 5 |
| SocialDial: A Benchmark for Socially-Aware Dialogue Systems | Apr 24, 2023 | Cultural Vocal Bursts Intensity Prediction, Synthetic Data Generation | Code Available | 1 | 5 |
| Will we run out of data? Limits of LLM scaling based on human-generated data | Oct 26, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
Benchmark Results
UCI Epileptic Seizure Recognition (2 submissions; ↑ higher is better)

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | corGAN | AUROC | 0.92 | — | Unverified |
| 2 | GAN | AUROC | 0.87 | — | Unverified |
UNSW-NB15 (2 submissions; ↑ higher is better)

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | kiNETGAN | EMD | 0.07 | — | Unverified |
| 2 | CTGAN | EMD | 0.07 | — | Unverified |