SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Synthetic Data Generation
Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 26–50 of 822 papers
Title
Date
Tasks
Status
Hype
IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data
Jun 3, 2025
Attribute
Synthetic Data Generation
—
Unverified
0
Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models
Jun 3, 2025
Synthetic Data Generation
—
Unverified
0
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability
Jun 2, 2025
Descriptive
Synthetic Data Generation
Code
Code Available
1
SMOTE-DP: Improving Privacy-Utility Tradeoff with Synthetic Data
Jun 2, 2025
Privacy Preserving
Synthetic Data Generation
—
Unverified
0
dpmm: Differentially Private Marginal Models, a Library for Synthetic Tabular Data Generation
May 31, 2025
Synthetic Data Generation
Tabular Data Generation
Code
Code Available
1
VietMix: A Naturally Occurring Vietnamese-English Code-Mixed Corpus with Iterative Augmentation for Machine Translation
May 30, 2025
Machine Translation
Synthetic Data Generation
—
Unverified
0
Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison
May 30, 2025
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
—
Unverified
0
CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis
May 29, 2025
Contrastive Learning
Diversity
—
Unverified
0
StressTest: Can YOUR Speech LM Handle the Stress?
May 28, 2025
Question Answering
Sentence
—
Unverified
0
Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection
May 28, 2025
Diversity
Synthetic Data Generation
Code
Code Available
1
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
May 27, 2025
Image Retrieval
Retrieval
Code
Code Available
1
Phir Hera Fairy: An English Fairytaler is a Strong Faker of Fluent Speech in Low-Resource Indian Languages
May 27, 2025
Synthetic Data Generation
Voice Cloning
—
Unverified
0
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
May 26, 2025
Question Answering
Synthetic Data Generation
Code
Code Available
4
Improving Heart Rejection Detection in XPCI Images Using Synthetic Data Augmentation
May 26, 2025
Data Augmentation
Synthetic Data Generation
—
Unverified
0
Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations
May 26, 2025
All
Diagnostic
—
Unverified
0
From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data
May 26, 2025
cross-modal alignment
Instruction Following
—
Unverified
0
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
May 26, 2025
Benchmarking
Optical Flow Estimation
—
Unverified
0
SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback
May 26, 2025
Prompt Learning
Question Answering
—
Unverified
0
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments
May 26, 2025
Data-free Knowledge Distillation
Federated Learning
Code
Code Available
0
PIGPVAE: Physics-Informed Gaussian Process Variational Autoencoders
May 25, 2025
Diversity
Synthetic Data Generation
—
Unverified
0
The Prompt is Mightier than the Example
May 24, 2025
In-Context Learning
Synthetic Data Generation
—
Unverified
0
Large language model as user daily behavior data generator: balancing population diversity and individual personality
May 23, 2025
Data Augmentation
Diversity
—
Unverified
0
Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review
May 22, 2025
Federated Learning
GPU
—
Unverified
0
V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation
May 22, 2025
Event-based vision
Optical Flow Estimation
Code
Code Available
1
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation
May 21, 2025
Language Modeling
Language Modelling
—
Unverified
0
Show:
10
25
50
← Prev
Page 2 of 33
Next →
All datasets
UCI Epileptic Seizure Recognition
UNSW-NB15
Benchmark Results
▼
UCI Epileptic Seizure Recognition
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
corGAN
AUROC
0.92
—
Unverified
2
GAN
AUROC
0.87
—
Unverified
▼
UNSW-NB15
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
kiNETGAN
EMD
0.07
—
Unverified
2
CTGAN
EMD
0.07
—
Unverified