SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Synthetic Data Generation
Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 76–100 of 822 papers
Title
Date
Tasks
Status
Hype
Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing
Apr 27, 2025
Language Modeling
Language Modelling
—
Unverified
0
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS
Apr 25, 2025
Clinical Language Translation
Machine Translation
Code
Code Available
1
TarDiff: Target-Oriented Diffusion Guidance for Synthetic Electronic Health Record Time Series Generation
Apr 24, 2025
Synthetic Data Generation
Time Series
—
Unverified
0
Optimizing the Privacy-Utility Balance using Synthetic Data and Configurable Perturbation Pipelines
Apr 24, 2025
Privacy Preserving
Synthetic Data Generation
—
Unverified
0
A Comprehensive Survey of Synthetic Tabular Data Generation
Apr 23, 2025
Privacy Preserving
Survey
Code
Code Available
1
ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving
Apr 23, 2025
Code Generation
Synthetic Data Generation
—
Unverified
0
Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code
Apr 23, 2025
Instruction Following
Privacy Preserving
—
Unverified
0
A Statistical Approach for Synthetic EEG Data Generation
Apr 22, 2025
EEG
Electroencephalogram (EEG)
Code
Code Available
0
Learning from Reasoning Failures via Synthetic Data Generation
Apr 20, 2025
Synthetic Data Generation
—
Unverified
0
Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation
Apr 17, 2025
Synthetic Data Generation
—
Unverified
0
MetaSynth: Meta-Prompting-Driven Agentic Scaffolds for Diverse Synthetic Data Generation
Apr 17, 2025
Diversity
Domain Adaptation
—
Unverified
0
Synthetic Data for Blood Vessel Network Extraction
Apr 16, 2025
Graph Generation
Image Generation
—
Unverified
0
Evaluating the Diversity and Quality of LLM Generated Content
Apr 16, 2025
Diversity
Synthetic Data Generation
—
Unverified
0
Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task
Apr 15, 2025
2D Object Detection
Object
—
Unverified
0
Leveraging Vertical Public-Private Split for Improved Synthetic Data Generation
Apr 15, 2025
Synthetic Data Generation
—
Unverified
0
Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data Generation
Apr 11, 2025
Depth Estimation
Instance Segmentation
Code
Code Available
0
SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data
Apr 11, 2025
Decoder
Image Segmentation
—
Unverified
0
Enhancing Metabolic Syndrome Prediction with Hybrid Data Balancing and Counterfactuals
Apr 9, 2025
counterfactual
Synthetic Data Generation
Code
Code Available
0
MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection
Apr 9, 2025
Data Augmentation
Diversity
—
Unverified
0
SemEval-2025 Task 5: LLMs4Subjects -- LLM-based Automated Subject Tagging for a National Technical Library's Open-Access Catalog
Apr 9, 2025
Synthetic Data Generation
Code
Code Available
0
A Self-Supervised Framework for Space Object Behaviour Characterisation
Apr 8, 2025
Anomaly Detection
Earth Observation
—
Unverified
0
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use
Apr 7, 2025
GSM8K
Math
—
Unverified
0
CORTEX-AVD: A Framework for CORner Case Testing and EXploration in Autonomous Vehicle Development
Apr 4, 2025
Autonomous Vehicles
Synthetic Data Generation
—
Unverified
0
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data
Apr 3, 2025
Computational Efficiency
Synthetic Data Generation
—
Unverified
0
Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation
Apr 2, 2025
Cross-Lingual Transfer
Decoder
—
Unverified
0
Show:
10
25
50
← Prev
Page 4 of 33
Next →
All datasets
UCI Epileptic Seizure Recognition
UNSW-NB15
Benchmark Results
▼
UCI Epileptic Seizure Recognition
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
corGAN
AUROC
0.92
—
Unverified
2
GAN
AUROC
0.87
—
Unverified
▼
UNSW-NB15
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
kiNETGAN
EMD
0.07
—
Unverified
2
CTGAN
EMD
0.07
—
Unverified