SOTAVerified
Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Showing 51–75 of 822 papers
| Title | Date | Tasks | Status | Hype |
|---|---|---|---|---|
| Aug2Search: Enhancing Facebook Marketplace Search with LLM-Generated Synthetic Data Augmentation | May 21, 2025 | Data Augmentation, Diversity | Unverified | 0 |
| Challenges and Limitations in the Synthetic Generation of mHealth Sensor Data | May 20, 2025 | Data Augmentation, Synthetic Data Generation | Unverified | 0 |
| Scaling Low-Resource MT via Synthetic Data Generation with LLMs | May 20, 2025 | Machine Translation, Synthetic Data Generation | Unverified | 0 |
| LLM-based Automated Theorem Proving Hinges on Scalable Synthetic Data Generation | May 17, 2025 | Automated Theorem Proving, Synthetic Data Generation | Code Available | 0 |
| BLEUBERI: BLEU is a surprisingly effective reward for instruction following | May 16, 2025 | Instruction Following, Synthetic Data Generation | Code Available | 1 |
| RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization | May 16, 2025 | RAG, Synthetic Data Generation | Code Available | 1 |
| RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs | May 15, 2025 | Knowledge Graphs, Natural Language Queries | Unverified | 0 |
| Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data | May 14, 2025 | Federated Learning, Missing Labels | Unverified | 0 |
| Privacy-Preserving Analytics for Smart Meter (AMI) Data: A Hybrid Approach to Comply with CPUC Privacy Regulations | May 13, 2025 | Econometrics, Federated Learning | Unverified | 0 |
| Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data | May 12, 2025 | Program Repair, Synthetic Data Generation | Unverified | 0 |
| Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs | May 12, 2025 | AI Agent, Knowledge Distillation | Code Available | 2 |
| Uni-AIMS: AI-Powered Microscopy Image Analysis | May 11, 2025 | Synthetic Data Generation | Unverified | 0 |
| Advancing Uto-Aztecan Language Technologies: A Case Study on the Endangered Comanche Language | May 10, 2025 | Language Identification, Synthetic Data Generation | Code Available | 0 |
| Generating Reliable Synthetic Clinical Trial Data: The Role of Hyperparameter Optimization and Domain Constraints | May 8, 2025 | Hyperparameter Optimization, Synthetic Data Generation | Unverified | 0 |
| SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation | May 8, 2025 | 3DGS, Data Augmentation | Code Available | 2 |
| AI-Generated Fall Data: Assessing LLMs and Diffusion Model for Wearable Fall Detection | May 7, 2025 | Synthetic Data Generation | Unverified | 0 |
| Improving Omics-Based Classification: The Role of Feature Selection and Synthetic Data Generation | May 6, 2025 | Binary Classification, Classification | Unverified | 0 |
| Synthline: A Product Line Approach for Synthetic Requirements Engineering Data Generation using Large Language Models | May 6, 2025 | Diversity, Synthetic Data Generation | Code Available | 0 |
| Modeling supply chain compliance response strategies based on AI synthetic data with structural path regression: A Simulation Study of EU 2027 Mandatory Labor Regulations | May 4, 2025 | regression, Synthetic Data Generation | Unverified | 0 |
| Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models | May 2, 2025 | Diversity, Reading Comprehension | Unverified | 0 |
| ReasonIR: Training Retrievers for Reasoning Tasks | Apr 29, 2025 | Information Retrieval, MMLU | Code Available | 3 |
| Artificial Intelligence for Personalized Prediction of Alzheimer's Disease Progression: A Survey of Methods, Data Challenges, and Future Directions | Apr 29, 2025 | Causal Inference, Federated Learning | Unverified | 0 |
| Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation | Apr 29, 2025 | Benchmarking, Fairness | Code Available | 0 |
| Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs | Apr 28, 2025 | Synthetic Data Generation | Code Available | 3 |
| Towards Ball Spin and Trajectory Analysis in Table Tennis Broadcast Videos via Physically Grounded Synthetic-to-Real Transfer | Apr 28, 2025 | Monocular 3D Object Localization, Sports Analytics | Code Available | 1 |
Benchmark Results
UCI Epileptic Seizure Recognition
2 submissions
↑ higher is better

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | corGAN | AUROC | 0.92 | — | Unverified |
| 2 | GAN | AUROC | 0.87 | — | Unverified |
UNSW-NB15
2 submissions
↑ higher is better

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | kiNETGAN | EMD | 0.07 | — | Unverified |
| 2 | CTGAN | EMD | 0.07 | — | Unverified |