SOTAVerified
Synthetic Data Generation
The generation of tabular data by any means possible.
Papers
Showing 176–200 of 822 papers
Title | Date | Tasks | Code | Status | Hype
Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models | Jun 3, 2025 | Synthetic Data Generation | — | Unverified | 0
IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data | Jun 3, 2025 | Attribute; Synthetic Data Generation | — | Unverified | 0
SMOTE-DP: Improving Privacy-Utility Tradeoff with Synthetic Data | Jun 2, 2025 | Privacy Preserving; Synthetic Data Generation | — | Unverified | 0
VietMix: A Naturally Occurring Vietnamese-English Code-Mixed Corpus with Iterative Augmentation for Machine Translation | May 30, 2025 | Machine Translation; Synthetic Data Generation | — | Unverified | 0
Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison | May 30, 2025 | Aspect-Based Sentiment Analysis; Aspect-Based Sentiment Analysis (ABSA) | — | Unverified | 0
CryoCCD: Conditional Cycle-consistent Diffusion with Biophysical Modeling for Cryo-EM Synthesis | May 29, 2025 | Contrastive Learning; Diversity | — | Unverified | 0
StressTest: Can YOUR Speech LM Handle the Stress? | May 28, 2025 | Question Answering; Sentence | — | Unverified | 0
Phir Hera Fairy: An English Fairytaler is a Strong Faker of Fluent Speech in Low-Resource Indian Languages | May 27, 2025 | Synthetic Data Generation; Voice Cloning | — | Unverified | 0
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | May 26, 2025 | Data-free Knowledge Distillation; Federated Learning | Code | Code Available | 0
From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data | May 26, 2025 | cross-modal alignment; Instruction Following | — | Unverified | 0
SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback | May 26, 2025 | Prompt Learning; Question Answering | — | Unverified | 0
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking | May 26, 2025 | Benchmarking; Optical Flow Estimation | — | Unverified | 0
Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations | May 26, 2025 | All; Diagnostic | — | Unverified | 0
Improving Heart Rejection Detection in XPCI Images Using Synthetic Data Augmentation | May 26, 2025 | Data Augmentation; Synthetic Data Generation | — | Unverified | 0
PIGPVAE: Physics-Informed Gaussian Process Variational Autoencoders | May 25, 2025 | Diversity; Synthetic Data Generation | — | Unverified | 0
The Prompt is Mightier than the Example | May 24, 2025 | In-Context Learning; Synthetic Data Generation | — | Unverified | 0
Large language model as user daily behavior data generator: balancing population diversity and individual personality | May 23, 2025 | Data Augmentation; Diversity | — | Unverified | 0
Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | May 22, 2025 | Federated Learning; GPU | — | Unverified | 0
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation | May 21, 2025 | Language Modeling; Language Modelling | — | Unverified | 0
Aug2Search: Enhancing Facebook Marketplace Search with LLM-Generated Synthetic Data Augmentation | May 21, 2025 | Data Augmentation; Diversity | — | Unverified | 0
Scaling Low-Resource MT via Synthetic Data Generation with LLMs | May 20, 2025 | Machine Translation; Synthetic Data Generation | — | Unverified | 0
Challenges and Limitations in the Synthetic Generation of mHealth Sensor Data | May 20, 2025 | Data Augmentation; Synthetic Data Generation | — | Unverified | 0
LLM-based Automated Theorem Proving Hinges on Scalable Synthetic Data Generation | May 17, 2025 | Automated Theorem Proving; Synthetic Data Generation | Code | Code Available | 0
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs | May 15, 2025 | Knowledge Graphs; Natural Language Queries | — | Unverified | 0
Robust Federated Learning with Confidence-Weighted Filtering and GAN-Based Completion under Noisy and Incomplete Data | May 14, 2025 | Federated Learning; Missing Labels | — | Unverified | 0
Benchmark Results
UCI Epileptic Seizure Recognition
2 submissions (↑ higher is better)
# | Model | Metric | Claimed | Verified | Status
1 | corGAN | AUROC | 0.92 | — | Unverified
2 | GAN | AUROC | 0.87 | — | Unverified

UNSW-NB15
2 submissions (↑ higher is better)
# | Model | Metric | Claimed | Verified | Status
1 | kiNETGAN | EMD | 0.07 | — | Unverified
2 | CTGAN | EMD | 0.07 | — | Unverified