SOTAVerified

Dataset Generation

The task involves enhancing the training of target application (e.g. autonomous driving systems) by generating datasets of diverse and critical elements (e.g. traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 5175 of 308 papers

TitleStatusHype
DCFace: Synthetic Face Generation with Dual Condition Diffusion ModelCode1
CamDiff: Camouflage Image Augmentation via Diffusion ModelCode1
LIQUID: A Framework for List Question Answering Dataset GenerationCode1
ProGen: Progressive Zero-shot Dataset Generation via In-context FeedbackCode1
Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to Parcel LogisticsCode1
RealFlow: EM-based Realistic Optical Flow Dataset Generation from VideosCode1
HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D ReconstructionCode1
Learning to Answer Visual Questions from Web VideosCode1
ZeroGen: Efficient Zero-shot Learning via Dataset GenerationCode1
Detecting Anti-Vaccine Users on TwitterCode1
SynPick: A Dataset for Dynamic Bin Picking Scene UnderstandingCode1
SofaMyRoom: a fast and multiplatform "shoebox" room simulator for binaural room impulse response dataset generationCode1
Improving Paraphrase Detection with the Adversarial Paraphrasing TaskCode1
Perceptual Loss for Robust Unsupervised Homography EstimationCode1
Monocular Multi-Layer Layout Estimation for Warehouse RacksCode1
MK-SQuIT: Synthesizing Questions using Iterative Template-fillingCode1
Actionet: An Interactive End-To-End Platform For Task-Based Data Collection And Augmentation In 3D EnvironmentCode1
Afro-MNIST: Synthetic generation of MNIST-style datasets for low-resource languagesCode1
Image Generation for Efficient Neural Network Training in Autonomous Drone RacingCode1
ViWi Vision-Aided mmWave Beam Tracking: Dataset, Task, and Baseline SolutionsCode1
Communicating Smartly in the Molecular Domain: Neural Networks in the Internet of Bio-Nano ThingsCode0
From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation0
A large-scale, physically-based synthetic dataset for satellite pose estimation0
Enhancing Clinical Models with Pseudo Data for De-identificationCode0
Code Execution as Grounded Supervision for LLM ReasoningCode0
Show:102550
← PrevPage 3 of 13Next →

No leaderboard results yet.