SOTAVerified

Dataset Generation

The task involves enhancing the training of target application (e.g. autonomous driving systems) by generating datasets of diverse and critical elements (e.g. traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 1120 of 308 papers

TitleStatusHype
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language ModelsCode2
JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics FrameworkCode2
An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset GenerationCode2
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object DetectionCode2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion ModelsCode2
Real-Time Per-Garment Virtual Try-On with Temporal Consistency for Loose-Fitting GarmentsCode1
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal ReasoningCode1
TimeGraph: Synthetic Benchmark Datasets for Robust Time-Series Causal DiscoveryCode1
Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM EvaluationCode1
Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation MapCode1
Show:102550
← PrevPage 2 of 31Next →

No leaderboard results yet.