SOTAVerified

Dataset Generation

The task involves enhancing the training of target application (e.g. autonomous driving systems) by generating datasets of diverse and critical elements (e.g. traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 110 of 308 papers

TitleStatusHype
Better Synthetic Data by Retrieving and Transforming Existing DatasetsCode7
Synthetic Dataset Generation for Adversarial Machine Learning ResearchCode6
Prompt2Model: Generating Deployable Models from Natural Language InstructionsCode4
AutoCoder: Enhancing Code Large Language Model with AIEV-InstructCode4
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkCode3
Hierarchical Lexical Graph for Enhanced Multi-Hop RetrievalCode3
JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics FrameworkCode2
DataDream: Few-shot Guided Dataset GenerationCode2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion ModelsCode2
An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset GenerationCode2
Show:102550
← PrevPage 1 of 31Next →

No leaderboard results yet.