SOTAVerified

Dataset Generation

The task involves enhancing the training of target application (e.g. autonomous driving systems) by generating datasets of diverse and critical elements (e.g. traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 2650 of 308 papers

TitleStatusHype
SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion ModelsCode1
Global Tensor Motion PlanningCode1
OpenLS-DGF: An Adaptive Open-Source Dataset Generation Framework for Machine Learning Tasks in Logic SynthesisCode1
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and AugmentationCode1
PADetBench: Towards Benchmarking Physical Attacks against Object DetectionCode1
Chip Placement with Diffusion ModelsCode1
TheoremLlama: Transforming General-Purpose LLMs into Lean4 ExpertsCode1
Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and DesignCode1
Automated Multi-level Preference for MLLMsCode1
Forcing Diffuse Distributions out of Language ModelsCode1
UAV-Rain1k: A Benchmark for Raindrop Removal from UAV Aerial ImageryCode1
PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DoF Object Pose Dataset GenerationCode1
Generalizing Single-View 3D Shape Retrieval to Occlusions and Unseen ObjectsCode1
Faithful Persona-based Conversational Dataset Generation with Large Language ModelsCode1
LLMaAA: Making Large Language Models as Active AnnotatorsCode1
Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic SegmentationCode1
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMsCode1
Learning-based NLOS Detection and Uncertainty Prediction of GNSS Observations with Transformer-Enhanced LSTM NetworkCode1
DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion ModelsCode1
Developing a Scalable Benchmark for Assessing Large Language Models in Knowledge Graph EngineeringCode1
Supervised Homography Learning with Realistic Dataset GenerationCode1
SynTable: A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop ScenesCode1
NeuroGraph: Benchmarks for Graph Machine Learning in Brain ConnectomicsCode1
Sim-Suction: Learning a Suction Grasp Policy for Cluttered Environments Using a Synthetic BenchmarkCode1
Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic TasksCode1
Show:102550
← PrevPage 2 of 13Next →

No leaderboard results yet.