SOTAVerified

Dataset Generation

The task involves improving the training of a target application (e.g., autonomous driving systems) by generating datasets of diverse and critical elements (e.g., traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 251-275 of 308 papers

Title | Status | Hype
Icy Moon Surface Simulation and Stereo Depth Estimation for Sampling Autonomy | Code | 0
Location-Aware Visual Question Generation with Lightweight Models | Code | 0
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance | Code | 0
Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data | Code | 0
TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models | Code | 0
Masked Face Dataset Generation and Masked Face Recognition | Code | 0
Semantically Rich Local Dataset Generation for Explainable AI in Genomics | Code | 0
Semantic Segmentation for Autonomous Driving: Model Evaluation, Dataset Generation, Perspective Comparison, and Real-Time Capability | Code | 0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data | Code | 0
Sim-MEES: Modular End-Effector System Grasping Dataset for Mobile Manipulators in Cluttered Environments | Code | 0
Mitosis Detection from Partial Annotation by Dataset Generation via Frame-Order Flipping | Code | 0
GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks | Code | 0
Code Execution as Grounded Supervision for LLM Reasoning | Code | 0
A Semi-Synthetic Dataset Generation Framework for Causal Inference in Recommender Systems | Code | 0
Geometric Generality of Transformer-Based Gröbner Basis Computation | Code | 0
Clustering in Dynamic Environments: A Framework for Benchmark Dataset Generation With Heterogeneous Changes | Code | 0
Smart Home Appliances: Chat with Your Fridge | Code | 0
GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations | Code | 0
Closing the Loop: A Framework for Trustworthy Machine Learning in Power Systems | Code | 0
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation | Code | 0
Automating 3D Dataset Generation with Neural Radiance Fields | Code | 0
Building Large Machine Reading-Comprehension Datasets using Paragraph Vectors | Code | 0
ADG-Pose: Automated Dataset Generation for Real-World Human Pose Estimation | Code | 0
GAN-Leaks: A Taxonomy of Membership Inference Attacks against Generative Models | Code | 0
SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction | Code | 0

No leaderboard results yet.