SOTAVerified

Dataset Generation

The task involves enhancing the training of target application (e.g. autonomous driving systems) by generating datasets of diverse and critical elements (e.g. traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 201250 of 308 papers

TitleStatusHype
Synthetic Dataset Generation with Itemset-Based Generative Models0
Synthetic Datasets for Autonomous Driving: A Survey0
Synthetic Error Dataset Generation Mimicking Bengali Writing Pattern0
Synthetic-to-real Composite Semantic Segmentation in Additive Manufacturing0
Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes0
Technical report of a DMD-based Characterization Method for Vision Sensors0
The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models0
Through Fog High-Resolution Imaging Using Millimeter Wave Radar0
Towards a methodology for addressing missingness in datasets, with an application to demographic health datasets0
Towards Real-World Category-level Articulation Pose Estimation0
Training dataset generation for bridge game registration0
TrainSim: A Railway Simulation Framework for LiDAR and Camera Dataset Generation0
Transfer learning for self-supervised, blind-spot seismic denoising0
Unbiased General Annotated Dataset Generation0
Undertrained Image Reconstruction for Realistic Degradation in Blind Image Super-Resolution0
Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering0
Unsupervised Learning of Shape Concepts - From Real-World Objects to Mental Simulation0
Unsupervised Multi-label Dataset Generation from Web Data0
The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation0
Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA0
USM-SED - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios0
V^2R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations0
ValuePilot: A Two-Phase Framework for Value-Driven Decision-Making0
VariFace: Fair and Diverse Synthetic Dataset Generation for Face Recognition0
Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors0
Visual-tactile Fusion for Transparent Object Grasping in Complex Backgrounds0
WebFAQ: A Multilingual Collection of Natural Q&A Datasets for Dense Retrieval0
Wireless Sensing With Deep Spectrogram Network and Primitive Based Autoregressive Hybrid Channel Model0
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation0
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems0
Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution0
Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey0
Learning Wireless Data Knowledge Graph for Green Intelligent Communications: Methodology and Experiments0
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs0
JABBERWOCK: A Tool for WebAssembly Dataset Generation and Its Application to Malicious Website DetectionCode0
JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLMCode0
IrrMap: A Large-Scale Comprehensive Dataset for Irrigation Method MappingCode0
Transport-Related Surface Detection with Machine Learning: Analyzing Temporal Trends in Madrid and ViennaCode0
SciFaultyQA: Benchmarking LLMs on Faulty Science Question Detection with a GAN-Inspired Approach to Synthetic Dataset GenerationCode0
KoCoSa: Korean Context-aware Sarcasm Detection DatasetCode0
Code Execution as Grounded Supervision for LLM ReasoningCode0
Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving ScenariosCode0
Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot ExamplesCode0
Learning Camera Miscalibration DetectionCode0
Icy Moon Surface Simulation and Stereo Depth Estimation for Sampling AutonomyCode0
Learning to Compute Gröbner BasesCode0
Learning to Propagate for Graph Meta-LearningCode0
TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination EvaluationCode0
seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic SegmentationCode0
GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning BenchmarksCode0
Show:102550
← PrevPage 5 of 7Next →

No leaderboard results yet.