SOTAVerified

Dataset Generation

The task aims to improve the training of a target application (e.g., autonomous driving systems) by generating datasets of diverse and critical elements (e.g., traffic scenarios). Traditional methods rely on expensive, limited datasets that often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 126–150 of 308 papers

| Title | Status | Hype |
|---|---|---|
| Learning to Compute Gröbner Bases | Code | 0 |
| Learning to Propagate for Graph Meta-Learning | Code | 0 |
| KoCoSa: Korean Context-aware Sarcasm Detection Dataset | Code | 0 |
| Communicating Smartly in the Molecular Domain: Neural Networks in the Internet of Bio-Nano Things | Code | 0 |
| Learning Camera Miscalibration Detection | Code | 0 |
| Location-Aware Visual Question Generation with Lightweight Models | Code | 0 |
| Cognition Chain for Explainable Psychological Stress Detection on Social Media | Code | 0 |
| Code Execution as Grounded Supervision for LLM Reasoning | Code | 0 |
| Geometric Generality of Transformer-Based Gröbner Basis Computation | Code | 0 |
| JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM | Code | 0 |
| GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks | Code | 0 |
| A universal synthetic dataset for machine learning on spectroscopic data | Code | 0 |
| Clustering in Dynamic Environments: A Framework for Benchmark Dataset Generation With Heterogeneous Changes | Code | 0 |
| IrrMap: A Large-Scale Comprehensive Dataset for Irrigation Method Mapping | Code | 0 |
| Closing the Loop: A Framework for Trustworthy Machine Learning in Power Systems | Code | 0 |
| JABBERWOCK: A Tool for WebAssembly Dataset Generation and Its Application to Malicious Website Detection | Code | 0 |
| LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance | Code | 0 |
| Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | | 0 |
| Generating Synthetic Ground Truth Distributions for Multi-step Trajectory Prediction using Probabilistic Composite Bézier Curves | | 0 |
| Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation | | 0 |
| A systematic dataset generation technique applied to data-driven automotive aerodynamics | | 0 |
| From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation | | 0 |
| Channel Modeling Aided Dataset Generation for AI-Enabled CSI Feedback: Advances, Challenges, and Solutions | | 0 |
| FMSD-TTS: Few-shot Multi-Speaker Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation | | 0 |
| CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking | | 0 |

No leaderboard results yet.