SOTAVerified

Dataset Generation

The task involves enhancing the training of target application (e.g. autonomous driving systems) by generating datasets of diverse and critical elements (e.g. traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 126150 of 308 papers

TitleStatusHype
Neural Error Covariance Estimation for Precise LiDAR Localization0
DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI0
Low-Biased General Annotated Dataset Generation0
Movie2Story: A framework for understanding videos and telling stories in the form of novel text0
Cognition Chain for Explainable Psychological Stress Detection on Social MediaCode0
SciFaultyQA: Benchmarking LLMs on Faulty Science Question Detection with a GAN-Inspired Approach to Synthetic Dataset GenerationCode0
Unbiased General Annotated Dataset Generation0
JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLMCode0
VariFace: Fair and Diverse Synthetic Dataset Generation for Face Recognition0
SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table ExtractionCode0
An Evolutionary Large Language Model for Hallucination Mitigation0
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems0
Drone Detection using Deep Neural Networks Trained on Pure Synthetic DataCode0
CorrSynth -- A Correlated Sampling Method for Diverse Dataset Generation from LLMs0
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere0
Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models0
Fairness-Utilization Trade-off in Wireless Networks with Explainable Kolmogorov-Arnold Networks0
Simulating User Agents for Embodied Conversational-AI0
SYNOSIS: Image synthesis pipeline for machine vision in metal surface inspection0
Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation0
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs0
Anchored Alignment for Self-Explanations Enhancement0
Autonomous Self-Trained Channel State Prediction Method for mmWave Vehicular Communications0
HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations0
EarthquakeNPP: Benchmark Datasets for Earthquake Forecasting with Neural Point Processes0
Show:102550
← PrevPage 6 of 13Next →

No leaderboard results yet.