SOTAVerified

Dataset Generation

The task involves enhancing the training of target application (e.g. autonomous driving systems) by generating datasets of diverse and critical elements (e.g. traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 226250 of 308 papers

TitleStatusHype
Visual-tactile Fusion for Transparent Object Grasping in Complex Backgrounds0
WebFAQ: A Multilingual Collection of Natural Q&A Datasets for Dense Retrieval0
Wireless Sensing With Deep Spectrogram Network and Primitive Based Autoregressive Hybrid Channel Model0
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation0
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems0
Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution0
Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey0
Learning Wireless Data Knowledge Graph for Green Intelligent Communications: Methodology and Experiments0
FTSmartAudit: A Knowledge Distillation-Enhanced Framework for Automated Smart Contract Auditing Using Fine-Tuned LLMs0
Low-Biased General Annotated Dataset Generation0
LSD3K: A Benchmark for Smoke Removal from Laparoscopic Surgery Images0
JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLMCode0
Transport-Related Surface Detection with Machine Learning: Analyzing Temporal Trends in Madrid and ViennaCode0
SciFaultyQA: Benchmarking LLMs on Faulty Science Question Detection with a GAN-Inspired Approach to Synthetic Dataset GenerationCode0
KoCoSa: Korean Context-aware Sarcasm Detection DatasetCode0
Cognition Chain for Explainable Psychological Stress Detection on Social MediaCode0
Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving ScenariosCode0
JABBERWOCK: A Tool for WebAssembly Dataset Generation and Its Application to Malicious Website DetectionCode0
Learning Camera Miscalibration DetectionCode0
IrrMap: A Large-Scale Comprehensive Dataset for Irrigation Method MappingCode0
Learning to Compute Gröbner BasesCode0
Learning to Propagate for Graph Meta-LearningCode0
TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination EvaluationCode0
seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic SegmentationCode0
Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot ExamplesCode0
Show:102550
← PrevPage 10 of 13Next →

No leaderboard results yet.