SOTAVerified

Dataset Generation

Dataset generation aims to improve the training of a target application (e.g., autonomous driving systems) by generating datasets of diverse and critical elements (e.g., traffic scenarios). Traditional methods rely on expensive, limited datasets that often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 151–200 of 308 papers

| Title | Status | Hype |
|---|---|---|
| Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples | Code | 0 |
| KoCoSa: Korean Context-aware Sarcasm Detection Dataset | Code | 0 |
| Deep learning-driven scheduling algorithm for a single machine problem minimizing the total tardiness | | 0 |
| MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection | Code | 2 |
| UAV-Rain1k: A Benchmark for Raindrop Removal from UAV Aerial Imagery | Code | 1 |
| Accelerating PDE Data Generation via Differential Operator Action in Solution Space | | 0 |
| Synthetic Dialogue Dataset Generation using LLM Agents | Code | 0 |
| Evaluating ML-Based Anomaly Detection Across Datasets of Varied Integrity: A Case Study | Code | 0 |
| Real-time object detection and robotic manipulation for agriculture using a YOLO-based learning approach | | 0 |
| Synthetic data enables faster annotation and robust segmentation for multi-object grasping in clutter | | 0 |
| GTAutoAct: An Automatic Datasets Generation Framework Based on Game Engine Redevelopment for Action Recognition | | 0 |
| Icy Moon Surface Simulation and Stereo Depth Estimation for Sampling Autonomy | Code | 0 |
| FinLLMs: A Framework for Financial Reasoning Dataset Generation with Large Language Models | | 0 |
| RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture | | 0 |
| Automatic UAV-based Airport Pavement Inspection Using Mixed Real and Virtual Scenarios | | 0 |
| Model-Driven Dataset Generation for Data-Driven Battery SOH Models | | 0 |
| PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DoF Object Pose Dataset Generation | Code | 1 |
| RainSD: Rain Style Diversification Module for Image Synthesis Enhancement using Feature-Level Style Distribution | | 0 |
| Generalizing Single-View 3D Shape Retrieval to Occlusions and Unseen Objects | Code | 1 |
| FlowDA: Unsupervised Domain Adaptive Framework for Optical Flow Estimation | | 0 |
| Fast and Knowledge-Free Deep Learning for General Game Playing (Student Abstract) | | 0 |
| Pipeline and Dataset Generation for Automated Fact-checking in Almost Any Language | Code | 0 |
| Faithful Persona-based Conversational Dataset Generation with Large Language Models | Code | 1 |
| Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey | | 0 |
| A Probabilistic Neural Twin for Treatment Planning in Peripheral Pulmonary Artery Stenosis | | 0 |
| Learning to Compute Gröbner Bases | Code | 0 |
| Masked Face Dataset Generation and Masked Face Recognition | Code | 0 |
| Diffusion Models for Earth Observation Use-cases: from cloud removal to urban change detection | | 0 |
| Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources | | 0 |
| A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization | | 0 |
| LLMaAA: Making Large Language Models as Active Annotators | Code | 1 |
| On the Inherent Privacy Properties of Discrete Denoising Diffusion Models | | 0 |
| Location-Aware Visual Question Generation with Lightweight Models | Code | 0 |
| AutoHall: Automated Hallucination Dataset Generation for Large Language Models | | 0 |
| Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation | Code | 1 |
| Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs | Code | 1 |
| Dataset Generation and Bonobo Classification from Weakly Labelled Videos | Code | 0 |
| DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stable Diffusion Models | Code | 1 |
| Learning-based NLOS Detection and Uncertainty Prediction of GNSS Observations with Transformer-Enhanced LSTM Network | Code | 1 |
| Developing a Scalable Benchmark for Assessing Large Language Models in Knowledge Graph Engineering | Code | 1 |
| Prompt2Model: Generating Deployable Models from Natural Language Instructions | Code | 4 |
| DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Code | 2 |
| Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution | | 0 |
| Around the GLOBE: Numerical Aggregation Question-Answering on Heterogeneous Genealogical Knowledge Graphs with Deep Neural Networks | | 0 |
| Supervised Homography Learning with Realistic Dataset Generation | Code | 1 |
| SynTable: A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes | Code | 1 |
| Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction | Code | 0 |
| Mitosis Detection from Partial Annotation by Dataset Generation via Frame-Order Flipping | Code | 0 |
| The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models | | 0 |
| NeuroGraph: Benchmarks for Graph Machine Learning in Brain Connectomics | Code | 1 |

No leaderboard results yet.