SOTAVerified

Dataset Generation

This task aims to improve the training of a target application (e.g., autonomous driving systems) by generating datasets of diverse and critical elements (e.g., traffic scenarios). Traditional methods rely on expensive and limited datasets, which often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 101–150 of 308 papers

Title | Status | Hype
E-Gen: Leveraging E-Graphs to Improve Continuous Representations of Symbolic Expressions | Code | 0
Enhancing Clinical Models with Pseudo Data for De-identification | Code | 0
Semantic Segmentation for Autonomous Driving: Model Evaluation, Dataset Generation, Perspective Comparison, and Real-Time Capability | Code | 0
Evaluating ML-Based Anomaly Detection Across Datasets of Varied Integrity: A Case Study | Code | 0
ADG-Pose: Automated Dataset Generation for Real-World Human Pose Estimation | Code | 0
Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction | Code | 0
Rethinking Table Recognition using Graph Neural Networks | Code | 0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data | Code | 0
Building Large Machine Reading-Comprehension Datasets using Paragraph Vectors | Code | 0
Pipeline and Dataset Generation for Automated Fact-checking in Almost Any Language | Code | 0
Private Dataset Generation Using Privacy Preserving Collaborative Learning | Code | 0
Fonts-2-Handwriting: A Seed-Augment-Train framework for universal digit classification | Code | 0
Dataset Generation and Bonobo Classification from Weakly Labelled Videos | Code | 0
Automating 3D Dataset Generation with Neural Radiance Fields | Code | 0
A Dataset Generation Toolbox for Dynamic Security Assessment: On the Role of the Security Boundary | Code | 0
Towards Synthetic Data Generation for Improved Pain Recognition in Videos under Patient Constraints | Code | 0
Sim-MEES: Modular End-Effector System Grasping Dataset for Mobile Manipulators in Cluttered Environments | Code | 0
Fire Dynamic Vision: Image Segmentation and Tracking for Multi-Scale Fire and Plume Behavior | Code | 0
Noisemaker 3D: Comprehensive Framework for Mesh Noise Generation | Code | 0
NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Code | 0
LeagueAI: Improving object detector performance and flexibility through automatically generated training data and domain randomization | Code | 0
Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples | Code | 0
Icy Moon Surface Simulation and Stereo Depth Estimation for Sampling Autonomy | Code | 0
Mitosis Detection from Partial Annotation by Dataset Generation via Frame-Order Flipping | Code | 0
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance | Code | 0
Masked Face Dataset Generation and Masked Face Recognition | Code | 0
Learning to Propagate for Graph Meta-Learning | Code | 0
Learning Camera Miscalibration Detection | Code | 0
Learning to Compute Gröbner Bases | Code | 0
KoCoSa: Korean Context-aware Sarcasm Detection Dataset | Code | 0
GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks | Code | 0
Communicating Smartly in the Molecular Domain: Neural Networks in the Internet of Bio-Nano Things | Code | 0
Cognition Chain for Explainable Psychological Stress Detection on Social Media | Code | 0
Code Execution as Grounded Supervision for LLM Reasoning | Code | 0
Geometric Generality of Transformer-Based Gröbner Basis Computation | Code | 0
JABBERWOCK: A Tool for WebAssembly Dataset Generation and Its Application to Malicious Website Detection | Code | 0
Generative Dataset Distillation: Balancing Global Structure and Local Details | Code | 0
Clustering in Dynamic Environments: A Framework for Benchmark Dataset Generation With Heterogeneous Changes | Code | 0
A universal synthetic dataset for machine learning on spectroscopic data | Code | 0
Closing the Loop: A Framework for Trustworthy Machine Learning in Power Systems | Code | 0
IrrMap: A Large-Scale Comprehensive Dataset for Irrigation Method Mapping | Code | 0
JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM | Code | 0
Location-Aware Visual Question Generation with Lightweight Models | Code | 0
PAXQA: Generating Cross-lingual Question Answering Examples at Training Scale | Code | 0
Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer |  | 0
Generating Synthetic Ground Truth Distributions for Multi-step Trajectory Prediction using Probabilistic Composite Bézier Curves |  | 0
Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation |  | 0
A systematic dataset generation technique applied to data-driven automotive aerodynamics |  | 0
From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation |  | 0
Channel Modeling Aided Dataset Generation for AI-Enabled CSI Feedback: Advances, Challenges, and Solutions |  | 0

No leaderboard results yet.