
Dataset Generation

The task is to improve the training of a target application (e.g., autonomous driving systems) by generating datasets of diverse and critical elements (e.g., traffic scenarios). Traditional methods rely on expensive, limited datasets that often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 251-300 of 308 papers

Title | Status | Hype
Geometric Generality of Transformer-Based Gröbner Basis Computation | Code | 0
Location-Aware Visual Question Generation with Lightweight Models | Code | 0
LoFT: LoRA-fused Training Dataset Generation with Few-shot Guidance | Code | 0
Segmenting Unknown 3D Objects from Real Depth Images using Mask R-CNN Trained on Synthetic Data | Code | 0
TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models | Code | 0
Masked Face Dataset Generation and Masked Face Recognition | Code | 0
Semantically Rich Local Dataset Generation for Explainable AI in Genomics | Code | 0
Semantic Segmentation for Autonomous Driving: Model Evaluation, Dataset Generation, Perspective Comparison, and Real-Time Capability | Code | 0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data | Code | 0
Sim-MEES: Modular End-Effector System Grasping Dataset for Mobile Manipulators in Cluttered Environments | Code | 0
Mitosis Detection from Partial Annotation by Dataset Generation via Frame-Order Flipping | Code | 0
Generative Dataset Distillation: Balancing Global Structure and Local Details | Code | 0
Clustering in Dynamic Environments: A Framework for Benchmark Dataset Generation With Heterogeneous Changes | Code | 0
A Framework for Large Scale Synthetic Graph Dataset Generation | Code | 0
GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations | Code | 0
Closing the Loop: A Framework for Trustworthy Machine Learning in Power Systems | Code | 0
Smart Home Appliances: Chat with Your Fridge | Code | 0
GAN-Leaks: A Taxonomy of Membership Inference Attacks against Generative Models | Code | 0
Building Large Machine Reading-Comprehension Datasets using Paragraph Vectors | Code | 0
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation | Code | 0
Fonts-2-Handwriting: A Seed-Augment-Train framework for universal digit classification | Code | 0
Bias Reduction via Cooperative Bargaining in Synthetic Graph Dataset Generation | Code | 0
ADG-Pose: Automated Dataset Generation for Real-World Human Pose Estimation | Code | 0
Fire Dynamic Vision: Image Segmentation and Tracking for Multi-Scale Fire and Plume Behavior | Code | 0
SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction | Code | 0
Noisemaker 3D: Comprehensive Framework for Mesh Noise Generation | Code | 0
NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images | Code | 0
F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Code | 0
A universal synthetic dataset for machine learning on spectroscopic data | Code | 0
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking | Code | 0
Bag of Views: An Appearance-based Approach to Next-Best-View Planning for 3D Reconstruction | Code | 0
A Semi-Synthetic Dataset Generation Framework for Causal Inference in Recommender Systems | Code | 0
Zero-shot racially balanced dataset generation using an existing biased StyleGAN2 | Code | 0
Face Manifold: Manifold Learning for Synthetic Face Generation | Code | 0
PAXQA: Generating Cross-lingual Question Answering Examples at Training Scale | Code | 0
Evaluating ML-Based Anomaly Detection Across Datasets of Varied Integrity: A Case Study | Code | 0
Enhancing Clinical Models with Pseudo Data for De-identification | Code | 0
E-Gen: Leveraging E-Graphs to Improve Continuous Representations of Symbolic Expressions | Code | 0
Pipeline and Dataset Generation for Automated Fact-checking in Almost Any Language | Code | 0
Private Dataset Generation Using Privacy Preserving Collaborative Learning | Code | 0
Drone Detection using Deep Neural Networks Trained on Pure Synthetic Data | Code | 0
Neural Network Surrogate and Projected Gradient Descent for Fast and Reliable Finite Element Model Calibration: a Case Study on an Intervertebral Disc | Code | 0
Towards Realistic Underwater Dataset Generation and Color Restoration | Code | 0
Automating 3D Dataset Generation with Neural Radiance Fields | Code | 0
Dataset Generation and Bonobo Classification from Weakly Labelled Videos | Code | 0
Affordance Learning for End-to-End Visuomotor Robot Control | Code | 0
Synthetic dataset generation for object-to-model deep learning in industrial applications | Code | 0
Towards Synthetic Data Generation for Improved Pain Recognition in Videos under Patient Constraints | Code | 0
Communicating Smartly in the Molecular Domain: Neural Networks in the Internet of Bio-Nano Things | Code | 0
Synthetic Dataset Generation of Driver Telematics | Code | 0

No leaderboard results yet.