SOTAVerified

Dataset Generation

The task is to improve the training of a target application (e.g., autonomous driving systems) by generating datasets of diverse and critical elements (e.g., traffic scenarios). Traditional methods rely on expensive, limited datasets that often fail to capture rare but essential situations that can pose risks during testing.

Papers

Showing 51-100 of 308 papers

| Title | Status | Hype |
| --- | --- | --- |
| Generalizing Single-View 3D Shape Retrieval to Occlusions and Unseen Objects | Code | 1 |
| MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning | Code | 1 |
| Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map | Code | 1 |
| Sim-Suction: Learning a Suction Grasp Policy for Cluttered Environments Using a Synthetic Benchmark | Code | 1 |
| Perceptual Loss for Robust Unsupervised Homography Estimation | Code | 1 |
| Image Generation for Efficient Neural Network Training in Autonomous Drone Racing | Code | 1 |
| ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation | Code | 1 |
| Learning-based NLOS Detection and Uncertainty Prediction of GNSS Observations with Transformer-Enhanced LSTM Network | Code | 1 |
| Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis | Code | 1 |
| Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks | Code | 1 |
| Chip Placement with Diffusion Models | Code | 1 |
| Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs | Code | 1 |
| OpenLS-DGF: An Adaptive Open-Source Dataset Generation Framework for Machine Learning Tasks in Logic Synthesis | Code | 1 |
| Global Tensor Motion Planning | Code | 1 |
| HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D Reconstruction | Code | 1 |
| Learning to Answer Visual Questions from Web Videos | Code | 1 |
| ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration | Code | 1 |
| Actionet: An Interactive End-To-End Platform For Task-Based Data Collection And Augmentation In 3D Environment | Code | 1 |
| ZeroGen: Efficient Zero-shot Learning via Dataset Generation | Code | 1 |
| Improving Paraphrase Detection with the Adversarial Paraphrasing Task | Code | 1 |
| Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation | | 0 |
| A systematic dataset generation technique applied to data-driven automotive aerodynamics | | 0 |
| Channel Modeling Aided Dataset Generation for AI-Enabled CSI Feedback: Advances, Challenges, and Solutions | | 0 |
| CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking | | 0 |
| A large-scale, physically-based synthetic dataset for satellite pose estimation | | 0 |
| A Chinese Machine Reading Comprehension Dataset Automatic Generated Based on Knowledge Graph | | 0 |
| A Seed-Augment-Train Framework for Universal Digit Classification | | 0 |
| Cashew dataset generation using augmentation and RaLSGAN and a transfer learning based tinyML approach towards disease detection | | 0 |
| Artificial Neural Network for Resource Allocation in Laser-based Optical wireless Networks | | 0 |
| Generating Synthetic Stereo Datasets using 3D Gaussian Splatting and Expert Knowledge Transfer | | 0 |
| Around the GLOBE: Numerical Aggregation Question-Answering on Heterogeneous Genealogical Knowledge Graphs with Deep Neural Networks | | 0 |
| Accelerating PDE Data Generation via Differential Operator Action in Solution Space | | 0 |
| Boundary Aware Multi-Focus Image Fusion Using Deep Neural Network | | 0 |
| Block and Detail: Scaffolding Sketch-to-Image Generation | | 0 |
| Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets | | 0 |
| A Recursive Framework for Expression Recognition: From Web Images to Deep Models to Game Dataset | | 0 |
| DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI | | 0 |
| FlowDA: Unsupervised Domain Adaptive Framework for Optical Flow Estimation | | 0 |
| Do Transformers Understand Polynomial Simplification? | | 0 |
| Benchmark dataset and instance generator for Real-World Three-Dimensional Bin Packing Problems | | 0 |
| FMSD-TTS: Few-shot Multi-Speaker Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation | | 0 |
| From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation | | 0 |
| Generating Synthetic Ground Truth Distributions for Multi-step Trajectory Prediction using Probabilistic Composite Bézier Curves | | 0 |
| Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments | | 0 |
| DiSECt: A Differentiable Simulator for Parameter Inference and Control in Robotic Cutting | | 0 |
| BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation | | 0 |
| Direct Alignment of Draft Model for Speculative Decoding with Chat-Fine-Tuned LLMs | | 0 |
| Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets | | 0 |
| Diffusion Models for Earth Observation Use-cases: from cloud removal to urban change detection | | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | | 0 |

No leaderboard results yet.