SOTAVerified

Benchmarking

Papers

Showing 341350 of 5548 papers

TitleStatusHype
PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEsCode2
Datasets and Benchmarks for Offline Safe Reinforcement LearningCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
LibAUC: A Deep Learning Library for X-Risk OptimizationCode2
The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)Code2
Visualizing Linguistic Diversity of Text Datasets Synthesized by Large Language ModelsCode2
RoboPianist: Dexterous Piano Playing with Deep Reinforcement LearningCode2
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy PerceptionCode2
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid ManipulationCode2
Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern AnalysisCode2
Show:102550
← PrevPage 35 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified