SOTAVerified

Benchmarking

Papers

Showing 121130 of 5548 papers

TitleStatusHype
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for LocomotionCode3
CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous DrivingCode3
Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity AnalysisCode3
T^3Bench: Benchmarking Current Progress in Text-to-3D GenerationCode3
SMPLer-X: Scaling Up Expressive Human Pose and Shape EstimationCode3
Matbench Discovery -- A framework to evaluate machine learning crystal stability predictionsCode3
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot LearningCode3
TorchBench: Benchmarking PyTorch with High API Surface CoverageCode3
Highly Accurate Quantum Chemical Property Prediction with Uni-Mol+Code3
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement LearningCode3
Show:102550
← PrevPage 13 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified