SOTAVerified

Benchmarking

Papers

Showing 4521–4530 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis | Code | 1 |
| Parametrized quantum policies for reinforcement learning | | 0 |
| Benchmarking Off-The-Shelf Solutions to Robotic Assembly Tasks | | 0 |
| Synplex: A synthetic simulator of highly multiplexed histological images | | 0 |
| GraphMineSuite: Enabling High-Performance and Programmable Graph Mining Algorithms with Set Algebra | | 0 |
| The Dota 2 Bot Competition | | 0 |
| Learning to Fly -- a Gym Environment with PyBullet Physics for Reinforcement Learning of Multi-agent Quadcopter Control | Code | 2 |
| Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation | Code | 0 |
| Adversarial Environment Generation for Learning to Navigate the Web | Code | 0 |
| Accounting for Variance in Machine Learning Benchmarks | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |