SOTAVerified

Benchmarking

Papers

Showing 25112520 of 5548 papers

TitleStatusHype
A Theory of Dynamic Benchmarks0
Geometric feature performance under downsampling for EEG classification tasks0
Geometry-Based Next Frame Prediction from Monocular Video0
ATG: Benchmarking Automated Theorem Generation for Generative Language Models0
Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games0
A Comprehensive Study on Dataset Distillation: Performance, Privacy, Robustness and Fairness0
Geometry Matters: Benchmarking Scientific ML Approaches for Flow Prediction around Complex Geometries0
Benchmarking Robustness of Deep Reinforcement Learning approaches to Online Portfolio Management0
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation0
A tale of two toolkits, report the first: benchmarking time series classification algorithms for correctness and efficiency0
Show:102550
← PrevPage 252 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified