SOTAVerified

Benchmarking

Papers

Showing 36813690 of 5548 papers

TitleStatusHype
Point Cloud Objective Quality: Benchmarking Features and Quality Evaluation0
Polarization and Index Modulations: a Theoretical and Practical Perspective0
Policy Entropy for Out-of-Distribution Classification0
Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing0
Portfolio Benchmarking under Drawdown Constraint and Stochastic Sharpe Ratio0
PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions0
Pose Estimation for Non-Cooperative Spacecraft Rendezvous Using Convolutional Neural Networks0
Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation0
Position: Benchmarking is Limited in Reinforcement Learning Research0
Position: Graph Learning Will Lose Relevance Due To Poor Benchmarks0
Show:102550
← PrevPage 369 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified