SOTAVerified

Benchmarking

Papers

Showing 28512860 of 5548 papers

TitleStatusHype
The Design and Implementation of a Scalable DL Benchmarking Platform0
Handwritten Text Recognition: A Survey0
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset0
xai_evals : A Framework for Evaluating Post-Hoc Local Explanation Methods0
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead0
Hardware-aware mobile building block evaluation for computer vision0
The Disagreement Problem in Faithfulness Metrics0
The DLV System for Knowledge Representation and Reasoning0
Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study0
The Dota 2 Bot Competition0
Show:102550
← PrevPage 286 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified