SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2851–2860 of 5548 papers

Title	Date	Tasks	Status	Hype	Score
The Design and Implementation of a Scalable DL Benchmarking Platform	Nov 19, 2019	Benchmarking	—Unverified	0	0
Handwritten Text Recognition: A Survey	Feb 12, 2025	BenchmarkingHandwritten Text Recognition	—Unverified	0	0
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset	Jan 3, 2023	BenchmarkingComputed Tomography (CT)	—Unverified	0	0
xai_evals : A Framework for Evaluating Post-Hoc Local Explanation Methods	Feb 5, 2025	Benchmarking	—Unverified	0	0
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead	Dec 21, 2020	Autonomous DrivingBenchmarking	—Unverified	0	0
Hardware-aware mobile building block evaluation for computer vision	Aug 26, 2022	BenchmarkingEfficient Neural Network	—Unverified	0	0
The Disagreement Problem in Faithfulness Metrics	Nov 13, 2023	BenchmarkingExplainable artificial intelligence	—Unverified	0	0
The DLV System for Knowledge Representation and Reasoning	Nov 4, 2002	Benchmarking	—Unverified	0	0
Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study	May 24, 2024	BenchmarkingVulnerability Detection	—Unverified	0	0
The Dota 2 Bot Competition	Mar 4, 2021	BenchmarkingDota 2	—Unverified	0	0

Show:10 25 50

← PrevPage 286 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified