SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3771–3780 of 5548 papers

Title	Date	Tasks	Status	Hype
OptIForest: Optimal Isolation Forest for Anomaly Detection	Jun 22, 2023	Anomaly DetectionBenchmarking	CodeCode Available	0
On Evaluation of Document Classification using RVL-CDIP	Jun 21, 2023	BenchmarkingClassification	—Unverified	0
Evaluation of Popular XAI Applied to Clinical Prediction Models: Can They be Trusted?	Jun 21, 2023	BenchmarkingExplainable artificial intelligence	—Unverified	0
A Comprehensive Study on the Robustness of Image Classification and Object Detection in Remote Sensing: Surveying and Benchmarking	Jun 21, 2023	Adversarial RobustnessBenchmarking	—Unverified	0
On-orbit model training for satellite imagery with label proportions	Jun 21, 2023	BenchmarkingEarth Observation	CodeCode Available	0
Diverse Community Data for Benchmarking Data Privacy Algorithms	Jun 20, 2023	Benchmarking	—Unverified	0
Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation Extraction	Jun 20, 2023	BenchmarkingDocument-level Relation Extraction	CodeCode Available	0
Benchmarking Robustness of Deep Reinforcement Learning approaches to Online Portfolio Management	Jun 19, 2023	BenchmarkingDeep Reinforcement Learning	—Unverified	0
Fairness Index Measures to Evaluate Bias in Biometric Recognition	Jun 19, 2023	BenchmarkingFairness	—Unverified	0
Using Motif Transitions for Temporal Graph Generation	Jun 19, 2023	BenchmarkingGraph Generation	CodeCode Available	0

Show:10 25 50

← PrevPage 378 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified