SOTAVerified

Benchmarking

Papers

Showing 3771-3780 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| OptIForest: Optimal Isolation Forest for Anomaly Detection | Code | 0 |
| On Evaluation of Document Classification using RVL-CDIP | | 0 |
| Evaluation of Popular XAI Applied to Clinical Prediction Models: Can They be Trusted? | | 0 |
| A Comprehensive Study on the Robustness of Image Classification and Object Detection in Remote Sensing: Surveying and Benchmarking | | 0 |
| On-orbit model training for satellite imagery with label proportions | Code | 0 |
| Diverse Community Data for Benchmarking Data Privacy Algorithms | | 0 |
| Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation Extraction | Code | 0 |
| Benchmarking Robustness of Deep Reinforcement Learning approaches to Online Portfolio Management | | 0 |
| Fairness Index Measures to Evaluate Bias in Biometric Recognition | | 0 |
| Using Motif Transitions for Temporal Graph Generation | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |