SOTAVerified

Benchmarking

Papers

Showing 11811190 of 5548 papers

TitleStatusHype
Benchmarking saliency methods for chest X-ray interpretationCode1
Benchmarking Multi-Scene Fire and Smoke DetectionCode1
Benchmarking Segmentation Models with Mask-Preserved Attribute EditingCode1
Benchmarking Self-Supervised Learning on Diverse Pathology DatasetsCode1
GraphArena: Benchmarking Large Language Models on Graph Computational ProblemsCode1
GraphWorld: Fake Graphs Bring Real Insights for GNNsCode1
Benchmarking Natural Language Understanding Services for building Conversational AgentsCode1
Boosting Neural Image Compression for Machines Using Latent Space MaskingCode1
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object InteractionsCode1
GNNs as Predictors of Agentic Workflow PerformancesCode1
Show:102550
← PrevPage 119 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified