SOTAVerified

Benchmarking

Papers

Showing 40014010 of 5548 papers

TitleStatusHype
Hawk: An Industrial-strength Multi-label Document Classifier0
Benchmarking Robustness in Neural Radiance Fields0
Evaluating the Transferability of Machine-Learned Force Fields for Material Property ModelingCode0
Critical review of conformational B-cell epitope prediction methodsCode0
Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture0
AERF: Adaptive ensemble random fuzzy algorithm for anomaly detection in cloud computing0
"It's a Match!" -- A Benchmark of Task Affinity Scores for Joint Learning0
The Evolutionary Computation Methods No One Should Use0
ANNA: Abstractive Text-to-Image Synthesis with Filtered News CaptionsCode0
Benchmarking common uncertainty estimation methods with histopathological images under domain shift and label noise0
Show:102550
← PrevPage 401 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified