SOTAVerified

Benchmarking

Papers

Showing 31113120 of 5548 papers

TitleStatusHype
Face Detection on Surveillance Images0
Face Morphing Attack Generation & Detection: A Comprehensive Survey0
FACT: Learning Governing Abstractions Behind Integer Sequences0
FactLens: Benchmarking Fine-Grained Fact Verification0
Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations0
A Normative Framework for Benchmarking Consumer Fairness in Large Language Model Recommender System0
FAIRification of MLC data0
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs0
Fairness-Aware Graph Neural Networks: A Survey0
Fairness Index Measures to Evaluate Bias in Biometric Recognition0
Show:102550
← PrevPage 312 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified