SOTAVerified

Benchmarking

Papers

Showing 45214530 of 5548 papers

TitleStatusHype
The Collective Knowledge project: making ML models more portable and reproducible with open APIs, reusable best practices and MLOpsCode0
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verificationCode0
Resource Interoperability for Sustainable Benchmarking: The Case of EventsCode0
Bayesian Neural Networks with Soft EvidenceCode0
BASED: Benchmarking, Analysis, and Structural Estimation of DeblurringCode0
Bugs in the Data: How ImageNet Misrepresents BiodiversityCode0
inMOTIFin: a lightweight end-to-end simulation software for regulatory sequencesCode0
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in EnglishCode0
InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual IllusionCode0
Individual Fairness Guarantees for Neural NetworksCode0
Show:102550
← PrevPage 453 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified