SOTAVerified

Benchmarking

Papers

Showing 2691-2700 of 5548 papers

Title | Status | Hype
Full-stack evaluation of Machine Learning inference workloads for RISC-V systems | | 0
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models | | 0
FunBench: Benchmarking Fundus Reading Skills of MLLMs | | 0
Functional Code Building Genetic Programming | | 0
Efficient Pauli channel estimation with logarithmic quantum memory | | 0
A Normative Framework for Benchmarking Consumer Fairness in Large Language Model Recommender System | | 0
FuzzWiz -- Fuzzing Framework for Efficient Hardware Coverage | | 0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK | | 0
A Survey of Spanish Clinical Language Models | | 0
AI Matrix - Synthetic Benchmarks for DNN | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified