SOTAVerified

Benchmarking

Papers

Showing 37013710 of 5548 papers

TitleStatusHype
Benchmarking Algorithmic Bias in Face Recognition: An Experimental Approach Using Synthetic Faces and Human Evaluation0
Spintronics for image recognition: performance benchmarking via ultrafast data-driven simulations0
Enhancing Architecture Frameworks by Including Modern Stakeholders and their Views/Viewpoints0
Benchmarking LLM powered Chatbots: Methods and Metrics0
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose Recommendation System?0
Microvasculature Segmentation in Human BioMolecular Atlas Program (HuBMAP)0
Precise Benchmarking of Explainable AI Attribution MethodsCode0
A Survey of Spanish Clinical Language Models0
ChatGPT for GTFS: Benchmarking LLMs on GTFS Understanding and RetrievalCode0
RobustMQ: Benchmarking Robustness of Quantized Models0
Show:102550
← PrevPage 371 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified