SOTAVerified

Benchmarking

Papers

Showing 2101-2110 of 5548 papers

Title | Status | Hype
Benchmarking the Attribution Quality of Vision Models | Code | 0
HuSc3D: Human Sculpture dataset for 3D object reconstruction | Code | 0
HR-VILAGE-3K3M: A Human Respiratory Viral Immunization Longitudinal Gene Expression Dataset for Systems Immunity | Code | 0
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models | Code | 0
Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties | Code | 0
HRNET: AI on Edge for mask detection and social distancing | Code | 0
Hybrid Machine Learning Models of Classifying Residential Requests for Smart Dispatching | Code | 0
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey | Code | 0
How to Manage Tiny Machine Learning at Scale: An Industrial Perspective | Code | 0
How Far Are We from Optimal Reasoning Efficiency? | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified