SOTAVerified

Benchmarking

Papers

Showing 4150 of 5548 papers

TitleStatusHype
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and ReasoningCode4
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression ToolkitCode4
Molecular-driven Foundation Model for Oncologic PathologyCode4
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-EncodersCode4
Accelerating Data Processing and Benchmarking of AI Models for PathologyCode4
Benchmarking Retrieval-Augmented Generation for MedicineCode4
Enabling more efficient and cost-effective AI/ML systems with Collective Mind, virtualized MLOps, MLPerf, Collective Knowledge Playground and reproducible optimization tournamentsCode4
A deep learning framework for efficient pathology image analysisCode4
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous DrivingCode4
MTEB: Massive Text Embedding BenchmarkCode4
Show:102550
← PrevPage 5 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified