SOTAVerified

Benchmarking

Papers

Showing 13411350 of 5548 papers

TitleStatusHype
LEAF: A Benchmark for Federated SettingsCode1
Benchmarking structure-based three-dimensional molecular generative models using GenBench3D: ligand conformation quality mattersCode1
Benchmarking Image Retrieval for Visual LocalizationCode1
BioMaze: Benchmarking and Enhancing Large Language Models for Biological Pathway ReasoningCode1
LEMUR Neural Network Dataset: Towards Seamless AutoMLCode1
Less Is More: A Comparison of Active Learning Strategies for 3D Medical Image SegmentationCode1
ArabicaQA: A Comprehensive Dataset for Arabic Question AnsweringCode1
Combinatorial Optimization with Policy Adaptation using Latent Space SearchCode1
Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIsCode1
Benchmarking human visual search computational models in natural scenes: models comparison and reference datasetsCode1
Show:102550
← PrevPage 135 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified