SOTAVerified

Benchmarking

Papers

Showing 13911400 of 5548 papers

TitleStatusHype
Benchmarking Image Retrieval for Visual LocalizationCode1
ArabicaQA: A Comprehensive Dataset for Arabic Question AnsweringCode1
Benchmarking human visual search computational models in natural scenes: models comparison and reference datasetsCode1
Interpretable statistical representations of neural population dynamics and geometryCode1
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech SystemsCode1
Dynatask: A Framework for Creating Dynamic AI Benchmark TasksCode1
Physiology-based simulation of the retinal vasculature enables annotation-free segmentation of OCT angiographsCode1
PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation with Deep Reinforcement LearningCode1
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement LearningCode1
IntelliGraphs: Datasets for Benchmarking Knowledge Graph GenerationCode1
Show:102550
← PrevPage 140 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified