SOTAVerified

Benchmarking

Papers

Showing 37513760 of 5548 papers

TitleStatusHype
Unsupervised Spectral Demosaicing with Lightweight Spectral Attention Networks0
OpenSiteRec: An Open Dataset for Site Recommendation0
A Synthetic Benchmarking Pipeline to Compare Camera Calibration Algorithms0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity0
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency0
InstructEval: Systematic Evaluation of Instruction Selection Methods0
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors0
Learning Environment Models with Continuous Stochastic Dynamics0
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms0
Benchmarking Large Language Model Capabilities for Conditional Generation0
Show:102550
← PrevPage 376 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified