SOTAVerified

Benchmarking

Papers

Showing 31813190 of 5548 papers

TitleStatusHype
Detecting critical treatment effect bias in small subgroupsCode0
Leak Proof CMap; a framework for training and evaluation of cell line agnostic L1000 similarity methodsCode0
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models0
Stochastic Spiking Neural Networks with First-to-Spike Coding0
CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint MatchingCode0
Benchmarking Mobile Device Control Agents across Diverse Configurations0
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning0
ApisTox: a new benchmark dataset for the classification of small molecules toxicity on honey beesCode0
Empirical Analysis of the Dynamic Binary Value Problem with IOHprofiler0
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
Show:102550
← PrevPage 319 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified