SOTAVerified

Benchmarking

Papers

Showing 1701-1710 of 5548 papers

Title | Status | Hype
QeMFi: A Multifidelity Dataset of Quantum Chemical Properties of Diverse Molecules | Code | 0
Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification | Code | 0
IPC: A Benchmark Data Set for Learning with Graph-Structured Data | Code | 0
ISImed: A Framework for Self-Supervised Learning using Intrinsic Spatial Information in Medical Images | Code | 0
InViG: Benchmarking Interactive Visual Grounding with 500K Human-Robot Interactions | Code | 0
Anchor Points: Benchmarking Models with Much Fewer Examples | Code | 0
An Auditing Test To Detect Behavioral Shift in Language Models | Code | 0
Inverse Contextual Bandits: Learning How Behavior Evolves over Time | Code | 0
VitaGraph: Building a Knowledge Graph for Biologically Relevant Learning Tasks | Code | 0
Investigating the Impact of Hard Samples on Accuracy Reveals In-class Data Imbalance | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified