SOTAVerified

Benchmarking

Papers

Showing 17611770 of 5548 papers

TitleStatusHype
Cable Tree Wiring -- Benchmarking Solvers on a Real-World Scheduling Problem with a Variety of Precedence ConstraintsCode0
inMOTIFin: a lightweight end-to-end simulation software for regulatory sequencesCode0
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation OncologyCode0
M4Fog: A Global Multi-Regional, Multi-Modal, and Multi-Stage Dataset for Marine Fog Detection and Forecasting to Bridge Ocean and AtmosphereCode0
B-XAIC Dataset: Benchmarking Explainable AI for Graph Neural Networks Using Chemical DataCode0
Benchmarking ChatGPT on Algorithmic ReasoningCode0
Analysis | OPEN | Published: 17 June 2019 Multitask learning and benchmarking with clinical time series dataCode0
Building Conformal Prediction Intervals with Approximate Message PassingCode0
Building and benchmarking an Arabic Speech Commands dataset for small-footprint keyword spottingCode0
Adaptive Visual Scene Understanding: Incremental Scene Graph GenerationCode0
Show:102550
← PrevPage 177 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified