SOTAVerified

Benchmarking

Papers

Showing 901910 of 5548 papers

TitleStatusHype
An Image Dataset for Benchmarking Recommender Systems with Raw PixelsCode1
Labelling unlabelled videos from scratch with multi-modal self-supervisionCode1
A Comprehensive Benchmark for COVID-19 Predictive Modeling Using Electronic Health Records in Intensive CareCode1
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMMCode1
AD-LLM: Benchmarking Large Language Models for Anomaly DetectionCode1
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate ModelsCode1
CIPCaD-Bench: Continuous Industrial Process datasets for benchmarking Causal Discovery methodsCode1
Benchmarking Counterfactual Image GenerationCode1
AdsorbML: A Leap in Efficiency for Adsorption Energy Calculations using Generalizable Machine Learning PotentialsCode1
CIDEr: Consensus-based Image Description EvaluationCode1
Show:102550
← PrevPage 91 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified