SOTAVerified

Benchmarking

Papers

Showing 14411450 of 5548 papers

TitleStatusHype
NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and ResultsCode1
NuCLS: A scalable crowdsourcing, deep learning approach and dataset for nucleus classification, localization and segmentationCode1
AQuA: A Benchmarking Tool for Label Quality AssessmentCode1
Object Shape Error Response Using Bayesian 3-D Convolutional Neural Networks for Assembly Systems With Compliant PartsCode1
CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasksCode1
Benchpress: A Scalable and Versatile Workflow for Benchmarking Structure Learning AlgorithmsCode1
APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and BeyondCode1
CHOICE: Benchmarking the Remote Sensing Capabilities of Large Vision-Language ModelsCode1
CounselBench: A Large-Scale Expert Evaluation and Adversarial Benchmark of Large Language Models in Mental Health CounselingCode1
Contemporary Symbolic Regression Methods and their Relative PerformanceCode1
Show:102550
← PrevPage 145 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified