SOTAVerified

Benchmarking

Papers

Showing 14411450 of 5548 papers

TitleStatusHype
Evaluating histopathology transfer learning with ChampKitCode1
Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls and New BenchmarkingCode1
BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language modelsCode1
Evaluating Multimodal Representations on Visual Semantic Textual SimilarityCode1
ISSAFE: Improving Semantic Segmentation in Accidents by Fusing Event-based DataCode1
Rethinking Machine Unlearning in Image Generation ModelsCode1
JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in CrowdsCode1
Benchmark on Drug Target Interaction Modeling from a Structure PerspectiveCode1
ClinicRealm: Re-evaluating Large Language Models with Conventional Machine Learning for Non-Generative Clinical Prediction TasksCode1
Benchpress: A Scalable and Versatile Workflow for Benchmarking Structure Learning AlgorithmsCode1
Show:102550
← PrevPage 145 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified