SOTAVerified

Benchmarking

Papers

Showing 581590 of 5548 papers

TitleStatusHype
CheX-GPT: Harnessing Large Language Models for Enhanced Chest X-ray Report LabelingCode1
Large Scale MRI Collection and Segmentation of Cirrhotic LiverCode1
CODEBench: A Neural Architecture and Hardware Accelerator Co-Design FrameworkCode1
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning AlgorithmsCode1
RobFR: Benchmarking Adversarial Robustness on Face RecognitionCode1
Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls and New BenchmarkingCode1
CausalTime: Realistically Generated Time-series for Benchmarking of Causal DiscoveryCode1
Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark FrameworkCode1
CAVIAR: Co-simulation of 6G Communications, 3D Scenarios and AI for Digital TwinsCode1
A multi-schematic classifier-independent oversampling approach for imbalanced datasetsCode1
Show:102550
← PrevPage 59 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified