SOTAVerified

Benchmarking

Papers

Showing 14111420 of 5548 papers

TitleStatusHype
Automatic sleep stage classification with deep residual networks in a mixed-cohort settingCode1
Multimodal Fusion via Teacher-Student Network for Indoor Action RecognitionCode1
CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of CancerCode1
Benchmarking Vision, Language, & Action Models on Robotic Learning TasksCode1
scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell DataCode1
CRoW: Benchmarking Commonsense Reasoning in Real-World TasksCode1
MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection BenchmarkCode1
MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph DataCode1
NAS-Bench-1Shot1: Benchmarking and Dissecting One-shot Neural Architecture SearchCode1
Towards Reliable Detection of LLM-Generated Texts: A Comprehensive Evaluation Framework with CUDRTCode1
Show:102550
← PrevPage 142 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified