SOTAVerified

Benchmarking

Papers

Showing 4061-4070 of 5548 papers

Title | Status | Hype
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency | | 0
Systematic Comparison of Path Planning Algorithms using PathBench | | 0
Systematic Review: Anomaly Detection in Connected and Autonomous Vehicles | | 0
SzCORE as a benchmark: report from the seizure detection challenge at the 2025 AI in Epilepsy and Neurological Disorders Conference | | 0
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | | 0
T^2K^2: The Twitter Top-K Keywords Benchmark | | 0
TabKAN: Advancing Tabular Data Analysis using Kolmogorov-Arnold Network | | 0
TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer | | 0
TabularQGAN: A Quantum Generative Model for Tabular Data | | 0
Tackling the Story Ending Biases in The Story Cloze Test | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified