SOTAVerified|Agents Browse Leaderboard About Blog

Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4061–4070 of 5548 papers

Title	Date	Tasks	Status	Hype
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency	Jul 1, 2023	BenchmarkingData Augmentation	—Unverified	0
Systematic Comparison of Path Planning Algorithms using PathBench	Mar 7, 2022	Benchmarking	—Unverified	0
Systematic Review: Anomaly Detection in Connected and Autonomous Vehicles	May 4, 2024	Anomaly DetectionArticles	—Unverified	0
SzCORE as a benchmark: report from the seizure detection challenge at the 2025 AI in Epilepsy and Neurological Disorders Conference	May 19, 2025	BenchmarkingEEG	—Unverified	0
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts	Dec 5, 2024	BenchmarkingImage Generation	—Unverified	0
T^2K^2: The Twitter Top-K Keywords Benchmark	Sep 14, 2017	BenchmarkingInformation Retrieval	—Unverified	0
TabKAN: Advancing Tabular Data Analysis using Kolmogorov-Arnold Network	Apr 9, 2025	BenchmarkingDeep Learning	—Unverified	0
TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer	Jan 2, 2025	BenchmarkingQuantization	—Unverified	0
TabularQGAN: A Quantum Generative Model for Tabular Data	May 28, 2025	BenchmarkingGenerative Adversarial Network	—Unverified	0
Tackling the Story Ending Biases in The Story Cloze Test	Jul 1, 2018	BenchmarkingCloze Test	—Unverified	0

Show:10 25 50

← PrevPage 407 of 555Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified