SOTAVerified

Benchmarking

Papers

Showing 4051–4075 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Syn3DWound: A Synthetic Dataset for 3D Wound Bed Analysis | | 0 |
| SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data | | 0 |
| Synplex: A synthetic simulator of highly multiplexed histological images | | 0 |
| Syntactically Aware Neural Architectures for Definition Extraction | | 0 |
| Syntax Encoding with Application in Authorship Attribution | | 0 |
| A Synthetic Benchmarking Pipeline to Compare Camera Calibration Algorithms | | 0 |
| Synthetic Video Generation for Robust Hand Gesture Recognition in Augmented Reality Applications | | 0 |
| Synthetic weather radar using hybrid quantum-classical machine learning | | 0 |
| SynthRAD2025 Grand Challenge dataset: generating synthetic CTs for radiotherapy | | 0 |
| SysML'19 demo: customizable and reusable Collective Knowledge pipelines to automate and reproduce machine learning experiments | | 0 |
| SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency | | 0 |
| Systematic Comparison of Path Planning Algorithms using PathBench | | 0 |
| Systematic Review: Anomaly Detection in Connected and Autonomous Vehicles | | 0 |
| SzCORE as a benchmark: report from the seizure detection challenge at the 2025 AI in Epilepsy and Neurological Disorders Conference | | 0 |
| T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts | | 0 |
| T^2K^2: The Twitter Top-K Keywords Benchmark | | 0 |
| TabKAN: Advancing Tabular Data Analysis using Kolmogorov-Arnold Network | | 0 |
| TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer | | 0 |
| TabularQGAN: A Quantum Generative Model for Tabular Data | | 0 |
| Tackling the Story Ending Biases in The Story Cloze Test | | 0 |
| Tackling Visual Control via Multi-View Exploration Maximization | | 0 |
| TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding | | 0 |
| Tactile MNIST: Benchmarking Active Tactile Perception | | 0 |
| Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics | | 0 |
| TARGET: Benchmarking Table Retrieval for Generative Tasks | | 0 |
Page 163 of 222

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |