SOTAVerified

Benchmarking

Papers

Showing 2051-2060 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Estimating transmission from genetic and epidemiological data: a metric to compare transmission trees | | 0 |
| Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors | | 0 |
| EuroCon: Benchmarking Parliament Deliberation for Political Consensus Finding | | 0 |
| Categorization of 33 computational methods to detect spatially variable genes from spatially resolved transcriptomics data | | 0 |
| CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | | 0 |
| Estimating the Effect of Crosstalk Error on Circuit Fidelity Using Noisy Intermediate-Scale Quantum Devices | | 0 |
| Europarl-ASR: A Large Corpus of Parliamentary Debates for Streaming ASR Benchmarking and Speech Data Filtering/Verbatimization | | 0 |
| CATBench: A Compiler Autotuning Benchmarking Suite for Black-box Optimization | | 0 |
| Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection | | 0 |
| Benchmarking and Comparing Multi-exposure Image Fusion Algorithms | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |