SOTAVerified

Benchmarking

Papers

Showing 171180 of 5548 papers

TitleStatusHype
Fast Vision Transformers with HiLo AttentionCode2
Fino1: On the Transferability of Reasoning Enhanced LLMs to FinanceCode2
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness BenchmarkingCode2
Exponentially Faster Language ModellingCode2
Evaluating Large-Vocabulary Object Detectors: The Devil is in the DetailsCode2
Assessing SPARQL capabilities of Large Language ModelsCode2
Event-Based Motion MagnificationCode2
Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern AnalysisCode2
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision TasksCode2
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code GenerationCode2
Show:102550
← PrevPage 18 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified