SOTAVerified

Benchmarking

Papers

Showing 11711180 of 5548 papers

TitleStatusHype
Coarse-to-Fine Q-attention with Learned Path RankingCode1
Earnings-22: A Practical Benchmark for Accents in the WildCode1
Parameter-efficient Model Adaptation for Vision TransformersCode1
Visual Abductive ReasoningCode1
Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative ComprehensionCode1
Benchmarking Visual Localization for Autonomous NavigationCode1
minicons: Enabling Flexible Behavioral and Representational Analyses of Transformer Language ModelsCode1
Sionna: An Open-Source Library for Next-Generation Physical Layer ResearchCode1
SHEL5K: An Extended Dataset and Benchmarking for Safety Helmet DetectionCode1
ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRICode1
Show:102550
← PrevPage 118 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified