SOTAVerified

Benchmarking

Papers

Showing 1331–1340 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| New Protocols and Negative Results for Textual Entailment Data Collection | Code | 1 |
| A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character Recognition | Code | 1 |
| Labelling unlabelled videos from scratch with multi-modal self-supervision | Code | 1 |
| Label, Verify, Correct: A Simple Few Shot Object Detection Method | Code | 1 |
| Benchmarking Test-Time Adaptation against Distribution Shifts in Image Classification | Code | 1 |
| Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs | Code | 1 |
| Benchmarking Spectral Graph Neural Networks: A Comprehensive Study on Effectiveness and Efficiency | Code | 1 |
| AudioMarkBench: Benchmarking Robustness of Audio Watermarking | Code | 1 |
| Benchmarking Image Retrieval for Visual Localization | Code | 1 |
| ArabicaQA: A Comprehensive Dataset for Arabic Question Answering | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |