SOTAVerified

Benchmarking

Papers

Showing 671680 of 5548 papers

TitleStatusHype
Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet DatasetsCode1
A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character RecognitionCode1
DCL-Net: Deep Correspondence Learning Network for 6D Pose EstimationCode1
Benchmarking Image Retrieval for Visual LocalizationCode1
A Computed Tomography Vertebral Segmentation Dataset with Anatomical Variations and Multi-Vendor Scanner DataCode1
Benchmarking Language Model Creativity: A Case Study on Code GenerationCode1
A Large-scale Comprehensive Dataset and Copy-overlap Aware Evaluation Protocol for Segment-level Video Copy DetectionCode1
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language ModelsCode1
Attention, Please! Revisiting Attentive Probing for Masked Image ModelingCode1
Benchmarking Graph Neural Networks for FMRI analysisCode1
Show:102550
← PrevPage 68 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified