SOTAVerified

Benchmarking

Papers

Showing 1141–1150 of 5548 papers

Title | Status | Hype
Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale | Code | 1
Jojajovai: A Parallel Guarani-Spanish Corpus for MT Benchmarking | Code | 1
A Japanese Dataset for Subjective and Objective Sentiment Polarity Classification in Micro Blog Domain | Code | 1
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection | Code | 1
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions | Code | 1
Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed | Code | 1
MIMII DG: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization Task | Code | 1
GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles | Code | 1
Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis | Code | 1
PyRelationAL: a python library for active learning research and development | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified