SOTAVerified

Benchmarking

Papers

Showing 16711680 of 5548 papers

TitleStatusHype
Benchmarking Sub-Genre Classification For Mainstage Dance Music0
MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context UnderstandingCode0
Mahalanobis k-NN: A Statistical Lens for Robust Point-Cloud RegistrationsCode0
VoiceWukong: Benchmarking Deepfake Voice Detection0
Selecting Differential Splicing Methods: Practical Considerations0
NeIn: Telling What You Don't Want0
RBoard: A Unified Platform for Reproducible and Reusable Recommender System Benchmarks0
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection0
Assessing SPARQL capabilities of Large Language ModelsCode2
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E50
Show:102550
← PrevPage 168 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified