SOTAVerified

Benchmarking

Papers

Showing 40214030 of 5548 papers

TitleStatusHype
Distributed Software-Defined Network Architecture for Smart Grid Resilience to Denial-of-Service Attacks0
AI applications in forest monitoring need remote sensing benchmark datasets0
AnyTOD: A Programmable Task-Oriented Dialog System0
Causally Testing Gender Bias in LLMs: A Case Study on Occupational BiasCode0
Benchmarking person re-identification datasets and approaches for practical real-world implementationsCode0
Trial-Based Dominance Enables Non-Parametric Tests to Compare both the Speed and Accuracy of Stochastic Optimizers0
GiCCS: A German in-Context Conversational Similarity Benchmark0
Biomedical image analysis competitions: The state of current participation practice0
Automatic vehicle trajectory data reconstruction at scale0
Momentum Contrastive Pre-training for Question Answering0
Show:102550
← PrevPage 403 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified