SOTAVerified

Benchmarking

Papers

Showing 1201–1210 of 5548 papers

Title | Status | Hype
Benchmarking of GPU-optimized Quantum-Inspired Evolutionary Optimization Algorithm using Functional Analysis | | 0
JuStRank: Benchmarking LLM Judges for System Ranking | | 0
Neptune: The Long Orbit to Benchmarking Long Video Understanding | Code | 2
Benchmarking LLMs for Mimicking Child-Caregiver Language in Interaction | | 0
Benchmarking Federated Learning for Semantic Datasets: Federated Scene Graph Generation | Code | 0
Learn How to Query from Unlabeled Data Streams in Federated Learning | Code | 0
LCFO: Long Context and Long Form Output Dataset and Benchmarking | | 0
Koopman Theory-Inspired Method for Learning Time Advancement Operators in Unstable Flame Front Evolution | | 0
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions | Code | 0
Benchmarking learned algorithms for computed tomography image reconstruction tasks | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified