SOTAVerified

Benchmarking

Papers

Showing 961–970 of 5548 papers

Title | Status | Hype
Evaluating Robustness of Deep Reinforcement Learning for Autonomous Surface Vehicle Control in Field Tests | Code | 1
EXPObench: Benchmarking Surrogate-based Optimisation Algorithms on Expensive Black-box Functions | Code | 1
FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Code | 1
MMDetection: Open MMLab Detection Toolbox and Benchmark | Code | 1
Working Memory Capacity of ChatGPT: An Empirical Study | Code | 1
Benchmarking Embedding Aggregation Methods in Computational Pathology: A Clinical Data Perspective | Code | 1
Evaluating Adversarial Attacks on ImageNet: A Reality Check on Misclassification Classes | Code | 1
Benchmarking End-to-End Behavioural Cloning on Video Games | Code | 1
Deep Learning-Based Synchronization for Uplink NB-IoT | Code | 1
Benchmarking Natural Language Understanding Services for building Conversational Agents | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified