SOTAVerified

Benchmarking

Papers

Showing 1801-1810 of 5548 papers

Title | Status | Hype
IoT Data Trust Evaluation via Machine Learning | Code | 0
Comparative Analysis: Violence Recognition from Videos using Transfer Learning | Code | 0
Towards Learning Universal, Regional, and Local Hydrological Behaviors via Machine-Learning Applied to Large-Sample Datasets | Code | 0
Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation | Code | 0
Individual Fairness Guarantees for Neural Networks | Code | 0
Adaptive Power System Emergency Control using Deep Reinforcement Learning | Code | 0
InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual Illusion | Code | 0
BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception | Code | 0
Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective | Code | 0
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified