SOTAVerified

Benchmarking

Papers

Showing 44614470 of 5548 papers

TitleStatusHype
Model-predictive control and reinforcement learning in multi-energy system case studies0
Benchmarking the Benchmark -- Analysis of Synthetic NIDS Datasets0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing TasksCode0
The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech0
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval ModelsCode2
Towards Standardising Reinforcement Learning Approaches for Production Scheduling ProblemsCode1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series DataCode1
Jointly Modeling and Clustering Tensors in High Dimensions0
On the Assessment of Benchmark Suites for Algorithm Comparison0
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning InterpretabilityCode1
Show:102550
← PrevPage 447 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified