SOTAVerified

Benchmarking

Papers


Showing 4461–4470 of 5548 papers

Title	Date	Tasks	Status	Hype
Model-predictive control and reinforcement learning in multi-energy system case studies	Apr 20, 2021	Benchmarking, Model Predictive Control	Unverified	0
Benchmarking the Benchmark -- Analysis of Synthetic NIDS Datasets	Apr 19, 2021	Benchmarking, Intrusion Detection	Unverified	0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks	Apr 18, 2021	Benchmarking, Federated Learning	Code Available	0
The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech	Apr 17, 2021	Benchmarking	Unverified	0
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models	Apr 17, 2021	Argument Retrieval, Benchmarking	Code Available	2
Towards Standardising Reinforcement Learning Approaches for Production Scheduling Problems	Apr 16, 2021	Benchmarking, reinforcement-learning	Code Available	1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series Data	Apr 16, 2021	Benchmarking, Causal Discovery	Code Available	1
Jointly Modeling and Clustering Tensors in High Dimensions	Apr 15, 2021	Benchmarking, Clustering	Unverified	0
On the Assessment of Benchmark Suites for Algorithm Comparison	Apr 15, 2021	Benchmarking	Unverified	0
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability	Apr 14, 2021	Benchmarking, Link Prediction	Code Available	1


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified