SOTAVerified

General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 391-399 of 399 papers

Title | Status | Hype
Data structuring for the ontological modelling of wind energy systems | - | 0
Learning Knowledge Graphs for Question Answering through Conversational Dialog | - | 0
Learning to Understand Phrases by Embedding the Dictionary | Code | 0
Transaction Logic with (Complex) Events | - | 0
Analysis of Watson's Strategies for Playing Jeopardy! | - | 0
Collaborative ontology sharing and editing | - | 0
Organizing Linked Data Quality Related Methods | - | 0
A Dynamic Approach to Probabilistic Inference | - | 0
The Wisdom of Crowds in the Recollection of Order Information | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Chinchilla-70B (few-shot, k=5) | Accuracy | 94.3 | - | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 93.9 | - | Unverified
3 | Chinchilla-70B (few-shot, k=5) | Accuracy | 85.7 | - | Unverified
4 | Gopher-280B (few-shot, k=5) | Accuracy | 84.8 | - | Unverified
5 | Gopher-280B (few-shot, k=5) | Accuracy | 84.2 | - | Unverified
6 | Gopher-280B (few-shot, k=5) | Accuracy | 84.1 | - | Unverified
7 | Gopher-280B (few-shot, k=5) | Accuracy | 83.9 | - | Unverified
8 | Gopher-280B (few-shot, k=5) | Accuracy | 83.3 | - | Unverified
9 | Gopher-280B (few-shot, k=5) | Accuracy | 81.8 | - | Unverified
10 | Gopher-280B (few-shot, k=5) | Accuracy | 81 | - | Unverified