SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 221230 of 399 papers

TitleStatusHype
Large Language Models as a Tool for Mining Object Knowledge0
Latte: Transfering LLMs` Latent-level Knowledge for Few-shot Tabular Learning0
Laughter During Cooperative and Competitive Games0
T-Norms Driven Loss Functions for Machine Learning0
Learning from Natural Language Explanations for Generalizable Entity Matching0
Learning Knowledge Graphs for Question Answering through Conversational Dialog0
Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization0
Transfer learning of chaotic systems0
Learning to Model the World with Language0
Controversy Rules - Discovering Regions Where Classifiers (Dis-)Agree Exceptionally0
Show:102550
← PrevPage 23 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified