SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 331340 of 399 papers

TitleStatusHype
Transfer learning of chaotic systems0
Tencent AI Lab Machine Translation Systems for WMT20 Chat Translation Task0
Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization0
Learning to Learn Variational Semantic MemoryCode0
Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning0
KGPT: Knowledge-Grounded Pre-Training for Data-to-Text GenerationCode1
Patching as Translation: the Data and the MetaphorCode0
An Energy Ontology for Global City Indicators (ISO 37120)0
Domain Specific, Semi-Supervised Transfer Learning for Medical Imaging0
Transformers as Soft Reasoners over LanguageCode1
Show:102550
← PrevPage 34 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified