SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 361370 of 399 papers

TitleStatusHype
The Wisdom of Crowds in the Recollection of Order Information0
Explicit Utilization of General Knowledge in Machine Reading Comprehension0
The World in My Mind: Visual Dialog with Adversarial Multi-modal Feature Encoding0
Exploring Safety-Utility Trade-Offs in Personalized Language Models0
Evident: a Development Methodology and a Knowledge Base Topology for Data Mining, Machine Learning and General Knowledge Management0
Exploring Zero-Shot Anomaly Detection with CLIP in Medical Imaging: Are We There Yet?0
Extending TWIG: Zero-Shot Predictive Hyperparameter Selection for KGEs based on Graph Structure0
Extracting Unlearned Information from LLMs with Activation Steering0
Thinking LLMs: General Instruction Following with Thought Generation0
Fast constrained sampling in pre-trained diffusion models0
Show:102550
← PrevPage 37 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified