General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 331–340 of 399 papers

Title	Date	Tasks	Status	Hype
On the Usage of Continual Learning for Out-of-Distribution Generalization in Pre-trained Language Models of Code	May 6, 2023	Continual Learning, General Knowledge	Unverified	0
Organizing Linked Data Quality Related Methods	May 30, 2013	General Knowledge	Unverified	0
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering	Nov 1, 2018	Factual Visual Question Answering, General Knowledge	Unverified	0
A Joint Planning and Learning Framework for Human-Aided Decision-Making	Jun 17, 2019	Decision Making, General Knowledge	Unverified	0
PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking	May 18, 2022	Deep Learning, General Knowledge	Unverified	0
Luminoso at SemEval-2018 Task 10: Distinguishing Attributes Using Text Corpora and Relational Knowledge	Jun 5, 2018	General Knowledge, Relation Extraction	Code Available	0
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General Knowledge, Math	Code Available	0
Leveraging Large Language Models for Automated Dialogue Analysis	Sep 12, 2023	General Knowledge, Language Modeling	Code Available	0
Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge	Apr 8, 2024	General Knowledge, Safety Alignment	Code Available	0
Efficient Transfer Learning for Video-language Foundation Models	Nov 18, 2024	Action Recognition, Few-Shot Learning	Code Available	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81	—	Unverified