General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 111–120 of 399 papers

Title	Date	Tasks	Status	Hype	Score
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General Knowledge, Math	Code Available	0	5
Efficient Transfer Learning for Video-language Foundation Models	Nov 18, 2024	Action Recognition, Few-Shot Learning	Code Available	0	5
Efficient Relation-aware Neighborhood Aggregation in Graph Neural Networks via Tensor Decomposition	Dec 11, 2022	Contrastive Learning, General Knowledge	Code Available	0	5
Luminoso at SemEval-2018 Task 10: Distinguishing Attributes Using Text Corpora and Relational Knowledge	Jun 5, 2018	General Knowledge, Relation Extraction	Code Available	0	5
Effective Skill Unlearning through Intervention and Abstention	Mar 27, 2025	General Knowledge, Math	Code Available	0	5
Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models	Aug 19, 2024	Descriptive, Drug Discovery	Code Available	0	5
BnMMLU: Measuring Massive Multitask Language Understanding in Bengali	May 25, 2025	General Knowledge, MMLU	Code Available	0	5
Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via Reinforcement Learning	Jun 13, 2023	General Knowledge, Management	Code Available	0	5
Learning to Understand Phrases by Embedding the Dictionary	Apr 2, 2015	General Knowledge	Code Available	0	5
Leveraging Large Language Models for Automated Dialogue Analysis	Sep 12, 2023	General Knowledge, Language Modeling	Code Available	0	5

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81.0	—	Unverified