General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 271–280 of 399 papers

Title	Date	Tasks	Status	Hype	Score
Nudging: Inference-time Alignment of LLMs via Guided Decoding	Oct 11, 2024	General Knowledge, GSM8K	Unverified	0	0
Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving	Sep 4, 2024	Autonomous Driving, Decision Making	Unverified	0	0
One to Many: Adaptive Instrument Segmentation via Meta Learning and Dynamic Online Adaptation in Robotic Surgical Video	Mar 24, 2021	General Knowledge, Meta-Learning	Unverified	0	0
On the Usage of Continual Learning for Out-of-Distribution Generalization in Pre-trained Language Models of Code	May 6, 2023	Continual Learning, General Knowledge	Unverified	0	0
Organizing Linked Data Quality Related Methods	May 30, 2013	General Knowledge	Unverified	0	0
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering	Nov 1, 2018	Factual Visual Question Answering, General Knowledge	Unverified	0	0
Learning Electromagnetic Metamaterial Physics With ChatGPT	Apr 23, 2024	General Knowledge	Unverified	0	0
A Joint Planning and Learning Framework for Human-Aided Decision-Making	Jun 17, 2019	Decision Making, General Knowledge	Unverified	0	0
CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering	Jan 30, 2025	General Knowledge, Language Modeling	Unverified	0	0
Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code	Oct 24, 2024	General Knowledge, In-Context Learning	Unverified	0	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81.0	—	Unverified