SOTAVerified|Agents Browse Leaderboard About

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 221–230 of 399 papers

Title	Date	Tasks	Status	Hype
Are Large Language Models a Good Replacement of Taxonomies?	Jun 17, 2024	General KnowledgeKnowledge Graphs	CodeCode Available	0
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content	Jun 17, 2024	BenchmarkingGeneral Knowledge	CodeCode Available	0
Avoiding Copyright Infringement via Large Language Model Unlearning	Jun 16, 2024	General KnowledgeLanguage Modeling	CodeCode Available	0
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming	Jun 14, 2024	BenchmarkingGeneral Knowledge	—Unverified	0
Learning from Natural Language Explanations for Generalizable Entity Matching	Jun 13, 2024	Binary ClassificationDomain Generalization	—Unverified	0
Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers	Jun 7, 2024	General KnowledgeQuestion Generation	—Unverified	0
ContextFlow++: Generalist-Specialist Flow-based Generative Models with Mixed-Variable Context Encoding	Jun 2, 2024	Anomaly DetectionDensity Estimation	CodeCode Available	0
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge	May 15, 2024	General KnowledgeKnowledge Graphs	—Unverified	0
MoST: Multi-modality Scene Tokenization for Motion Prediction	Apr 30, 2024	General Knowledgemotion prediction	—Unverified	0
Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs	Apr 29, 2024	DiagnosticGeneral Knowledge	—Unverified	0

Show:10 25 50

← PrevPage 23 of 40Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81	—	Unverified