General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 326–350 of 399 papers

Title	Date	Tasks	Status
Neural Discourse Relation Recognition with Semantic Memory	Mar 12, 2016	General KnowledgeRelation	—Unverified
Neural Regularized Domain Adaptation for Chinese Word Segmentation	Dec 1, 2017	Chinese Word SegmentationDomain Adaptation	—Unverified
Shifted Autoencoders for Point Annotation Restoration in Object Counting	Dec 12, 2023	General KnowledgeObject	—Unverified
Nudging: Inference-time Alignment of LLMs via Guided Decoding	Oct 11, 2024	General KnowledgeGSM8K	—Unverified
One to Many: Adaptive Instrument Segmentation via Meta Learning and Dynamic Online Adaptation in Robotic Surgical Video	Mar 24, 2021	General KnowledgeMeta-Learning	—Unverified
On the Usage of Continual Learning for Out-of-Distribution Generalization in Pre-trained Language Models of Code	May 6, 2023	Continual LearningGeneral Knowledge	—Unverified
Organizing Linked Data Quality Related Methods	May 30, 2013	General Knowledge	—Unverified
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering	Nov 1, 2018	Factual Visual Question AnsweringGeneral Knowledge	—Unverified
A Joint Planning and Learning Framework for Human-Aided Decision-Making	Jun 17, 2019	Decision MakingGeneral Knowledge	—Unverified
PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking	May 18, 2022	Deep LearningGeneral Knowledge	—Unverified
Luminoso at SemEval-2018 Task 10: Distinguishing Attributes Using Text Corpora and Relational Knowledge	Jun 5, 2018	General KnowledgeRelation Extraction	CodeCode Available
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General KnowledgeMath	CodeCode Available
Leveraging Large Language Models for Automated Dialogue Analysis	Sep 12, 2023	General KnowledgeLanguage Modeling	CodeCode Available
Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge	Apr 8, 2024	General KnowledgeSafety Alignment	CodeCode Available
Efficient Transfer Learning for Video-language Foundation Models	Nov 18, 2024	Action RecognitionFew-Shot Learning	CodeCode Available
Unveiling Causal Reasoning in Large Language Models: Reality or Mirage?	Jun 26, 2025	counterfactualGeneral Knowledge	CodeCode Available
Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models	Aug 19, 2024	DescriptiveDrug Discovery	CodeCode Available
Task-Driven and Experience-Based Question Answering Corpus for In-Home Robot Application in the House3D Virtual Environment	Jun 1, 2022	General KnowledgeQuestion Answering	CodeCode Available
ContextFlow++: Generalist-Specialist Flow-based Generative Models with Mixed-Variable Context Encoding	Jun 2, 2024	Anomaly DetectionDensity Estimation	CodeCode Available
BnMMLU: Measuring Massive Multitask Language Understanding in Bengali	May 25, 2025	General KnowledgeMMLU	CodeCode Available
Avoiding Copyright Infringement via Large Language Model Unlearning	Jun 16, 2024	General KnowledgeLanguage Modeling	CodeCode Available
Learning to Understand Phrases by Embedding the Dictionary	Apr 2, 2015	General Knowledge	CodeCode Available
Efficient Relation-aware Neighborhood Aggregation in Graph Neural Networks via Tensor Decomposition	Dec 11, 2022	Contrastive LearningGeneral Knowledge	CodeCode Available
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction	May 16, 2025	General KnowledgeZero-shot Generalization	CodeCode Available
SciDeBERTa: Learning DeBERTa for Science Technology Documents and Fine-Tuning Information Extraction Tasks	Jun 8, 2022	General KnowledgeJoint Entity and Relation Extraction	CodeCode Available

Show:10 25 50

← PrevPage 14 of 16Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81	—	Unverified