SOTAVerified

General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 151–175 of 399 papers

Title | Status | Hype
Leveraging Large Language Models for Automated Dialogue Analysis | Code | 0
Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis | Code | 0
Commonsense Knowledge in Word Associations and ConceptNet | Code | 0
Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning | Code | 0
Learning to Learn Variational Semantic Memory | Code | 0
Learning to Understand Phrases by Embedding the Dictionary | Code | 0
Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling | Code | 0
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model | Code | 0
Knowledge graphs for empirical concept retrieval | Code | 0
A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics | Code | 0
Integrating Semantic Knowledge to Tackle Zero-shot Text Classification | Code | 0
Exploring Recommendation Capabilities of GPT-4V(ision): A Preliminary Case Study | Code | 0
Improving Personalized Search with Regularized Low-Rank Parameter Updates | Code | 0
Joey NMT: A Minimalist NMT Toolkit for Novices | Code | 0
Exploiting Adapters for Cross-lingual Low-resource Speech Recognition | Code | 0
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction | Code | 0
ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination | Code | 0
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models | Code | 0
Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph | Code | 0
Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models | Code | 0
Pruning neural network models for gene regulatory dynamics using data and domain knowledge | Code | 0
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Code | 0
Evaluating Polish linguistic and cultural competency in large language models | | 0
Evaluating Consistency and Reasoning Capabilities of Large Language Models | | 0
Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving | | 0
Page 7 of 16

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Chinchilla-70B (few-shot, k=5) | Accuracy | 94.3 | | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 93.9 | | Unverified
3 | Chinchilla-70B (few-shot, k=5) | Accuracy | 85.7 | | Unverified
4 | Gopher-280B (few-shot, k=5) | Accuracy | 84.8 | | Unverified
5 | Gopher-280B (few-shot, k=5) | Accuracy | 84.2 | | Unverified
6 | Gopher-280B (few-shot, k=5) | Accuracy | 84.1 | | Unverified
7 | Gopher-280B (few-shot, k=5) | Accuracy | 83.9 | | Unverified
8 | Gopher-280B (few-shot, k=5) | Accuracy | 83.3 | | Unverified
9 | Gopher-280B (few-shot, k=5) | Accuracy | 81.8 | | Unverified
10 | Gopher-280B (few-shot, k=5) | Accuracy | 81 | | Unverified