General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Papers

Showing 351–399 of 399 papers

Title	Date	Tasks	Status
Test-Time Self-Adaptive Small Language Models for Question Answering	Oct 20, 2023	General Knowledge, Question Answering	Code Available
Connecting a French Dictionary from the Beginning of the 20th Century to Wikidata	Jun 22, 2022	General Knowledge	Code Available
What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge	Dec 31, 2019	General Knowledge, Knowledge Graphs	Code Available
Pruning neural network models for gene regulatory dynamics using data and domain knowledge	Mar 5, 2024	General Knowledge, Network Pruning	Code Available
Effective Skill Unlearning through Intervention and Abstention	Mar 27, 2025	General Knowledge, Math	Code Available
Learning to Learn Variational Semantic Memory	Oct 20, 2020	Few-Shot Learning, General Knowledge	Code Available
Domain Generalization via Model-Agnostic Learning of Semantic Features	Oct 29, 2019	Domain Generalization, General Knowledge	Code Available
Dive into the Resolution Augmentations and Metrics in Low Resolution Face Recognition: A Plain yet Effective New Baseline	Feb 11, 2023	Face Recognition, General Knowledge	Code Available
Comprehensive Fair Meta-learned Recommender System	Jun 9, 2022	counterfactual, Fairness	Code Available
A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics	Oct 30, 2024	General Knowledge, Prompt Engineering	Code Available
Knowledge graphs for empirical concept retrieval	Apr 10, 2024	General Knowledge, Knowledge Graphs	Code Available
Should We Really Edit Language Models? On the Evaluation of Edited Language Models	Oct 24, 2024	General Knowledge, Model Editing	Code Available
Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling	Nov 15, 2022	General Knowledge, Knowledge Distillation	Code Available
Joey NMT: A Minimalist NMT Toolkit for Novices	Jul 29, 2019	General Knowledge, Machine Translation	Code Available
Distribution-aware Noisy-label Crack Segmentation	Oct 12, 2024	Crack Segmentation, Domain Generalization	Code Available
Distilling Stereo Networks for Performant and Efficient Leaner Networks	Mar 24, 2025	General Knowledge, Knowledge Distillation	Code Available
Patching as Translation: the Data and the Metaphor	Aug 24, 2020	General Knowledge, Program Repair	Code Available
PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization	Nov 16, 2023	Document Summarization, General Knowledge	Code Available
What Makes Cryptic Crosswords Challenging for LLMs?	Dec 12, 2024	General Knowledge	Code Available
Are Large Language Models a Good Replacement of Taxonomies?	Jun 17, 2024	General Knowledge, Knowledge Graphs	Code Available
Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models	Apr 6, 2025	Computational Efficiency, General Knowledge	Code Available
Integrating Semantic Knowledge to Tackle Zero-shot Text Classification	Mar 29, 2019	Classification, Data Augmentation	Code Available
Commonsense Knowledge in Word Associations and ConceptNet	Sep 20, 2021	General Knowledge, Knowledge Graphs	Code Available
Improving Personalized Search with Regularized Low-Rank Parameter Updates	Jun 11, 2025	General Knowledge, Image Retrieval	Code Available
Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models	Nov 27, 2023	General Knowledge, image-classification	Code Available
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models	Jun 4, 2025	Benchmarking, General Knowledge	Code Available
Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via Reinforcement Learning	Jun 13, 2023	General Knowledge, Management	Code Available
How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities	Mar 20, 2025	General Knowledge, Language Modeling	Code Available
Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension	Mar 1, 2018	General Knowledge, Reading Comprehension	Code Available
WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models	Jul 25, 2022	Common Sense Reasoning, General Knowledge	Code Available
PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning	Jul 16, 2025	Continual Learning, General Knowledge	Code Available
G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks	Dec 7, 2022	General Knowledge, Language Modeling	Code Available
Visual Question Answering: A Survey of Methods and Datasets	Jul 20, 2016	General Knowledge, Survey	Code Available
From Knowledge to Reasoning: Evaluating LLMs for Ionic Liquids Research in Chemical and Biological Engineering	May 11, 2025	Benchmarking, General Knowledge	Code Available
Quantized Prompt for Efficient Generalization of Vision-Language Models	Jul 15, 2024	General Knowledge, Language Modelling	Code Available
World Knowledge in Multiple Choice Reading Comprehension	Nov 13, 2022	General Knowledge, Multiple-choice	Code Available
Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis	Mar 12, 2025	Diagnostic, General Knowledge	Code Available
Fed-CO2: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning	Dec 21, 2023	Domain Generalization, Federated Learning	Code Available
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model	Oct 26, 2023	Data Augmentation, General Knowledge	Code Available
Exploring Recommendation Capabilities of GPT-4V(ision): A Preliminary Case Study	Nov 7, 2023	General Knowledge, Reading Comprehension	Code Available
REFinD: Relation Extraction Financial Dataset	May 22, 2023	Articles, General Knowledge	Code Available
Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov Logic	Mar 18, 2025	General Knowledge, Image Captioning	Code Available
Exploiting Adapters for Cross-lingual Low-resource Speech Recognition	May 18, 2021	Cross-Lingual ASR, General Knowledge	Code Available
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content	Jun 17, 2024	Benchmarking, General Knowledge	Code Available
Towards Knowledge-Augmented Visual Question Answering	Dec 1, 2020	General Knowledge, Graph Attention	Code Available
ExplainCPE: A Free-text Explanation Benchmark of Chinese Pharmacist Examination	May 22, 2023	Diversity, General Knowledge	Code Available
Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph	May 22, 2023	General Knowledge, Question Answering	Code Available
DAGPrompT: Pushing the Limits of Graph Prompting with a Distribution-aware Graph Prompt Tuning Approach	Jan 25, 2025	General Knowledge, Graph Classification	Code Available
Survey on Abstractive Text Summarization: Dataset, Models, and Metrics	Dec 22, 2024	Abstractive Text Summarization, General Knowledge	Code Available

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81.0	—	Unverified