General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–300 of 399 papers

Title	Date	Tasks	Status
KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution	Jan 12, 2023	Entity ResolutionGeneral Knowledge	—Unverified
PASS-FC: Progressive and Adaptive Search Scheme for Fact Checking of Comprehensive Claims	Apr 14, 2025	Fact CheckingGeneral Knowledge	—Unverified
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models	Feb 3, 2025	General Knowledge	—Unverified
Pilot: Building the Federated Multimodal Instruction Tuning Framework	Jan 23, 2025	General Knowledge	—Unverified
PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment	Dec 18, 2022	Data AugmentationDialogue Evaluation	—Unverified
Pretraining and Updates of Domain-Specific LLM: A Case Study in the Japanese Business Domain	Apr 12, 2024	Continual PretrainingGeneral Knowledge	—Unverified
Proceedings of the ISCA/ITG Workshop on Diversity in Large Speech and Language Models	Mar 12, 2025	DiversityGeneral Knowledge	—Unverified
Profit: Benchmarking Personalization and Robustness Trade-off in Federated Prompt Tuning	Oct 6, 2023	BenchmarkingFederated Learning	—Unverified
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian	Jul 30, 2024	Document ClassificationEntity Typing	—Unverified
QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions	Sep 8, 2019	General Knowledge	—Unverified
Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training	Jul 7, 2025	General KnowledgeMMLU	—Unverified
Rethinking Two Consensuses of the Transferability in Deep Learning	Dec 1, 2022	Deep LearningGeneral Knowledge	—Unverified
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning	Jul 3, 2024	Few-Shot LearningGeneral Knowledge	—Unverified
SAGE: Smart home Agent with Grounded Execution	Nov 1, 2023	Common Sense ReasoningGeneral Knowledge	—Unverified
SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation	Apr 29, 2025	General KnowledgeImage Segmentation	—Unverified
Learning to Adapt SAM for Segmenting Cross-domain Point Clouds	Oct 13, 2023	Domain AdaptationGeneral Knowledge	—Unverified
SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation	Jul 6, 2024	General KnowledgeImage Segmentation	—Unverified
Sample-Efficient Behavior Cloning Using General Domain Knowledge	Jan 27, 2025	Car RacingFeature Engineering	—Unverified
Scalable Multi-Domain Adaptation of Language Models using Modular Experts	Oct 14, 2024	Domain AdaptationGeneral Knowledge	—Unverified
Scene-Driven Multimodal Knowledge Graph Construction for Embodied AI	Nov 7, 2023	General Knowledgegraph construction	—Unverified
Score: A Rule Engine for the Scone Knowledge Base System	May 7, 2023	General Knowledge	—Unverified
scReader: Prompting Large Language Models to Interpret scRNA-seq Data	Dec 24, 2024	General Knowledge	—Unverified
Sculpting [CLS] Features for Pre-Trained Model-Based Class-Incremental Learning	Feb 20, 2025	class-incremental learningClass Incremental Learning	—Unverified
Semi-Supervised Medical Image Segmentation via Knowledge Mining from Large Models	Mar 10, 2025	General KnowledgeImage Segmentation	—Unverified
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models	Feb 18, 2025	Code GenerationGeneral Knowledge	—Unverified
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge	May 15, 2024	General KnowledgeKnowledge Graphs	—Unverified
Some Epistemological Problems with the Knowledge Level in Cognitive Architectures	Nov 26, 2015	General Knowledge	—Unverified
Specifying Conceptual Models Using Restricted Natural Language	Dec 1, 2018	General Knowledge	—Unverified
Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer	Apr 29, 2021	General KnowledgeKnowledge Distillation	—Unverified
Spoken Conversational Search for General Knowledge	Sep 26, 2019	Conversational Question AnsweringConversational Search	—Unverified
STELLA: Towards Protein Function Prediction with Multimodal LLMs Integrating Sequence-Structure Representations	Jun 4, 2025	Drug DiscoveryGeneral Knowledge	—Unverified
Stop Words for Processing Software Engineering Documents: Do they Matter?	Mar 18, 2023	General Knowledge	—Unverified
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering	Sep 4, 2018	Factual Visual Question AnsweringGeneral Knowledge	—Unverified
Successive POI Recommendation via Brain-inspired Spatiotemporal Aware Representation	Sep 29, 2021	General Knowledge	—Unverified
TabMCQ: A Dataset of General Knowledge Tables and Multiple-choice Questions	Feb 12, 2016	General KnowledgeMultiple-choice	—Unverified
Teaching Uncertainty Quantification in Machine Learning through Use Cases	Aug 19, 2021	BIG-bench Machine LearningGeneral Knowledge	—Unverified
Tencent AI Lab Machine Translation Systems for WMT20 Chat Translation Task	Nov 1, 2020	General KnowledgeMachine Translation	—Unverified
Ten Lessons We Have Learned in the New "Sparseland": A Short Handbook for Sparse Neural Network Researchers	Feb 6, 2023	General Knowledge	—Unverified
The Scaling Law for LoRA Base on Mutual Information Upper Bound	Jan 6, 2025	General Knowledge	—Unverified
The Wisdom of Crowds in the Recollection of Order Information	Dec 1, 2009	General Knowledge	—Unverified
The World in My Mind: Visual Dialog with Adversarial Multi-modal Feature Encoding	Jun 1, 2019	General KnowledgeVisual Dialog	—Unverified
Thinking LLMs: General Instruction Following with Thought Generation	Oct 14, 2024	General KnowledgeInstruction Following	—Unverified
TOV: The Original Vision Model for Optical Remote Sensing Image Understanding via Self-supervised Learning	Apr 10, 2022	General Knowledgeobject-detection	—Unverified
Towards a Continuous Knowledge Learning Engine for Chatbots	Feb 16, 2018	General KnowledgeKnowledge Base Completion	—Unverified
Towards Few-shot Out-of-Distribution Detection	Nov 20, 2023	General KnowledgeOut-of-Distribution Detection	—Unverified
Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs	Apr 29, 2024	DiagnosticGeneral Knowledge	—Unverified
Towards Ontology Reshaping for KG Generation with User-in-the-Loop: Applied to Bosch Welding	Sep 22, 2022	General KnowledgeKnowledge Graphs	—Unverified
Transaction Logic with (Complex) Events	May 15, 2014	General Knowledge	—Unverified
Transferable Natural Language Interface to Structured Queries aided by Adversarial Generation	Dec 4, 2018	Data AugmentationDomain Adaptation	—Unverified
Transfer learning of chaotic systems	Nov 15, 2020	General KnowledgeTime Series	—Unverified

Show:10 25 50

← PrevPage 6 of 8Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81	—	Unverified