General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Papers

Showing 301–350 of 399 papers

Title	Date	Tasks	Status
ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints	Feb 23, 2022	3D Reconstruction, General Knowledge	Unverified
QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions	Sep 8, 2019	General Knowledge	Unverified
Autonomous Intelligent Software Development	Aug 12, 2022	General Knowledge	Unverified
A Unified Industrial Large Knowledge Model Framework in Industry 4.0 and Smart Manufacturing	Dec 22, 2023	General Knowledge	Unverified
Assessing Look-Ahead Bias in Stock Return Predictions Generated By GPT Sentiment Analysis	Sep 29, 2023	General Knowledge, Sentiment Analysis	Unverified
ASLseg: Adapting SAM in the Loop for Semi-supervised Liver Tumor Segmentation	Dec 13, 2023	General Knowledge, Image Segmentation	Unverified
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives	May 20, 2025	Caption Generation, Contrastive Learning	Unverified
Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training	Jul 7, 2025	General Knowledge, MMLU	Unverified
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources	Nov 22, 2015	Form, General Knowledge	Unverified
Visual Question Answering as Reading Comprehension	Nov 29, 2018	Common Sense Reasoning, General Knowledge	Unverified
Rethinking Two Consensuses of the Transferability in Deep Learning	Dec 1, 2022	Deep Learning, General Knowledge	Unverified
A Self-Supervised Learning of a Foundation Model for Analog Layout Design Automation	Mar 28, 2025	General Knowledge, Layout Design	Unverified
Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems	Dec 19, 2024	General Knowledge, Recommendation Systems	Unverified
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning	Jul 3, 2024	Few-Shot Learning, General Knowledge	Unverified
SAGE: Smart home Agent with Grounded Execution	Nov 1, 2023	Common Sense Reasoning, General Knowledge	Unverified
Are LLMs Good Cryptic Crossword Solvers?	Mar 15, 2024	General Knowledge	Unverified
Applying SoftTriple Loss for Supervised Language Model Fine Tuning	Dec 15, 2021	General Knowledge, Language Modeling	Unverified
SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation	Apr 29, 2025	General Knowledge, Image Segmentation	Unverified
Learning to Adapt SAM for Segmenting Cross-domain Point Clouds	Oct 13, 2023	Domain Adaptation, General Knowledge	Unverified
SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation	Jul 6, 2024	General Knowledge, Image Segmentation	Unverified
Sample-Efficient Behavior Cloning Using General Domain Knowledge	Jan 27, 2025	Car Racing, Feature Engineering	Unverified
Scalable Multi-Domain Adaptation of Language Models using Modular Experts	Oct 14, 2024	Domain Adaptation, General Knowledge	Unverified
AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis	Mar 10, 2025	Diversity, General Knowledge	Unverified
Scene-Driven Multimodal Knowledge Graph Construction for Embodied AI	Nov 7, 2023	General Knowledge, graph construction	Unverified
When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering	Apr 19, 2024	General Knowledge	Unverified
Score: A Rule Engine for the Scone Knowledge Base System	May 7, 2023	General Knowledge	Unverified
scReader: Prompting Large Language Models to Interpret scRNA-seq Data	Dec 24, 2024	General Knowledge	Unverified
Sculpting [CLS] Features for Pre-Trained Model-Based Class-Incremental Learning	Feb 20, 2025	class-incremental learning, Class Incremental Learning	Unverified
An Energy Ontology for Global City Indicators (ISO 37120)	Jul 19, 2020	General Knowledge	Unverified
Analysis of Watson's Strategies for Playing Jeopardy!	Feb 4, 2014	Decision Making, General Knowledge	Unverified
An Ad-hoc graph node vector embedding algorithm for general knowledge graphs using Kinetica-Graph	Jul 22, 2024	General Knowledge, Knowledge Graphs	Unverified
Semi-Supervised Medical Image Segmentation via Knowledge Mining from Large Models	Mar 10, 2025	General Knowledge, Image Segmentation	Unverified
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models	Feb 18, 2025	Code Generation, General Knowledge	Unverified
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making	May 6, 2025	Decision Making, General Knowledge	Unverified
An Adaptive Deep Learning Framework for Day-ahead Forecasting of Photovoltaic Power Generation	Sep 28, 2021	General Knowledge	Unverified
All Roads Lead to Rome: Unveiling the Trajectory of Recommender Systems Across the LLM Era	Jul 14, 2024	All, Conversational Recommendation	Unverified
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge	May 15, 2024	General Knowledge, Knowledge Graphs	Unverified
Some Epistemological Problems with the Knowledge Level in Cognitive Architectures	Nov 26, 2015	General Knowledge	Unverified
Specifying Conceptual Models Using Restricted Natural Language	Dec 1, 2018	General Knowledge	Unverified
Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer	Apr 29, 2021	General Knowledge, Knowledge Distillation	Unverified
Spoken Conversational Search for General Knowledge	Sep 26, 2019	Conversational Question Answering, Conversational Search	Unverified
STELLA: Towards Protein Function Prediction with Multimodal LLMs Integrating Sequence-Structure Representations	Jun 4, 2025	Drug Discovery, General Knowledge	Unverified
Stop Words for Processing Software Engineering Documents: Do they Matter?	Mar 18, 2023	General Knowledge	Unverified
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering	Sep 4, 2018	Factual Visual Question Answering, General Knowledge	Unverified
AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking	Mar 4, 2025	Chunking, General Knowledge	Unverified
Successive POI Recommendation via Brain-inspired Spatiotemporal Aware Representation	Sep 29, 2021	General Knowledge	Unverified
A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming	Sep 18, 2019	General Knowledge, Reinforcement Learning	Unverified
GeoSQA: A Benchmark for Scenario-based Question Answering in the Geography Domain at High School Level	Aug 20, 2019	General Knowledge, Multiple-choice	Unverified
TabMCQ: A Dataset of General Knowledge Tables and Multiple-choice Questions	Feb 12, 2016	General Knowledge, Multiple-choice	Unverified
"When Words Fail, Emojis Prevail": Generating Sarcastic Utterances with Emoji Using Valence Reversal and Semantic Incongruity	May 6, 2023	General Knowledge, Sentence	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81	—	Unverified