General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 176–200 of 399 papers

Title	Date	Tasks	Status
Adapter-based Approaches to Knowledge-enhanced Language Models -- A Survey	Nov 25, 2024	General KnowledgeKnowledge Graphs	—Unverified
GOT4Rec: Graph of Thoughts for Sequential Recommendation	Nov 22, 2024	General KnowledgeSequential Recommendation	—Unverified
GRL-Prompt: Towards Knowledge Graph based Prompt Optimization via Reinforcement Learning	Nov 19, 2024	General KnowledgePrompt Engineering	—Unverified
Efficient Transfer Learning for Video-language Foundation Models	Nov 18, 2024	Action RecognitionFew-Shot Learning	CodeCode Available
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General KnowledgeMath	CodeCode Available
Exploring Zero-Shot Anomaly Detection with CLIP in Medical Imaging: Are We There Yet?	Nov 14, 2024	Anomaly DetectionGeneral Knowledge	—Unverified
SHARP: Unlocking Interactive Hallucination via Stance Transfer in Role-Playing Agents	Nov 12, 2024	General KnowledgeHallucination	—Unverified
Extracting Unlearned Information from LLMs with Activation Steering	Nov 4, 2024	General KnowledgeInformation Retrieval	—Unverified
Evaluating Company-specific Biases in Financial Sentiment Analysis using Large Language Models	Nov 1, 2024	General KnowledgeSentiment Analysis	—Unverified
A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics	Oct 30, 2024	General KnowledgePrompt Engineering	CodeCode Available
AdaptGCD: Multi-Expert Adapter Tuning for Generalized Category Discovery	Oct 29, 2024	General KnowledgePrompt Learning	—Unverified
Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code	Oct 24, 2024	General KnowledgeIn-Context Learning	—Unverified
Fast constrained sampling in pre-trained diffusion models	Oct 24, 2024	General Knowledge	—Unverified
Should We Really Edit Language Models? On the Evaluation of Edited Language Models	Oct 24, 2024	General KnowledgeModel Editing	CodeCode Available
Boosting LLM Translation Skills without General Ability Loss via Rationale Distillation	Oct 17, 2024	General KnowledgeInstruction Following	—Unverified
Large Language Models as a Tool for Mining Object Knowledge	Oct 16, 2024	General KnowledgeKnowledge Base Construction	—Unverified
Enhance Graph Alignment for Large Language Models	Oct 15, 2024	General KnowledgeText Matching	—Unverified
MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation	Oct 15, 2024	General KnowledgeSegmentation	—Unverified
Thinking LLMs: General Instruction Following with Thought Generation	Oct 14, 2024	General KnowledgeInstruction Following	—Unverified
Scalable Multi-Domain Adaptation of Language Models using Modular Experts	Oct 14, 2024	Domain AdaptationGeneral Knowledge	—Unverified
Distribution-aware Noisy-label Crack Segmentation	Oct 12, 2024	Crack SegmentationDomain Generalization	CodeCode Available
Nudging: Inference-time Alignment of LLMs via Guided Decoding	Oct 11, 2024	General KnowledgeGSM8K	—Unverified
Few Exemplar-Based General Medical Image Segmentation via Domain-Aware Selective Adaptation	Oct 11, 2024	General KnowledgeImage Segmentation	—Unverified
Mars: Situated Inductive Reasoning in an Open-World Environment	Oct 10, 2024	Decision MakingGeneral Knowledge	—Unverified
Composite Learning Units: Generalized Learning Beyond Parameter Updates to Transform LLMs into Adaptive Reasoners	Oct 9, 2024	General Knowledge	—Unverified

Show:10 25 50

← PrevPage 8 of 16Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Chinchilla-70B (few-shot, k=5)	Accuracy	94.3	—	Unverified
2	Gopher-280B (few-shot, k=5)	Accuracy	93.9	—	Unverified
3	Chinchilla-70B (few-shot, k=5)	Accuracy	85.7	—	Unverified
4	Gopher-280B (few-shot, k=5)	Accuracy	84.8	—	Unverified
5	Gopher-280B (few-shot, k=5)	Accuracy	84.2	—	Unverified
6	Gopher-280B (few-shot, k=5)	Accuracy	84.1	—	Unverified
7	Gopher-280B (few-shot, k=5)	Accuracy	83.9	—	Unverified
8	Gopher-280B (few-shot, k=5)	Accuracy	83.3	—	Unverified
9	Gopher-280B (few-shot, k=5)	Accuracy	81.8	—	Unverified
10	Gopher-280B (few-shot, k=5)	Accuracy	81	—	Unverified