SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 251275 of 399 papers

TitleStatusHype
Microsoft Cloud-based Digitization Workflow with Rich Metadata Acquisition for Cultural Heritage Objects0
Colo-SCRL: Self-Supervised Contrastive Representation Learning for Colonoscopic Video Retrieval0
TURNER: The Uncertainty-based Retrieval Framework for Chinese NER0
Collective inference of the truth of propositions from crowd probability judgments0
MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs0
MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning0
Model Compression with Two-stage Multi-teacher Knowledge Distillation for Web Question Answering System0
Understanding Inequality of LLM Fact-Checking over Geographic Regions with Agent and Retrieval models0
Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model0
MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning0
MoST: Multi-modality Scene Tokenization for Motion Prediction0
Motif-Based Prompt Learning for Universal Cross-Domain Recommendation0
Collaborative ontology sharing and editing0
Multilingual Tourist Assistance using ChatGPT: Comparing Capabilities in Hindi, Telugu, and Kannada0
Multi-task Federated Learning with Encoder-Decoder Structure: Enabling Collaborative Learning Across Different Tasks0
Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation0
Neural Discourse Relation Recognition with Semantic Memory0
Neural Regularized Domain Adaptation for Chinese Word Segmentation0
Shifted Autoencoders for Point Annotation Restoration in Object Counting0
Universal Item Tokenization for Transferable Generative Recommendation0
Nudging: Inference-time Alignment of LLMs via Guided Decoding0
Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving0
One to Many: Adaptive Instrument Segmentation via Meta Learning and Dynamic Online Adaptation in Robotic Surgical Video0
On the Usage of Continual Learning for Out-of-Distribution Generalization in Pre-trained Language Models of Code0
Organizing Linked Data Quality Related Methods0
Show:102550
← PrevPage 11 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified