SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 201210 of 399 papers

TitleStatusHype
KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration0
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling0
Differentially Private Distributed Learning for Language Modeling Tasks0
Key Factors Affecting European Reactions to AI in European Full and Flawed Democracies0
Deep Prompt Multi-task Network for Abuse Language Detection0
K-Link: Knowledge-Link Graph from LLMs for Enhanced Representation Learning in Multivariate Time-Series Data0
KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models0
Knowledge-aware Neural Collective Matrix Factorization for Cross-domain Recommendation0
Knowledgebra: An Algebraic Learning Framework for Knowledge Graph0
Knowledge Completion for Generics using Guided Tensor Factorization0
Show:102550
← PrevPage 21 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified