SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 151160 of 399 papers

TitleStatusHype
Differentially Private Distributed Learning for Language Modeling Tasks0
AdaptGCD: Multi-Expert Adapter Tuning for Generalized Category Discovery0
Improving Multi-label Emotion Classification by Integrating both General and Domain-specific Knowledge0
Inductive Graph Alignment Prompt: Bridging the Gap between Graph Pre-training and Inductive Fine-tuning From Spectral Perspective0
Investigating Forgetting in Pre-Trained Representations Through Continual Learning0
Deep Prompt Multi-task Network for Abuse Language Detection0
Data structuring for the ontological modelling of wind energy systems0
Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming0
An Adaptive Deep Learning Framework for Day-ahead Forecasting of Photovoltaic Power Generation0
DAML-ST5: Low Resource Style Transfer via Domain Adaptive Meta Learning0
Show:102550
← PrevPage 16 of 40Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified