SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 226250 of 399 papers

TitleStatusHype
Learning Knowledge Graphs for Question Answering through Conversational Dialog0
Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization0
Transfer learning of chaotic systems0
Learning to Model the World with Language0
Controversy Rules - Discovering Regions Where Classifiers (Dis-)Agree Exceptionally0
Learning to Specialize with Knowledge Distillation for Visual Question Answering0
Transformer Based Bengali Chatbot Using General Knowledge Dataset0
Learning Unknown Spoof Prompts for Generalized Face Anti-Spoofing Using Only Real Face Images0
Context and Humor: Understanding Amul advertisements of India0
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation0
Leveraging Large Language Models for enhanced personalised user experience in Smart Homes0
LLM4WM: Adapting LLM for Wireless Multi-Tasking0
Constructing Enhanced Mutual Information for Online Class-Incremental Learning0
ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis0
LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models0
Low-Resource Adaptation of Open-Domain Generative Chatbots0
Low Resource Style Transfer via Domain Adaptive Meta Learning0
Low Resource Style Transfer via Domain Adaptive Meta Learning0
TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation0
MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation0
Mars: Situated Inductive Reasoning in an Open-World Environment0
Composite Learning Units: Generalized Learning Beyond Parameter Updates to Transform LLMs into Adaptive Reasoners0
Meta-Inductive Node Classification across Graphs0
Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning0
Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text0
Show:102550
← PrevPage 10 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified