SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 351-375 of 399 papers

Title | Status | Hype
QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions |  | 0
GeoSQA: A Benchmark for Scenario-based Question Answering in the Geography Domain at High School Level |  | 0
Joey NMT: A Minimalist NMT Toolkit for Novices | Code | 0
T-Norms Driven Loss Functions for Machine Learning |  | 0
Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model |  | 0
A Joint Planning and Learning Framework for Human-Aided Decision-Making |  | 0
Generating Question Relevant Captions to Aid Visual Question Answering |  | 0
The World in My Mind: Visual Dialog with Adversarial Multi-modal Feature Encoding |  | 0
Integrating Semantic Knowledge to Tackle Zero-shot Text Classification | Code | 0
Transferable Natural Language Interface to Structured Queries aided by Adversarial Generation |  | 0
Specifying Conceptual Models Using Restricted Natural Language |  | 0
Learning to Specialize with Knowledge Distillation for Visual Question Answering |  | 0
Visual Question Answering as Reading Comprehension |  | 0
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering |  | 0
Explicit Utilization of General Knowledge in Machine Reading Comprehension |  | 0
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering |  | 0
Controversy Rules - Discovering Regions Where Classifiers (Dis-)Agree Exceptionally |  | 0
Knowledge Representation and Extraction at Scale |  | 0
Luminoso at SemEval-2018 Task 10: Distinguishing Attributes Using Text Corpora and Relational Knowledge | Code | 0
Utilisation d'une base de connaissances de spécialité et de sens commun pour la simplification de comptes-rendus radiologiques (Radiological text simplification using a general knowledge base) |  | 0
Context and Humor: Understanding Amul advertisements of India |  | 0
Efficient illumination angle self-calibration in Fourier ptychography |  | 0
A Factoid Question Answering System for Vietnamese |  | 0
Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension | Code | 0
Towards a Continuous Knowledge Learning Engine for Chatbots |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Chinchilla-70B (few-shot, k=5) | Accuracy | 94.3 |  | Unverified
2 | Gopher-280B (few-shot, k=5) | Accuracy | 93.9 |  | Unverified
3 | Chinchilla-70B (few-shot, k=5) | Accuracy | 85.7 |  | Unverified
4 | Gopher-280B (few-shot, k=5) | Accuracy | 84.8 |  | Unverified
5 | Gopher-280B (few-shot, k=5) | Accuracy | 84.2 |  | Unverified
6 | Gopher-280B (few-shot, k=5) | Accuracy | 84.1 |  | Unverified
7 | Gopher-280B (few-shot, k=5) | Accuracy | 83.9 |  | Unverified
8 | Gopher-280B (few-shot, k=5) | Accuracy | 83.3 |  | Unverified
9 | Gopher-280B (few-shot, k=5) | Accuracy | 81.8 |  | Unverified
10 | Gopher-280B (few-shot, k=5) | Accuracy | 81 |  | Unverified