SOTAVerified

General Knowledge

This task evaluates a model's ability to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 351–399 of 399 papers

| Title | Status | Hype |
|---|---|---|
| Domain Generalization via Model-Agnostic Learning of Semantic Features | Code | 0 |
| Model Compression with Two-stage Multi-teacher Knowledge Distillation for Web Question Answering System | | 0 |
| Spoken Conversational Search for General Knowledge | | 0 |
| A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming | | 0 |
| QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions | | 0 |
| GeoSQA: A Benchmark for Scenario-based Question Answering in the Geography Domain at High School Level | | 0 |
| Joey NMT: A Minimalist NMT Toolkit for Novices | Code | 0 |
| T-Norms Driven Loss Functions for Machine Learning | | 0 |
| Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Graphical Model | | 0 |
| A Joint Planning and Learning Framework for Human-Aided Decision-Making | | 0 |
| Generating Question Relevant Captions to Aid Visual Question Answering | | 0 |
| The World in My Mind: Visual Dialog with Adversarial Multi-modal Feature Encoding | | 0 |
| Integrating Semantic Knowledge to Tackle Zero-shot Text Classification | Code | 0 |
| Transferable Natural Language Interface to Structured Queries aided by Adversarial Generation | | 0 |
| Specifying Conceptual Models Using Restricted Natural Language | | 0 |
| Learning to Specialize with Knowledge Distillation for Visual Question Answering | | 0 |
| Visual Question Answering as Reading Comprehension | | 0 |
| Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering | | 0 |
| Explicit Utilization of General Knowledge in Machine Reading Comprehension | | 0 |
| Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering | | 0 |
| Controversy Rules - Discovering Regions Where Classifiers (Dis-)Agree Exceptionally | | 0 |
| Knowledge Representation and Extraction at Scale | | 0 |
| Luminoso at SemEval-2018 Task 10: Distinguishing Attributes Using Text Corpora and Relational Knowledge | Code | 0 |
| Utilisation d'une base de connaissances de spécialité et de sens commun pour la simplification de comptes-rendus radiologiques (Radiological text simplification using a general knowledge base) | | 0 |
| Context and Humor: Understanding Amul advertisements of India | | 0 |
| Efficient illumination angle self-calibration in Fourier ptychography | | 0 |
| A Factoid Question Answering System for Vietnamese | | 0 |
| Yuanfudao at SemEval-2018 Task 11: Three-way Attention and Relational Knowledge for Commonsense Machine Comprehension | Code | 0 |
| Towards a Continuous Knowledge Learning Engine for Chatbots | | 0 |
| Distributed Fine-tuning of Language Models on Private Data | | 0 |
| Differentially Private Distributed Learning for Language Modeling Tasks | | 0 |
| Neural Regularized Domain Adaptation for Chinese Word Segmentation | | 0 |
| Knowledge Completion for Generics using Guided Tensor Factorization | | 0 |
| Visual Question Answering: A Survey of Methods and Datasets | Code | 0 |
| Neural Discourse Relation Recognition with Semantic Memory | | 0 |
| Image Captioning and Visual Question Answering Based on Attributes and External Knowledge | | 0 |
| TabMCQ: A Dataset of General Knowledge Tables and Multiple-choice Questions | | 0 |
| Intelligent Conversational Bot for Massive Online Open Courses (MOOCs) | | 0 |
| Some Epistemological Problems with the Knowledge Level in Cognitive Architectures | | 0 |
| Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources | | 0 |
| Data structuring for the ontological modelling of wind energy systems | | 0 |
| Learning Knowledge Graphs for Question Answering through Conversational Dialog | | 0 |
| Learning to Understand Phrases by Embedding the Dictionary | Code | 0 |
| Transaction Logic with (Complex) Events | | 0 |
| Analysis of Watson's Strategies for Playing Jeopardy! | | 0 |
| Organizing Linked Data Quality Related Methods | | 0 |
| Collaborative ontology sharing and editing | | 0 |
| A Dynamic Approach to Probabilistic Inference | | 0 |
| The Wisdom of Crowds in the Recollection of Order Information | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Chinchilla-70B (few-shot, k=5) | Accuracy | 94.3 | | Unverified |
| 2 | Gopher-280B (few-shot, k=5) | Accuracy | 93.9 | | Unverified |
| 3 | Chinchilla-70B (few-shot, k=5) | Accuracy | 85.7 | | Unverified |
| 4 | Gopher-280B (few-shot, k=5) | Accuracy | 84.8 | | Unverified |
| 5 | Gopher-280B (few-shot, k=5) | Accuracy | 84.2 | | Unverified |
| 6 | Gopher-280B (few-shot, k=5) | Accuracy | 84.1 | | Unverified |
| 7 | Gopher-280B (few-shot, k=5) | Accuracy | 83.9 | | Unverified |
| 8 | Gopher-280B (few-shot, k=5) | Accuracy | 83.3 | | Unverified |
| 9 | Gopher-280B (few-shot, k=5) | Accuracy | 81.8 | | Unverified |
| 10 | Gopher-280B (few-shot, k=5) | Accuracy | 81 | | Unverified |