SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 301350 of 399 papers

TitleStatusHype
Autonomous Intelligent Software Development0
WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language ModelsCode0
Knowledge-aware Neural Collective Matrix Factorization for Cross-domain Recommendation0
Connecting a French Dictionary from the Beginning of the 20th Century to WikidataCode0
Comprehensive Fair Meta-learned Recommender SystemCode0
SciDeBERTa: Learning DeBERTa for Science Technology Documents and Fine-Tuning Information Extraction TasksCode0
Task-Driven and Experience-Based Question Answering Corpus for In-Home Robot Application in the House3D Virtual EnvironmentCode0
Laughter During Cooperative and Competitive Games0
Low Resource Style Transfer via Domain Adaptive Meta Learning0
PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking0
Knowledgebra: An Algebraic Learning Framework for Knowledge Graph0
TOV: The Original Vision Model for Optical Remote Sensing Image Understanding via Self-supervised Learning0
Hierarchical Inductive Transfer for Continual Dialogue Learning0
KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models0
ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints0
TURNER: The Uncertainty-based Retrieval Framework for Chinese NER0
Low Resource Style Transfer via Domain Adaptive Meta Learning0
Knowledge Matters: Radiology Report Generation with General and Specific Knowledge0
Applying SoftTriple Loss for Supervised Language Model Fine Tuning0
Hierarchical Inductive Transfer for Continual Dialogue Learning0
DAML-ST5: Low Resource Style Transfer via Domain Adaptive Meta Learning0
GFDC: Graph Function Dependence for Logically Consistent Dialogue Response Beyond Persona Data0
KALA: Knowledge-Augmented Language Model Adaptation0
Transformer Based Bengali Chatbot Using General Knowledge Dataset0
Successive POI Recommendation via Brain-inspired Spatiotemporal Aware Representation0
An Adaptive Deep Learning Framework for Day-ahead Forecasting of Photovoltaic Power Generation0
Commonsense Knowledge in Word Associations and ConceptNetCode0
Teaching Uncertainty Quantification in Machine Learning through Use Cases0
Low-Resource Adaptation of Open-Domain Generative Chatbots0
Knowledge Distillation via Instance-level Sequence Learning0
Exploiting Adapters for Cross-lingual Low-resource Speech RecognitionCode0
Explainable Hierarchical Imitation Learning for Robotic Drink Pouring0
Meta-Inductive Node Classification across Graphs0
Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer0
One to Many: Adaptive Instrument Segmentation via Meta Learning and Dynamic Online Adaptation in Robotic Surgical Video0
Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation0
Towards Knowledge-Augmented Visual Question AnsweringCode0
Transfer learning of chaotic systems0
Tencent AI Lab Machine Translation Systems for WMT20 Chat Translation Task0
Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization0
Learning to Learn Variational Semantic MemoryCode0
Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning0
Patching as Translation: the Data and the MetaphorCode0
An Energy Ontology for Global City Indicators (ISO 37120)0
Domain Specific, Semi-Supervised Transfer Learning for Medical Imaging0
What's a Good Prediction? Challenges in evaluating an agent's knowledge0
What Does My QA Model Know? Devising Controlled Probes using Expert KnowledgeCode0
Acquiring Knowledge from Pre-trained Model to Neural Machine Translation0
Joint Embedding Learning of Educational Knowledge Graphs0
Improving Multi-label Emotion Classification by Integrating both General and Domain-specific Knowledge0
Show:102550
← PrevPage 7 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified