SOTAVerified

General Knowledge

This task aims to evaluate the ability of a model to answer general-knowledge questions.

Source: BIG-bench

Papers

Showing 201250 of 399 papers

TitleStatusHype
KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration0
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling0
Differentially Private Distributed Learning for Language Modeling Tasks0
Key Factors Affecting European Reactions to AI in European Full and Flawed Democracies0
Deep Prompt Multi-task Network for Abuse Language Detection0
K-Link: Knowledge-Link Graph from LLMs for Enhanced Representation Learning in Multivariate Time-Series Data0
KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models0
Knowledge-aware Neural Collective Matrix Factorization for Cross-domain Recommendation0
Knowledgebra: An Algebraic Learning Framework for Knowledge Graph0
Knowledge Completion for Generics using Guided Tensor Factorization0
Transaction Logic with (Complex) Events0
Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images0
Knowledge Distillation via Instance-level Sequence Learning0
Data structuring for the ontological modelling of wind energy systems0
Transferable Natural Language Interface to Structured Queries aided by Adversarial Generation0
Knowledge Matters: Radiology Report Generation with General and Specific Knowledge0
DAML-ST5: Low Resource Style Transfer via Domain Adaptive Meta Learning0
Knowledge Representation and Extraction at Scale0
KnowRA: Knowledge Retrieval Augmented Method for Document-level Relation Extraction with Comprehensive Reasoning Abilities0
CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation0
Large Language Models as a Tool for Mining Object Knowledge0
Latte: Transfering LLMs` Latent-level Knowledge for Few-shot Tabular Learning0
Laughter During Cooperative and Competitive Games0
T-Norms Driven Loss Functions for Machine Learning0
Learning from Natural Language Explanations for Generalizable Entity Matching0
Learning Knowledge Graphs for Question Answering through Conversational Dialog0
Learning Physical Common Sense as Knowledge Graph Completion via BERT Data Augmentation and Constrained Tucker Factorization0
Transfer learning of chaotic systems0
Learning to Model the World with Language0
Controversy Rules - Discovering Regions Where Classifiers (Dis-)Agree Exceptionally0
Learning to Specialize with Knowledge Distillation for Visual Question Answering0
Transformer Based Bengali Chatbot Using General Knowledge Dataset0
Learning Unknown Spoof Prompts for Generalized Face Anti-Spoofing Using Only Real Face Images0
Context and Humor: Understanding Amul advertisements of India0
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation0
Leveraging Large Language Models for enhanced personalised user experience in Smart Homes0
LLM4WM: Adapting LLM for Wireless Multi-Tasking0
Constructing Enhanced Mutual Information for Online Class-Incremental Learning0
ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis0
LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models0
Low-Resource Adaptation of Open-Domain Generative Chatbots0
Low Resource Style Transfer via Domain Adaptive Meta Learning0
Low Resource Style Transfer via Domain Adaptive Meta Learning0
TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation0
MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation0
Mars: Situated Inductive Reasoning in an Open-World Environment0
Composite Learning Units: Generalized Learning Beyond Parameter Updates to Transform LLMs into Adaptive Reasoners0
Meta-Inductive Node Classification across Graphs0
Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning0
Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text0
Show:102550
← PrevPage 5 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Chinchilla-70B (few-shot, k=5)Accuracy94.3Unverified
2Gopher-280B (few-shot, k=5)Accuracy93.9Unverified
3Chinchilla-70B (few-shot, k=5)Accuracy 85.7Unverified
4Gopher-280B (few-shot, k=5)Accuracy 84.8Unverified
5Gopher-280B (few-shot, k=5)Accuracy84.2Unverified
6Gopher-280B (few-shot, k=5)Accuracy 84.1Unverified
7Gopher-280B (few-shot, k=5)Accuracy 83.9Unverified
8Gopher-280B (few-shot, k=5)Accuracy83.3Unverified
9Gopher-280B (few-shot, k=5)Accuracy 81.8Unverified
10Gopher-280B (few-shot, k=5)Accuracy 81Unverified