Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 161 papers

Title	Date	Tasks	Status	Hype	Score
Training Compute-Optimal Large Language Models	Mar 29, 2022	AnachronismsAnalogical Similarity	CodeCode Available	6	5
Factuality Enhanced Language Models for Open-Ended Text Generation	Jun 9, 2022	MisconceptionsSentence	CodeCode Available	5	5
Parting with Misconceptions about Learning-based Vehicle Motion Planning	Jun 13, 2023	MisconceptionsMotion Planning	CodeCode Available	2	5
Towards Democratizing Joint-Embedding Self-Supervised Learning	Mar 3, 2023	Data AugmentationMisconceptions	CodeCode Available	2	5
Scaling Language Models: Methods, Analysis & Insights from Training Gopher	Dec 8, 2021	Abstract AlgebraAnachronisms	CodeCode Available	2	5
The pitfalls of next-token prediction	Mar 11, 2024	MambaMisconceptions	CodeCode Available	2	5
Enhancing Knowledge Tracing with Concept Map and Response Disentanglement	Aug 23, 2024	DisentanglementKnowledge Tracing	CodeCode Available	1	5
TruthfulQA: Measuring How Models Mimic Human Falsehoods	Sep 8, 2021	Language ModelingLanguage Modelling	CodeCode Available	1	5
Unveiling Contrastive Learning's Capability of Neighborhood Aggregation for Collaborative Filtering	Apr 14, 2025	Collaborative FilteringContrastive Learning	CodeCode Available	1	5
Emergent Communication under Competition	Jan 25, 2021	Misconceptions	CodeCode Available	1	5
Noise-powered Multi-modal Knowledge Graph Representation Framework	Mar 11, 2024	Entity AlignmentKnowledge Graph Completion	CodeCode Available	1	5
A Tutorial on VAEs: From Bayes' Rule to Lossless Compression	Jun 18, 2020	Misconceptions	CodeCode Available	1	5
Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs	Sep 24, 2024	Knowledge TracingMisconceptions	CodeCode Available	1	5
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs	Apr 30, 2022	MisconceptionsQuestion Generation	CodeCode Available	1	5
Back to the Drawing Board: A Critical Evaluation of Poisoning Attacks on Production Federated Learning	Aug 23, 2021	Federated LearningMisconceptions	CodeCode Available	1	5
Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut	Oct 2, 2022	Combinatorial OptimizationGraph Neural Network	CodeCode Available	1	5
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning	Mar 2, 2024	MathMisconceptions	CodeCode Available	1	5
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines	Jun 8, 2020	Misconceptions	CodeCode Available	1	5
Laplace Redux -- Effortless Bayesian Deep Learning	Jun 28, 2021	Deep LearningMisconceptions	CodeCode Available	1	5
Re-Examining Linear Embeddings for High-Dimensional Bayesian Optimization	Jan 31, 2020	Bayesian OptimizationMisconceptions	CodeCode Available	1	5
A Variational Inequality Perspective on Generative Adversarial Networks	Feb 28, 2018	Misconceptions	CodeCode Available	0	5
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors	May 2, 2025	High School PhysicsMisconceptions	CodeCode Available	0	5
Hindsight and Sequential Rationality of Correlated Play	Dec 10, 2020	counterfactualDecision Making	CodeCode Available	0	5
Can Large Language Models Provide Security & Privacy Advice? Measuring the Ability of LLMs to Refute Misconceptions	Oct 3, 2023	MisconceptionsMultiple-choice	CodeCode Available	0	5
From Solution Synthesis to Student Attempt Synthesis for Block-Based Visual Programming Tasks	May 3, 2022	MisconceptionsProgram Synthesis	CodeCode Available	0	5

Show:10 25 50

← PrevPage 1 of 7Next →

No leaderboard results yet.