SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench
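Items like the two above can be scored with a simple exact-match loop. The sketch below is illustrative only: the field names (`input`, `choices`, `answer`) mirror the example above and are not the exact BIG-bench task schema, and `always_agree` is a hypothetical baseline, not a real model.

```python
# Minimal sketch of scoring binary-choice misconception items.
# Field names mirror the example above; they are assumptions, not
# the exact BIG-bench JSON schema.

items = [
    {
        "input": "The daddy longlegs spider is the most venomous spider in the world.",
        "choices": ["T", "F"],
        "answer": "F",
    },
    {
        "input": "Karl Benz is correctly credited with the invention of the first modern automobile.",
        "choices": ["T", "F"],
        "answer": "T",
    },
]

def accuracy(predict, items):
    """Fraction of items where the predicted label matches the answer."""
    correct = sum(1 for item in items if predict(item) == item["answer"])
    return correct / len(items)

# A trivial baseline that agrees with every claim: on items built from
# popular misconceptions it should hover near chance.
def always_agree(item):
    return "T"

print(accuracy(always_agree, items))  # 0.5 on this two-item sample
```

A real evaluation would replace `always_agree` with a call to the model under test, e.g. comparing the model's likelihood of "T" versus "F" given the input claim.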

Papers

Showing 1–50 of 161 papers (page 1 of 4)

Title | Status | Hype
Training Compute-Optimal Large Language Models | Code | 6
Factuality Enhanced Language Models for Open-Ended Text Generation | Code | 5
Towards Democratizing Joint-Embedding Self-Supervised Learning | Code | 2
Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Code | 2
Parting with Misconceptions about Learning-based Vehicle Motion Planning | Code | 2
The pitfalls of next-token prediction | Code | 2
Unveiling Contrastive Learning's Capability of Neighborhood Aggregation for Collaborative Filtering | Code | 1
A Tutorial on VAEs: From Bayes' Rule to Lossless Compression | Code | 1
Re-Examining Linear Embeddings for High-Dimensional Bayesian Optimization | Code | 1
Emergent Communication under Competition | Code | 1
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines | Code | 1
Back to the Drawing Board: A Critical Evaluation of Poisoning Attacks on Production Federated Learning | Code | 1
Noise-powered Multi-modal Knowledge Graph Representation Framework | Code | 1
Laplace Redux – Effortless Bayesian Deep Learning | Code | 1
TruthfulQA: Measuring How Models Mimic Human Falsehoods | Code | 1
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning | Code | 1
Enhancing Knowledge Tracing with Concept Map and Response Disentanglement | Code | 1
Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut | Code | 1
Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs | Code | 1
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs | Code | 1
A Variational Inequality Perspective on Generative Adversarial Networks | Code | 0
Clarify: Improving Model Robustness With Natural Language Corrections | Code | 0
Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning | Code | 0
Resolving conceptual issues in Modern Coexistence Theory | Code | 0
Can Large Language Models Provide Security & Privacy Advice? Measuring the Ability of LLMs to Refute Misconceptions | Code | 0
Design Challenges and Misconceptions in Neural Sequence Labeling | Code | 0
Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational Data | Code | 0
Pay attention to your loss: understanding misconceptions about 1-Lipschitz neural networks | Code | 0
Not All Claims are Created Equal: Choosing the Right Statistical Approach to Assess Hypotheses | Code | 0
Large Language Models for In-Context Student Modeling: Synthesizing Student's Behavior in Visual Programming | Code | 0
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor | Code | 0
How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions | Code | 0
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors | Code | 0
A Structured Unplugged Approach for Foundational AI Literacy in Primary Education | Code | 0
Hindsight and Sequential Rationality of Correlated Play | Code | 0
Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models | Code | 0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning | Code | 0
End-to-End Annotator Bias Approximation on Crowdsourced Single-Label Sentiment Analysis | Code | 0
Community detection in networks: A user guide | Code | 0
A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice | Code | 0
EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning | Code | 0
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation | Code | 0
A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics | Code | 0
Collecting the Public Perception of AI and Robot Rights | Code | 0
MalAlgoQA: Pedagogical Evaluation of Counterfactual Reasoning in Large Language Models and Implications for AI in Education | Code | 0
Deep Curvature Suite | Code | 0
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Code | 0
From Solution Synthesis to Student Attempt Synthesis for Block-Based Visual Programming Tasks | Code | 0
Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt Wording | Code | 0
Paths and Ambient Spaces in Neural Loss Landscapes | Code | 0

No leaderboard results yet.