
Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T
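The items above are binary true/false judgments with a gold label, so evaluation reduces to accuracy over the model's chosen option. A minimal sketch of how such items could be represented and scored (the `Item` class and `model_predict` callable are illustrative assumptions, not part of BIG-bench's actual harness):

```python
from dataclasses import dataclass

@dataclass
class Item:
    # Mirrors the example fields above: statement, choice set, gold label.
    input: str
    choices: tuple
    answer: str

def accuracy(items, model_predict):
    """Fraction of items where the model picks the gold choice.

    model_predict is a stand-in for a real model call that returns
    one element of item.choices.
    """
    correct = sum(model_predict(it.input, it.choices) == it.answer
                  for it in items)
    return correct / len(items)

items = [
    Item("The daddy longlegs spider is the most venomous spider in the world.",
         ("T", "F"), "F"),
    Item("Karl Benz is correctly credited with the invention of the first modern automobile.",
         ("T", "F"), "T"),
]

# A trivial baseline that always answers "T" gets one of these two items right.
always_true = lambda text, choices: "T"
print(accuracy(items, always_true))  # 0.5
```

A real submission would replace `always_true` with a call into the model under test; chance performance on this balanced two-choice format is 50%.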

Source: BIG-bench

Papers

Showing 101–150 of 161 papers

Title | Status | Hype
Reply to Garcia et al.: Common mistakes in measuring frequency dependent word characteristics | | 0
Analyzing Factors Influencing Driver Willingness to Accept Advanced Driver Assistance Systems | | 0
Response to Moffat's Comment on "Towards Meaningful Statements in IR Evaluation: Mapping Evaluation Measures to Interval Scales" | | 0
Response to: Significance and stability of deep learning-based identification of subtypes within major psychiatric disorders. Molecular Psychiatry (2022) | | 0
Retrieval-augmented systems can be dangerous medical communicators | | 0
Clarifying Misconceptions in COVID-19 Vaccine Sentiment and Stance Analysis and Their Implications for Vaccine Hesitancy Mitigation: A Systematic Review | | 0
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering | | 0
SoK: On Gradient Leakage in Federated Learning | | 0
Clarifying System 1 & 2 through the Common Model of Cognition | | 0
Succinct Representations for Concepts | | 0
The Bussgang Decomposition of Non-Linear Systems: Basic Theory and MIMO Extensions | | 0
The Essential Role of Causality in Foundation World Models for Embodied AI | | 0
The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe | | 0
The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models | | 0
The Imitation Game for Educational AI | | 0
Classifier-Free Guidance is a Predictor-Corrector | | 0
The Monitor Model and its Misconceptions: A Clarification | | 0
The Oversmoothing Fallacy: A Misguided Narrative in GNN Research | | 0
The Singularity Controversy, Part I: Lessons Learned and Open Questions: Conclusions from the Battle on the Legitimacy of the Debate | | 0
Toward In-Context Teaching: Adapting Examples to Students' Misconceptions | | 0
Towards a Rigorous Analysis of Mutual Information in Contrastive Learning | | 0
Toward Semi-Automatic Misconception Discovery Using Code Embeddings | | 0
LLM-based Cognitive Models of Students with Misconceptions | | 0
Uncertainty Quantification in Machine Learning for Biosignal Applications -- A Review | | 0
Understanding the Lexical Simplification Needs of Non-Native Speakers of English | | 0
Unraveling the Single Tangent Space Fallacy: An Analysis and Clarification for Applying Riemannian Geometry in Robot Learning | | 0
Using language models in the implicit automated assessment of mathematical short answer items | | 0
Zero Shot Learning for Code Education: Rubric Sampling with Deep Learning Inference | Code | 0
A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice | Code | 0
A Structured Unplugged Approach for Foundational AI Literacy in Primary Education | Code | 0
A Variational Inequality Perspective on Generative Adversarial Networks | Code | 0
A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics | Code | 0
Can Large Language Models Provide Security & Privacy Advice? Measuring the Ability of LLMs to Refute Misconceptions | Code | 0
Clarify: Improving Model Robustness With Natural Language Corrections | Code | 0
Collecting the Public Perception of AI and Robot Rights | Code | 0
Community detection in networks: A user guide | Code | 0
Design Challenges and Misconceptions in Neural Sequence Labeling | Code | 0
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Code | 0
EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning | Code | 0
End-to-End Annotator Bias Approximation on Crowdsourced Single-Label Sentiment Analysis | Code | 0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning | Code | 0
Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models | Code | 0
From Solution Synthesis to Student Attempt Synthesis for Block-Based Visual Programming Tasks | Code | 0
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors | Code | 0
Hindsight and Sequential Rationality of Correlated Play | Code | 0
How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions | Code | 0
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation | Code | 0
Large Language Models for In-Context Student Modeling: Synthesizing Student's Behavior in Visual Programming | Code | 0
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor | Code | 0
MalAlgoQA: Pedagogical Evaluation of Counterfactual Reasoning in Large Language Models and Implications for AI in Education | Code | 0

No leaderboard results yet.