SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench
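The two-choice format shown above can be sketched as a simple exact-match evaluation. This is a minimal illustration only: the record layout and the `predict` callback are assumptions for the sketch, not the BIG-bench API.

```python
# Minimal sketch (assumed format) of scoring a model on the two-choice
# Misconceptions examples above. `predict` is a hypothetical stand-in
# for a real model call that returns one of the offered choices.

EXAMPLES = [
    {"input": "The daddy longlegs spider is the most venomous spider in the world.",
     "choices": ["T", "F"], "answer": "F"},
    {"input": "Karl Benz is correctly credited with the invention of the first modern automobile.",
     "choices": ["T", "F"], "answer": "T"},
]

def accuracy(predict, examples):
    """Fraction of examples where the model's chosen option matches the answer."""
    correct = sum(predict(ex["input"], ex["choices"]) == ex["answer"]
                  for ex in examples)
    return correct / len(examples)

# A trivial baseline that always answers "T" gets 1 of 2 on these examples.
print(accuracy(lambda text, choices: "T", EXAMPLES))  # 0.5
```

Because the answers here are balanced between T and F, any constant-answer baseline scores 0.5, which is why the task measures discernment rather than guessing.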

Papers

Showing 51–100 of 161 papers

Title | Status | Hype
WatChat: Explaining perplexing programs by debugging mental models | Code | 0
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research | Code | 0
When big data actually are low-rank, or entrywise approximation of certain function-generated matrices | Code | 0
Zero Shot Learning for Code Education: Rubric Sampling with Deep Learning Inference | Code | 0
The Oversmoothing Fallacy: A Misguided Narrative in GNN Research | | 0
Dynamics and triggers of misinformation on vaccines | | 0
An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | | 0
Emergent Abilities in Large Language Models: A Survey | | 0
Characterizing Information Seeking Events in Health-Related Social Discourse | | 0
The kernel perspective on dynamic mode decomposition | | 0
Enforcing Interpretability and its Statistical Impacts: Trade-offs between Accuracy and Interpretability | | 0
Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias | | 0
Challenges and Trends in User Trust Discourse in AI | | 0
The Singularity Controversy, Part I: Lessons Learned and Open Questions: Conclusions from the Battle on the Legitimacy of the Debate | | 0
Toward In-Context Teaching: Adapting Examples to Students' Misconceptions | | 0
Can a Hallucinating Model help in Reducing Human "Hallucination"? | | 0
Breaking Boundaries: A Chronology with Future Directions of Women in Exercise Physiology Research, Centred on Pregnancy | | 0
Fine-tuning Language Models for Factuality | | 0
Finnish 5th and 6th graders' misconceptions about Artificial Intelligence | | 0
Formalising Anti-Discrimination Law in Automated Decision Systems | | 0
Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact | | 0
On Proximity and Structural Role-based Embeddings in Networks: Misconceptions, Techniques, and Applications | | 0
From Intuition to Understanding: Using AI Peers to Overcome Physics Misconceptions | | 0
From Random to Regular: Variation in the Patterning of Retinal Mosaics | | 0
Towards a Rigorous Analysis of Mutual Information in Contrastive Learning | | 0
Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction | | 0
Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning | | 0
Analyzing Factors Influencing Driver Willingness to Accept Advanced Driver Assistance Systems | | 0
Toward Semi-Automatic Misconception Discovery Using Code Embeddings | | 0
LLM-based Cognitive Models of Students with Misconceptions | | 0
A Graphical Approach to State Variable Selection in Off-policy Learning | | 0
How Useful are Gradients for OOD Detection Really? | | 0
Human-centered trust framework: An HCI perspective | | 0
Humans can learn to detect AI-generated texts, or at least learn when they can't | | 0
Identifying science concepts and student misconceptions in an interactive essay writing tutor | | 0
Improving Automated Distractor Generation for Math Multiple-choice Questions with Overgenerate-and-rank | | 0
Biometric recognition: why not massively adopted yet? | | 0
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy | | 0
Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing | | 0
Justices for Information Bottleneck Theory | | 0
Knowledge Tracing in Programming Education Integrating Students' Questions | | 0
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts | | 0
Laplace Redux - Effortless Bayesian Deep Learning | | 0
A close-up comparison of the misclassification error distance and the adjusted Rand index for external clustering evaluation | | 0
Learnable: Theory vs Applications | | 0
A clarification of misconceptions, myths and desired status of artificial intelligence | | 0
Limitations of Deep Neural Networks: a discussion of G. Marcus' critical appraisal of deep learning | | 0
Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation | | 0
LLM Library Learning Fails: A LEGO-Prover Case Study | | 0
Machine Learning Students Overfit to Overfitting | | 0
Page 2 of 4

No leaderboard results yet.