Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 161 papers

Title	Date	Tasks	Status
Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction	Jan 21, 2025	Distractor GenerationMisconceptions	—Unverified
Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning	Jan 12, 2025	Misconceptions	—Unverified
Analyzing Factors Influencing Driver Willingness to Accept Advanced Driver Assistance Systems	Feb 23, 2025	Misconceptions	—Unverified
Toward Semi-Automatic Misconception Discovery Using Code Embeddings	Mar 7, 2021	Code ClassificationMisconceptions	—Unverified
LLM-based Cognitive Models of Students with Misconceptions	Oct 16, 2024	Misconceptions	—Unverified
A Graphical Approach to State Variable Selection in Off-policy Learning	Jan 1, 2025	Causal InferenceDimensionality Reduction	—Unverified
How Useful are Gradients for OOD Detection Really?	May 20, 2022	Computational EfficiencyMisconceptions	—Unverified
Human-centered trust framework: An HCI perspective	May 5, 2023	Misconceptions	—Unverified
Humans can learn to detect AI-generated texts, or at least learn when they can't	May 3, 2025	Misconceptions	—Unverified
Identifying science concepts and student misconceptions in an interactive essay writing tutor	Jun 1, 2012	Information RetrievalMisconceptions	—Unverified
Improving Automated Distractor Generation for Math Multiple-choice Questions with Overgenerate-and-rank	Apr 19, 2024	Distractor GenerationMath	—Unverified
Biometric recognition: why not massively adopted yet?	Feb 23, 2022	Misconceptions	—Unverified
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy	Dec 17, 2022	MisconceptionsObject	—Unverified
Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing	Apr 20, 2021	EthicsMisconceptions	—Unverified
Justices for Information Bottleneck Theory	May 19, 2023	Misconceptions	—Unverified
Knowledge Tracing in Programming Education Integrating Students' Questions	Jan 22, 2025	Knowledge TracingMisconceptions	—Unverified
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts	Oct 11, 2024	Holdout SetMisconceptions	—Unverified
Laplace Redux - Effortless Bayesian Deep Learning	May 21, 2021	Deep LearningMisconceptions	—Unverified
A close-up comparison of the misclassification error distance and the adjusted Rand index for external clustering evaluation	Jul 26, 2019	ClusteringMisconceptions	—Unverified
Learnable: Theory vs Applications	Jul 27, 2018	BIG-bench Machine LearningMisconceptions	—Unverified
A clarification of misconceptions, myths and desired status of artificial intelligence	Aug 3, 2020	BIG-bench Machine LearningMisconceptions	—Unverified
Limitations of Deep Neural Networks: a discussion of G. Marcus' critical appraisal of deep learning	Dec 22, 2020	Autonomous VehiclesDeep Learning	—Unverified
Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation	Oct 8, 2024	Dialogue GenerationHallucination	—Unverified
LLM Library Learning Fails: A LEGO-Prover Case Study	Apr 3, 2025	Mathematical ReasoningMisconceptions	—Unverified
Machine Learning Students Overfit to Overfitting	Sep 7, 2022	Misconceptions	—Unverified

Show:10 25 50

← PrevPage 4 of 7Next →

No leaderboard results yet.