Misconceptions

Measures whether a model can distinguish popular misconceptions from the truth. Each item is a statement that the model labels true (T) or false (F).

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T
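As a rough illustration of how items in this format could be scored, the sketch below parses the `input` / `choice` / `answer` layout shown above and computes plain accuracy. It is not the BIG-bench evaluation harness: the names `Item`, `parse_items`, `accuracy`, and the `always_false` baseline are hypothetical and introduced only for this example.

```python
from dataclasses import dataclass


@dataclass
class Item:
    statement: str      # text after "input:"
    choices: list[str]  # values after each "choice:" line
    answer: str         # value after "answer:"


def parse_items(text: str) -> list[Item]:
    """Parse blocks of 'input:', 'choice:' and 'answer:' lines into items."""
    items: list[Item] = []
    for line in text.splitlines():
        # Split on the first colon only, so colons inside a statement survive.
        key, sep, value = line.strip().partition(":")
        if not sep:
            continue  # blank line or a line without a key
        value = value.strip()
        if key == "input":
            items.append(Item(value, [], ""))
        elif key == "choice" and items:
            items[-1].choices.append(value)
        elif key == "answer" and items:
            items[-1].answer = value
    return items


def accuracy(items, predict) -> float:
    """Fraction of items where predict(statement, choices) equals the answer."""
    if not items:
        return 0.0
    correct = sum(predict(it.statement, it.choices) == it.answer for it in items)
    return correct / len(items)


EXAMPLE = """
input: The daddy longlegs spider is the most venomous spider in the world.
choice: T
choice: F
answer: F

input: Karl Benz is correctly credited with the invention of the first modern automobile.
choice: T
choice: F
answer: T
"""

items = parse_items(EXAMPLE)


def always_false(statement, choices):
    # Trivial baseline standing in for a model: label every statement false.
    return "F"


print(accuracy(items, always_false))  # 0.5 on the two items above
```

A real evaluation would replace `always_false` with a call to the model under test. On a two-way T/F task, a constant baseline like this one is a useful floor to compare against.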

Source: BIG-bench

Papers


Showing 1–50 of 161 papers

Title	Date	Tasks	Status	Hype
Training Compute-Optimal Large Language Models	Mar 29, 2022	Anachronisms, Analogical Similarity	Code Available	6
Factuality Enhanced Language Models for Open-Ended Text Generation	Jun 9, 2022	Misconceptions, Sentence	Code Available	5
The pitfalls of next-token prediction	Mar 11, 2024	Mamba, Misconceptions	Code Available	2
Parting with Misconceptions about Learning-based Vehicle Motion Planning	Jun 13, 2023	Misconceptions, Motion Planning	Code Available	2
Towards Democratizing Joint-Embedding Self-Supervised Learning	Mar 3, 2023	Data Augmentation, Misconceptions	Code Available	2
Scaling Language Models: Methods, Analysis & Insights from Training Gopher	Dec 8, 2021	Abstract Algebra, Anachronisms	Code Available	2
Unveiling Contrastive Learning's Capability of Neighborhood Aggregation for Collaborative Filtering	Apr 14, 2025	Collaborative Filtering, Contrastive Learning	Code Available	1
Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs	Sep 24, 2024	Knowledge Tracing, Misconceptions	Code Available	1
Enhancing Knowledge Tracing with Concept Map and Response Disentanglement	Aug 23, 2024	Disentanglement, Knowledge Tracing	Code Available	1
Noise-powered Multi-modal Knowledge Graph Representation Framework	Mar 11, 2024	Entity Alignment, Knowledge Graph Completion	Code Available	1
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning	Mar 2, 2024	Math, Misconceptions	Code Available	1
Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut	Oct 2, 2022	Combinatorial Optimization, Graph Neural Network	Code Available	1
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs	Apr 30, 2022	Misconceptions, Question Generation	Code Available	1
TruthfulQA: Measuring How Models Mimic Human Falsehoods	Sep 8, 2021	Language Modeling, Language Modelling	Code Available	1
Back to the Drawing Board: A Critical Evaluation of Poisoning Attacks on Production Federated Learning	Aug 23, 2021	Federated Learning, Misconceptions	Code Available	1
Laplace Redux -- Effortless Bayesian Deep Learning	Jun 28, 2021	Deep Learning, Misconceptions	Code Available	1
Emergent Communication under Competition	Jan 25, 2021	Misconceptions	Code Available	1
A Tutorial on VAEs: From Bayes' Rule to Lossless Compression	Jun 18, 2020	Misconceptions	Code Available	1
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines	Jun 8, 2020	Misconceptions	Code Available	1
Re-Examining Linear Embeddings for High-Dimensional Bayesian Optimization	Jan 31, 2020	Bayesian Optimization, Misconceptions	Code Available	1
The Oversmoothing Fallacy: A Misguided Narrative in GNN Research	Jun 5, 2025	Misconceptions	Unverified	0
A Structured Unplugged Approach for Foundational AI Literacy in Primary Education	May 27, 2025	Logical Reasoning, Misconceptions	Code Available	0
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research	May 17, 2025	Misconceptions, scientific discovery	Code Available	0
Automated Identification of Logical Errors in Programs: Advancing Scalable Analysis of Student Misconceptions	May 16, 2025	Misconceptions	Unverified	0
Humans can learn to detect AI-generated texts, or at least learn when they can't	May 3, 2025	Misconceptions	Unverified	0
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors	May 2, 2025	High School Physics, Misconceptions	Code Available	0
LLM Library Learning Fails: A LEGO-Prover Case Study	Apr 3, 2025	Mathematical Reasoning, Misconceptions	Unverified	0
What is AI, what is it not, how we use it in physics and how it impacts... you	Apr 2, 2025	Anomaly Detection, Misconceptions	Unverified	0
From Intuition to Understanding: Using AI Peers to Overcome Physics Misconceptions	Apr 1, 2025	Misconceptions	Unverified	0
Clarifying Misconceptions in COVID-19 Vaccine Sentiment and Stance Analysis and Their Implications for Vaccine Hesitancy Mitigation: A Systematic Review	Mar 23, 2025	Misconceptions, Sentiment Analysis	Unverified	0
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation	Mar 12, 2025	counterfactual, Misconceptions	Code Available	0
Paths and Ambient Spaces in Neural Loss Landscapes	Mar 5, 2025	Misconceptions	Code Available	0
Emergent Abilities in Large Language Models: A Survey	Feb 28, 2025	In-Context Learning, Misconceptions	Unverified	0
Analyzing Factors Influencing Driver Willingness to Accept Advanced Driver Assistance Systems	Feb 23, 2025	Misconceptions	Unverified	0
The Imitation Game for Educational AI	Feb 21, 2025	Distractor Generation, Misconceptions	Unverified	0
Retrieval-augmented systems can be dangerous medical communicators	Feb 18, 2025	Misconceptions, Retrieval	Unverified	0
Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact	Feb 12, 2025	Misconceptions	Unverified	0
Knowledge Tracing in Programming Education Integrating Students' Questions	Jan 22, 2025	Knowledge Tracing, Misconceptions	Unverified	0
Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction	Jan 21, 2025	Distractor Generation, Misconceptions	Unverified	0
Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning	Jan 12, 2025	Misconceptions	Unverified	0
Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension	Jan 2, 2025	Misconceptions	Unverified	0
A Graphical Approach to State Variable Selection in Off-policy Learning	Jan 1, 2025	Causal Inference, Dimensionality Reduction	Unverified	0
Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation	Dec 14, 2024	Misconceptions	Unverified	0
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor	Dec 8, 2024	Misconceptions, Multiple-choice	Code Available	0
Developer Perspectives on Licensing and Copyright Issues Arising from Generative AI for Software Development	Nov 16, 2024	Misconceptions, Survey	Unverified	0
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology	Nov 5, 2024	Math, Misconceptions	Unverified	0
A Study on Characterization of Near-Field Sub-Regions For Phased-Array Antennas	Oct 23, 2024	Misconceptions	Unverified	0
LLM-based Cognitive Models of Students with Misconceptions	Oct 16, 2024	Misconceptions	Unverified	0
The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models	Oct 12, 2024	Misconceptions, Multiple-choice	Unverified	0
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts	Oct 11, 2024	Holdout Set, Misconceptions	Unverified	0



No leaderboard results yet.