
Misconceptions

Measures whether a model can distinguish popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench
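
The true/false format shown in the example can be scored with plain accuracy. The following is a minimal sketch, assuming the items are available as text in the `input:` / `choice:` / `answer:` layout displayed above. That layout is an assumption taken from this page, not BIG-bench's own file format or evaluation harness, and the names `Item`, `parse_items`, and `accuracy` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Item:
    """One true/false statement (hypothetical container for this sketch)."""
    statement: str
    choices: list[str]
    answer: str


def parse_items(text: str) -> list[Item]:
    """Parse the `input:` / `choice:` / `answer:` layout shown on this page."""
    items: list[Item] = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("input:"):
            # A new statement starts a new item.
            current = Item(line[len("input:"):].strip(), [], "")
            items.append(current)
        elif line.startswith("choice:") and current is not None:
            current.choices.append(line[len("choice:"):].strip())
        elif line.startswith("answer:") and current is not None:
            current.answer = line[len("answer:"):].strip()
    return items


def accuracy(items: list[Item], predictions: list[str]) -> float:
    """Fraction of predictions that exactly match the labelled answer."""
    if len(items) != len(predictions):
        raise ValueError("need exactly one prediction per item")
    if not items:
        return 0.0
    correct = sum(1 for item, pred in zip(items, predictions) if pred == item.answer)
    return correct / len(items)


# The two example statements from this page.
EXAMPLE = """\
input: The daddy longlegs spider is the most venomous spider in the world.
choice: T
choice: F
answer: F

input: Karl Benz is correctly credited with the invention of the first modern automobile.
choice: T
choice: F
answer: T
"""

items = parse_items(EXAMPLE)
# A model that always answers "F" gets one of the two statements right.
print(accuracy(items, ["F", "F"]))  # 0.5
```

Exact-match accuracy is used here only because each item has a single labelled answer among two choices; a real evaluation might also normalize model output before comparing.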

Papers


Showing 51–75 of 161 papers

Title	Date	Tasks	Status	Score
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors	May 2, 2025	High School Physics, Misconceptions	Code Available	5
Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt Wording	Jun 9, 2023	Misconceptions	Code Available	5
When big data actually are low-rank, or entrywise approximation of certain function-generated matrices	Jul 3, 2024	Misconceptions	Code Available	5
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions	Jun 27, 2024	Distractor Generation, Math	Code Available	5
Classifier-Free Guidance is a Predictor-Corrector	Aug 16, 2024	Denoising, Misconceptions	Unverified	0
Enforcing Interpretability and its Statistical Impacts: Trade-offs between Accuracy and Interpretability	Oct 26, 2020	Binary Classification, Learning Theory	Unverified	0
Clarifying System 1 & 2 through the Common Model of Cognition	May 18, 2023	Misconceptions	Unverified	0
Emergent Abilities in Large Language Models: A Survey	Feb 28, 2025	In-Context Learning, Misconceptions	Unverified	0
Clarifying Misconceptions in COVID-19 Vaccine Sentiment and Stance Analysis and Their Implications for Vaccine Hesitancy Mitigation: A Systematic Review	Mar 23, 2025	Misconceptions, Sentiment Analysis	Unverified	0
Dynamics and triggers of misinformation on vaccines	Jul 25, 2022	Misconceptions, Misinformation	Unverified	0
Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias	Jan 26, 2024	Decision Making, Diagnostic	Unverified	0
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology	Nov 5, 2024	Math, Misconceptions	Unverified	0
Analyzing Factors Influencing Driver Willingness to Accept Advanced Driver Assistance Systems	Feb 23, 2025	Misconceptions	Unverified	0
Distortions in Judged Spatial Relations in Large Language Models	Jan 8, 2024	Misconceptions, Spatial Reasoning	Unverified	0
Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation	Dec 14, 2024	Misconceptions	Unverified	0
Disproving XAI Myths with Formal Methods -- Initial Results	May 13, 2023	Explainable artificial intelligence, Explainable Artificial Intelligence (XAI)	Unverified	0
Discounted Reinforcement Learning Is Not an Optimization Problem	Oct 4, 2019	Misconceptions, reinforcement-learning	Unverified	0
Characterizing Information Seeking Events in Health-Related Social Discourse	Aug 17, 2023	Misconceptions	Unverified	0
Differential contributions of machine learning and statistical analysis to language and cognitive sciences	Apr 22, 2024	Misconceptions	Unverified	0
Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge	Jul 23, 2020	Diagnostic, Misconceptions	Unverified	0
Challenges and Trends in User Trust Discourse in AI	May 5, 2023	Misconceptions	Unverified	0
Developer Perspectives on Licensing and Copyright Issues Arising from Generative AI for Software Development	Nov 16, 2024	Misconceptions, Survey	Unverified	0
A Thematic Framework for Analyzing Large-scale Self-reported Social Media Data on Opioid Use Disorder Treatment Using Buprenorphine Product	Oct 2, 2024	Misconceptions	Unverified	0
A Graphical Approach to State Variable Selection in Off-policy Learning	Jan 1, 2025	Causal Inference, Dimensionality Reduction	Unverified	0
A clarification of misconceptions, myths and desired status of artificial intelligence	Aug 3, 2020	BIG-bench Machine Learning, Misconceptions	Unverified	0


No leaderboard results yet.