Misconceptions

Measures whether a model can distinguish popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T
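
The example format above can be read and scored mechanically. The following is a minimal sketch, not the BIG-bench implementation: the names `Item`, `parse_items`, and `accuracy` are hypothetical, and it assumes each record starts with an `input:` line followed by its `choice:` lines and one `answer:` line, as shown in the example.

```python
from dataclasses import dataclass, field
from typing import List

# The two example records from above, copied verbatim.
EXAMPLE = """
input: The daddy longlegs spider is the most venomous spider in the world.
choice: T
choice: F
answer: F

input: Karl Benz is correctly credited with the invention of the first modern automobile.
choice: T
choice: F
answer: T
"""


@dataclass
class Item:
    """One true/false statement with its allowed choices and gold answer."""
    statement: str
    choices: List[str] = field(default_factory=list)
    answer: str = ""


def parse_items(text: str) -> List[Item]:
    """Parse 'input / choice / answer' records (format assumed from the example)."""
    items: List[Item] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        # Split on the first colon only, so colons inside a statement survive.
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if key == "input":
            items.append(Item(statement=value))
        elif key == "choice" and items:
            items[-1].choices.append(value)
        elif key == "answer" and items:
            items[-1].answer = value
    return items


def accuracy(items: List[Item], predictions: List[str]) -> float:
    """Fraction of predictions that exactly match the gold answer."""
    if len(items) != len(predictions):
        raise ValueError("need exactly one prediction per item")
    if not items:
        return 0.0
    correct = sum(pred == item.answer for item, pred in zip(items, predictions))
    return correct / len(items)


if __name__ == "__main__":
    items = parse_items(EXAMPLE)
    # A model that answers "F" to everything gets one of the two items right.
    print(accuracy(items, ["F", "F"]))
```

Exact-match accuracy is used here only because the answers are single letters; it is an illustrative choice, not a metric stated by the source.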

Source: BIG-bench

Papers

Showing 51–100 of 161 papers

Title	Date	Tasks	Status	Hype
A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice	Apr 25, 2024	Misconceptions	Code Available	0
Student Data Paradox and Curious Case of Single Student-Tutor Model: Regressive Side Effects of Training LLMs for Personalized Learning	Apr 23, 2024	ARC, Common Sense Reasoning	Unverified	0
Differential contributions of machine learning and statistical analysis to language and cognitive sciences	Apr 22, 2024	Misconceptions	Unverified	0
Improving Automated Distractor Generation for Math Multiple-choice Questions with Overgenerate-and-rank	Apr 19, 2024	Distractor Generation, Math	Unverified	0
Axiomatic modeling of fixed proportion technologies	Apr 18, 2024	Misconceptions	Unverified	0
Breaking Boundaries: A Chronology with Future Directions of Women in Exercise Physiology Research, Centred on Pregnancy	Apr 12, 2024	Misconceptions	Unverified	0
SoK: On Gradient Leakage in Federated Learning	Apr 8, 2024	Federated Learning, Misconceptions	Unverified	0
Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models	Apr 2, 2024	Distractor Generation, In-Context Learning	Code Available	0
Prompting the E-Brushes: Users as Authors in Generative AI	Mar 25, 2024	Misconceptions	Unverified	0
Noise-powered Multi-modal Knowledge Graph Representation Framework	Mar 11, 2024	Entity Alignment, Knowledge Graph Completion	Code Available	1
The pitfalls of next-token prediction	Mar 11, 2024	Mamba, Misconceptions	Code Available	2
WatChat: Explaining perplexing programs by debugging mental models	Mar 8, 2024	counterfactual, Language Modelling	Code Available	0
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning	Mar 2, 2024	Math, Misconceptions	Code Available	1
The Essential Role of Causality in Foundation World Models for Embodied AI	Feb 6, 2024	Misconceptions	Unverified	0
Clarify: Improving Model Robustness With Natural Language Corrections	Feb 6, 2024	Misconceptions, model	Code Available	0
Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias	Jan 26, 2024	Decision Making, Diagnostic	Unverified	0
Distortions in Judged Spatial Relations in Large Language Models	Jan 8, 2024	Misconceptions, Spatial Reasoning	Unverified	0
Finnish 5th and 6th graders' misconceptions about Artificial Intelligence	Nov 28, 2023	Misconceptions	Unverified	0
Uncertainty Quantification in Machine Learning for Biosignal Applications -- A Review	Nov 15, 2023	Diagnostic, EEG	Unverified	0
Fine-tuning Language Models for Factuality	Nov 14, 2023	Fact Checking, Misconceptions	Unverified	0
Large Language Models for In-Context Student Modeling: Synthesizing Student's Behavior in Visual Programming	Oct 15, 2023	Misconceptions	Code Available	0
Unraveling the Single Tangent Space Fallacy: An Analysis and Clarification for Applying Riemannian Geometry in Robot Learning	Oct 11, 2023	Misconceptions	Unverified	0
Can Large Language Models Provide Security & Privacy Advice? Measuring the Ability of LLMs to Refute Misconceptions	Oct 3, 2023	Misconceptions, Multiple-choice	Code Available	0
Novice Learner and Expert Tutor: Evaluating Math Reasoning Abilities of Large Language Models with Misconceptions	Oct 3, 2023	Math, Mathematical Reasoning	Unverified	0
EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning	Sep 16, 2023	Date Understanding, GSM8K	Code Available	0
Towards a Rigorous Analysis of Mutual Information in Contrastive Learning	Aug 30, 2023	Contrastive Learning, Misconceptions	Unverified	0
Using language models in the implicit automated assessment of mathematical short answer items	Aug 21, 2023	Misconceptions	Unverified	0
Characterizing Information Seeking Events in Health-Related Social Discourse	Aug 17, 2023	Misconceptions	Unverified	0
Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational Data	Aug 7, 2023	Math, Misconceptions	Code Available	0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning	Aug 7, 2023	In-Context Learning, Math	Code Available	0
Parting with Misconceptions about Learning-based Vehicle Motion Planning	Jun 13, 2023	Misconceptions, Motion Planning	Code Available	2
Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt Wording	Jun 9, 2023	Misconceptions	Code Available	0
Dear XAI Community, We Need to Talk! Fundamental Misconceptions in Current XAI Research	Jun 7, 2023	Explainable Artificial Intelligence (XAI), Misconceptions	Unverified	0
Justices for Information Bottleneck Theory	May 19, 2023	Misconceptions	Unverified	0
Clarifying System 1 & 2 through the Common Model of Cognition	May 18, 2023	Misconceptions	Unverified	0
Disproving XAI Myths with Formal Methods -- Initial Results	May 13, 2023	Explainable artificial intelligence, Explainable Artificial Intelligence (XAI)	Unverified	0
Challenges and Trends in User Trust Discourse in AI	May 5, 2023	Misconceptions	Unverified	0
Human-centered trust framework: An HCI perspective	May 5, 2023	Misconceptions	Unverified	0
On the lifting and reconstruction of nonlinear systems with multiple invariant sets	Apr 24, 2023	Misconceptions	Unverified	0
Demystifying Misconceptions in Social Bots Research	Mar 30, 2023	Misconceptions, Misinformation	Unverified	0
Towards Democratizing Joint-Embedding Self-Supervised Learning	Mar 3, 2023	Data Augmentation, Misconceptions	Code Available	2
Succinct Representations for Concepts	Mar 1, 2023	Misconceptions	Unverified	0
Response to Moffat's Comment on "Towards Meaningful Statements in IR Evaluation: Mapping Evaluation Measures to Interval Scales"	Dec 22, 2022	Misconceptions	Unverified	0
Deep learning applied to computational mechanics: A comprehensive review, state of the art, and the classics	Dec 18, 2022	Gaussian Processes, Misconceptions	Unverified	0
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy	Dec 17, 2022	Misconceptions, Object	Unverified	0
The Monitor Model and its Misconceptions: A Clarification	Oct 25, 2022	Misconceptions, model	Unverified	0
Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut	Oct 2, 2022	Combinatorial Optimization, Graph Neural Network	Code Available	1
Machine Learning Students Overfit to Overfitting	Sep 7, 2022	Misconceptions	Unverified	0
Dynamics and triggers of misinformation on vaccines	Jul 25, 2022	Misconceptions, Misinformation	Unverified	0
Response to: Significance and stability of deep learning-based identification of subtypes within major psychiatric disorders. Molecular Psychiatry (2022)	Jun 10, 2022	BIG-bench Machine Learning, Misconceptions	Unverified	0

No leaderboard results yet.