SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T
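As a rough illustration of how items in this format might be scored, the sketch below parses the `input` / `choice` / `answer` lines and computes plain accuracy over true/false predictions. The names `Item`, `parse_items`, and `accuracy` are hypothetical helpers introduced here, not part of BIG-bench or its tooling, and accuracy is assumed as the metric since the page does not state one.

```python
from dataclasses import dataclass

# The two example items shown above, in the same line-oriented layout.
EXAMPLE = """
input: The daddy longlegs spider is the most venomous spider in the world.
choice: T
choice: F
answer: F

input: Karl Benz is correctly credited with the invention of the first modern automobile.
choice: T
choice: F
answer: T
"""


@dataclass
class Item:
    """One statement with its allowed choices and gold answer (hypothetical type)."""
    statement: str
    choices: list[str]
    answer: str


def parse_items(text: str) -> list[Item]:
    """Parse 'input:' / 'choice:' / 'answer:' lines into Item records."""
    items: list[Item] = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        # Split on the first colon only, so colons inside a statement survive.
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if key == "input":
            current = Item(statement=value, choices=[], answer="")
            items.append(current)
        elif current is not None and key == "choice":
            current.choices.append(value)
        elif current is not None and key == "answer":
            current.answer = value
    return items


def accuracy(items: list[Item], predictions: list[str]) -> float:
    """Fraction of predictions that exactly match the gold answer."""
    if len(items) != len(predictions):
        raise ValueError("need exactly one prediction per item")
    if not items:
        return 0.0
    correct = sum(pred == item.answer for item, pred in zip(items, predictions))
    return correct / len(items)


if __name__ == "__main__":
    parsed = parse_items(EXAMPLE)
    # A model that always answers "F" gets the first item right and the second wrong.
    print(accuracy(parsed, ["F", "F"]))
```

Exact string matching against `T` / `F` is the simplest choice here; a real evaluation harness would likely also normalize model output before comparing.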

Source: BIG-bench

Papers

Showing 76-100 of 161 papers

| Title | Status | Hype |
| --- | --- | --- |
| Towards a Rigorous Analysis of Mutual Information in Contrastive Learning | | 0 |
| Using language models in the implicit automated assessment of mathematical short answer items | | 0 |
| Characterizing Information Seeking Events in Health-Related Social Discourse | | 0 |
| Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational Data | Code | 0 |
| Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning | Code | 0 |
| Parting with Misconceptions about Learning-based Vehicle Motion Planning | Code | 2 |
| Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt Wording | Code | 0 |
| Dear XAI Community, We Need to Talk! Fundamental Misconceptions in Current XAI Research | | 0 |
| Justices for Information Bottleneck Theory | | 0 |
| Clarifying System 1 & 2 through the Common Model of Cognition | | 0 |
| Disproving XAI Myths with Formal Methods -- Initial Results | | 0 |
| Challenges and Trends in User Trust Discourse in AI | | 0 |
| Human-centered trust framework: An HCI perspective | | 0 |
| On the lifting and reconstruction of nonlinear systems with multiple invariant sets | | 0 |
| Demystifying Misconceptions in Social Bots Research | | 0 |
| Towards Democratizing Joint-Embedding Self-Supervised Learning | Code | 2 |
| Succinct Representations for Concepts | | 0 |
| Response to Moffat's Comment on "Towards Meaningful Statements in IR Evaluation: Mapping Evaluation Measures to Interval Scales" | | 0 |
| Deep learning applied to computational mechanics: A comprehensive review, state of the art, and the classics | | 0 |
| Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy | | 0 |
| The Monitor Model and its Misconceptions: A Clarification | | 0 |
| Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut | Code | 1 |
| Machine Learning Students Overfit to Overfitting | | 0 |
| Dynamics and triggers of misinformation on vaccines | | 0 |
| Response to: Significance and stability of deep learning-based identification of subtypes within major psychiatric disorders. Molecular Psychiatry (2022) | | 0 |

No leaderboard results yet.