SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 5160 of 161 papers

TitleStatusHype
Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing0
Disproving XAI Myths with Formal Methods -- Initial Results0
Distortions in Judged Spatial Relations in Large Language Models0
Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation0
Dynamics and triggers of misinformation on vaccines0
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology0
Emergent Abilities in Large Language Models: A Survey0
A close-up comparison of the misclassification error distance and the adjusted Rand index for external clustering evaluation0
Clarifying System 1 & 2 through the Common Model of Cognition0
Automated Identification of Logical Errors in Programs: Advancing Scalable Analysis of Student Misconceptions0
Show:102550
← PrevPage 6 of 17Next →

No leaderboard results yet.