SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench
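
The true/false format shown in the example can be scored with plain accuracy. The sketch below is illustrative only: the `Item` class, the `accuracy` helper, and the `always_true` baseline are hypothetical names, and accuracy as the metric is an assumption, since this page does not state how the task is scored.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Item:
    """One true/false statement with its gold label (hypothetical container)."""
    statement: str
    answer: str  # "T" or "F"


# The two example items quoted on this page.
ITEMS = [
    Item("The daddy longlegs spider is the most venomous spider in the world.", "F"),
    Item("Karl Benz is correctly credited with the invention of the first modern automobile.", "T"),
]


def accuracy(items: Sequence[Item], predict: Callable[[str], str]) -> float:
    """Fraction of items where predict(statement) matches the gold label."""
    if not items:
        raise ValueError("items must not be empty")
    correct = sum(1 for item in items if predict(item.statement) == item.answer)
    return correct / len(items)


def always_true(statement: str) -> str:
    """Trivial baseline that accepts every statement as true."""
    return "T"


if __name__ == "__main__":
    print(accuracy(ITEMS, always_true))
```

A constant baseline such as `always_true` is a useful reference point: a model that cannot separate misconceptions from facts should score close to it.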

Papers

Showing 31-40 of 161 papers

Title | Status | Hype
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation | Code | 0
Paths and Ambient Spaces in Neural Loss Landscapes | Code | 0
Emergent Abilities in Large Language Models: A Survey | - | 0
Analyzing Factors Influencing Driver Willingness to Accept Advanced Driver Assistance Systems | - | 0
The Imitation Game for Educational AI | - | 0
Retrieval-augmented systems can be dangerous medical communicators | - | 0
Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact | - | 0
Knowledge Tracing in Programming Education Integrating Students' Questions | - | 0
Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction | - | 0
Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning | - | 0

No leaderboard results yet.