SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 8190 of 161 papers

TitleStatusHype
Limitations of Deep Neural Networks: a discussion of G. Marcus' critical appraisal of deep learning0
Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation0
LLM Library Learning Fails: A LEGO-Prover Case Study0
Machine Learning Students Overfit to Overfitting0
Can a Hallucinating Model help in Reducing Human "Hallucination"?0
Math Multiple Choice Question Generation via Human-Large Language Model Collaboration0
Metagenomic Analysis using Phylogenetic Placement -- A Review of the First Decade0
A Graphical Approach to State Variable Selection in Off-policy Learning0
Neural topology optimization: the good, the bad, and the ugly0
Challenges and Trends in User Trust Discourse in AI0
Show:102550
← PrevPage 9 of 17Next →

No leaderboard results yet.