SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 6170 of 161 papers

TitleStatusHype
Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias0
Automated Identification of Logical Errors in Programs: Advancing Scalable Analysis of Student Misconceptions0
Crowdsourcing the Perception of Machine Teaching0
COVIDLies: Detecting COVID-19 Misinformation on Social Media0
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts0
Contrastive Explanations That Anticipate Human Misconceptions Can Improve Human Decision-Making Skills0
Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning0
An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning0
Formalising Anti-Discrimination Law in Automated Decision Systems0
Axiomatic modeling of fixed proportion technologies0
Show:102550
← PrevPage 7 of 17Next →

No leaderboard results yet.