SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 4150 of 161 papers

TitleStatusHype
EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context LearningCode0
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit MisinformationCode0
A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation TopicsCode0
Collecting the Public Perception of AI and Robot RightsCode0
MalAlgoQA: Pedagogical Evaluation of Counterfactual Reasoning in Large Language Models and Implications for AI in EducationCode0
Deep Curvature SuiteCode0
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice QuestionsCode0
From Solution Synthesis to Student Attempt Synthesis for Block-Based Visual Programming TasksCode0
Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt WordingCode0
Paths and Ambient Spaces in Neural Loss LandscapesCode0
Show:102550
← PrevPage 5 of 17Next →

No leaderboard results yet.