SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 151161 of 161 papers

TitleStatusHype
Deep Curvature SuiteCode0
Not All Claims are Created Equal: Choosing the Right Statistical Approach to Assess HypothesesCode0
Paths and Ambient Spaces in Neural Loss LandscapesCode0
Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt WordingCode0
Resolving conceptual issues in Modern Coexistence TheoryCode0
Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational DataCode0
Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language LearningCode0
Pay attention to your loss: understanding misconceptions about 1-Lipschitz neural networksCode0
WatChat: Explaining perplexing programs by debugging mental modelsCode0
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific ResearchCode0
When big data actually are low-rank, or entrywise approximation of certain function-generated matricesCode0
Show:102550
← PrevPage 4 of 4Next →

No leaderboard results yet.