SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 110 of 161 papers

TitleStatusHype
Training Compute-Optimal Large Language ModelsCode6
Factuality Enhanced Language Models for Open-Ended Text GenerationCode5
The pitfalls of next-token predictionCode2
Towards Democratizing Joint-Embedding Self-Supervised LearningCode2
Parting with Misconceptions about Learning-based Vehicle Motion PlanningCode2
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-CutCode1
Improving the Validity of Automatically Generated Feedback via Reinforcement LearningCode1
Laplace Redux -- Effortless Bayesian Deep LearningCode1
Enhancing Knowledge Tracing with Concept Map and Response DisentanglementCode1
Show:102550
← PrevPage 1 of 17Next →

No leaderboard results yet.