SOTAVerified

Misconceptions

Measures whether a model can distinguish popular misconceptions from true statements.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench
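The true/false format above lends itself to a simple accuracy score. The following is a minimal sketch, not the official BIG-bench harness: the `Item` class, the `accuracy` function, and the `always_true` baseline are hypothetical names introduced here for illustration, and the only data used are the two example items shown above.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Item:
    """One true/false statement (hypothetical container, not a BIG-bench type)."""
    statement: str
    answer: str  # "T" or "F"


# The two example items from this page.
ITEMS: List[Item] = [
    Item("The daddy longlegs spider is the most venomous spider in the world.", "F"),
    Item("Karl Benz is correctly credited with the invention of the first modern automobile.", "T"),
]


def accuracy(predict: Callable[[str], str], items: List[Item]) -> float:
    """Fraction of items where predict(statement) matches the labelled answer."""
    if not items:
        raise ValueError("items must not be empty")
    correct = sum(1 for item in items if predict(item.statement) == item.answer)
    return correct / len(items)


def always_true(statement: str) -> str:
    """Trivial baseline that labels every statement as true."""
    return "T"


if __name__ == "__main__":
    print(accuracy(always_true, ITEMS))
```

A real evaluation would replace `always_true` with a function that queries a model and maps its output to `"T"` or `"F"`. A constant baseline like this one is still useful as a reference point for what a model that ignores the statement would score.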

Papers

Showing 11-20 of 161 papers

Title | Status | Hype
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning | Code | 1
Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut | Code | 1
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs | Code | 1
TruthfulQA: Measuring How Models Mimic Human Falsehoods | Code | 1
Back to the Drawing Board: A Critical Evaluation of Poisoning Attacks on Production Federated Learning | Code | 1
Laplace Redux -- Effortless Bayesian Deep Learning | Code | 1
Emergent Communication under Competition | Code | 1
A Tutorial on VAEs: From Bayes' Rule to Lossless Compression | Code | 1
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines | Code | 1
Re-Examining Linear Embeddings for High-Dimensional Bayesian Optimization | Code | 1

No leaderboard results yet.