SOTAVerified

Misconceptions

Measures whether a model can distinguish popular misconceptions from true statements.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T
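The example format above can be scored with a few lines of code. The following is a minimal sketch, not the BIG-bench evaluation harness: the names `Item`, `parse_items`, `accuracy`, and `always_false` are hypothetical, and it assumes each item is written as `input:`, one or more `choice:` lines, and `answer:`, exactly as shown in the example.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Item:
    """One true/false statement in the format shown above (hypothetical type)."""
    statement: str
    choices: List[str] = field(default_factory=list)
    answer: str = ""


def parse_items(text: str) -> List[Item]:
    """Parse 'input: / choice: / answer:' lines into Item records."""
    items: List[Item] = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        # Split on the first colon only, so colons inside a statement survive.
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if key == "input":
            current = Item(statement=value)
            items.append(current)
        elif current is None:
            continue  # ignore stray lines before the first 'input:'
        elif key == "choice":
            current.choices.append(value)
        elif key == "answer":
            current.answer = value
    return items


def accuracy(items: List[Item], predict: Callable[[str], str]) -> float:
    """Fraction of items where predict(statement) equals the gold answer."""
    if not items:
        return 0.0
    correct = sum(1 for item in items if predict(item.statement) == item.answer)
    return correct / len(items)


EXAMPLE = """
input: The daddy longlegs spider is the most venomous spider in the world.
choice: T
choice: F
answer: F

input: Karl Benz is correctly credited with the invention of the first modern automobile.
choice: T
choice: F
answer: T
"""


def always_false(statement: str) -> str:
    """Trivial baseline standing in for a model: calls every statement false."""
    return "F"


items = parse_items(EXAMPLE)
score = accuracy(items, always_false)  # 0.5 on the two items above
```

In practice `always_false` would be replaced by a function that queries the model under test and maps its output to `T` or `F`. A constant baseline like this one is a useful reference point for how much of a score comes from label balance alone.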

Source: BIG-bench

Papers

Showing 11-20 of 161 papers

Title | Status | Hype
Re-Examining Linear Embeddings for High-Dimensional Bayesian Optimization | Code | 1
A Tutorial on VAEs: From Bayes' Rule to Lossless Compression | Code | 1
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning | Code | 1
Enhancing Knowledge Tracing with Concept Map and Response Disentanglement | Code | 1
Emergent Communication under Competition | Code | 1
Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs | Code | 1
Back to the Drawing Board: A Critical Evaluation of Poisoning Attacks on Production Federated Learning | Code | 1
Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut | Code | 1
Laplace Redux -- Effortless Bayesian Deep Learning | Code | 1
Unveiling Contrastive Learning's Capability of Neighborhood Aggregation for Collaborative Filtering | Code | 1

No leaderboard results yet.