SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 101125 of 161 papers

TitleStatusHype
Factuality Enhanced Language Models for Open-Ended Text GenerationCode5
How Useful are Gradients for OOD Detection Really?0
A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation TopicsCode0
From Solution Synthesis to Student Attempt Synthesis for Block-Based Visual Programming TasksCode0
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational NeedsCode1
Training Compute-Optimal Large Language ModelsCode6
Rectified Max-Value Entropy Search for Bayesian Optimization0
Biometric recognition: why not massively adopted yet?0
Metagenomic Analysis using Phylogenetic Placement -- A Review of the First Decade0
Resolving conceptual issues in Modern Coexistence TheoryCode0
Big Data is not the New Oil: Common Misconceptions about Population Data0
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Demystifying Ten Big Ideas and Rules Every Fire Scientist & Engineer Should Know About Blackbox, Whitebox & Causal Artificial Intelligence0
End-to-End Annotator Bias Approximation on Crowdsourced Single-Label Sentiment AnalysisCode0
TruthfulQA: Measuring How Models Mimic Human FalsehoodsCode1
Back to the Drawing Board: A Critical Evaluation of Poisoning Attacks on Production Federated LearningCode1
Laplace Redux -- Effortless Bayesian Deep LearningCode1
The kernel perspective on dynamic mode decomposition0
Laplace Redux - Effortless Bayesian Deep Learning0
Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing0
Pay attention to your loss: understanding misconceptions about 1-Lipschitz neural networksCode0
Knowledge, beliefs, attitudes and perceived risk about COVID-19 vaccine and determinants of COVID-19 vaccine acceptance in Bangladesh0
Deep Discourse Analysis for Generating Personalized Feedback in Intelligent Tutor Systems0
Toward Semi-Automatic Misconception Discovery Using Code Embeddings0
Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning0
Show:102550
← PrevPage 5 of 7Next →

No leaderboard results yet.