SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 101125 of 161 papers

TitleStatusHype
Challenges and Trends in User Trust Discourse in AI0
On the lifting and reconstruction of nonlinear systems with multiple invariant sets0
Demystifying Misconceptions in Social Bots Research0
Succinct Representations for Concepts0
Response to Moffat's Comment on "Towards Meaningful Statements in IR Evaluation: Mapping Evaluation Measures to Interval Scales"0
Deep learning applied to computational mechanics: A comprehensive review, state of the art, and the classics0
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy0
The Monitor Model and its Misconceptions: A Clarification0
Machine Learning Students Overfit to Overfitting0
Dynamics and triggers of misinformation on vaccines0
Response to: Significance and stability of deep learning-based identification of subtypes within major psychiatric disorders. Molecular Psychiatry (2022)0
How Useful are Gradients for OOD Detection Really?0
A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation TopicsCode0
From Solution Synthesis to Student Attempt Synthesis for Block-Based Visual Programming TasksCode0
Rectified Max-Value Entropy Search for Bayesian Optimization0
Biometric recognition: why not massively adopted yet?0
Metagenomic Analysis using Phylogenetic Placement -- A Review of the First Decade0
Resolving conceptual issues in Modern Coexistence TheoryCode0
Big Data is not the New Oil: Common Misconceptions about Population Data0
Demystifying Ten Big Ideas and Rules Every Fire Scientist & Engineer Should Know About Blackbox, Whitebox & Causal Artificial Intelligence0
End-to-End Annotator Bias Approximation on Crowdsourced Single-Label Sentiment AnalysisCode0
The kernel perspective on dynamic mode decomposition0
Laplace Redux - Effortless Bayesian Deep Learning0
Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing0
Pay attention to your loss: understanding misconceptions about 1-Lipschitz neural networksCode0
Show:102550
← PrevPage 5 of 7Next →

No leaderboard results yet.