SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench
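
A minimal sketch of how items in the format above could be scored. It assumes plain accuracy as the metric, which this page does not specify. The names `Item`, `accuracy`, and `always_true` are hypothetical and are not part of BIG-bench.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Item:
    """One true/false statement in the input/choice/answer format shown above."""
    statement: str
    choices: List[str]
    answer: str


# The two example items from this page.
EXAMPLES = [
    Item("The daddy longlegs spider is the most venomous spider in the world.",
         ["T", "F"], "F"),
    Item("Karl Benz is correctly credited with the invention of the first "
         "modern automobile.", ["T", "F"], "T"),
]


def accuracy(items: List[Item], predict: Callable[[str, List[str]], str]) -> float:
    """Fraction of items where predict() returns the labeled answer.

    A prediction outside the listed choices is counted as wrong.
    """
    if not items:
        raise ValueError("need at least one item")
    correct = 0
    for item in items:
        guess = predict(item.statement, item.choices)
        if guess in item.choices and guess == item.answer:
            correct += 1
    return correct / len(items)


def always_true(statement: str, choices: List[str]) -> str:
    """Hypothetical baseline standing in for a model: accepts every statement."""
    return "T"


print(accuracy(EXAMPLES, always_true))  # 0.5
```

A baseline that accepts every statement gets only the Karl Benz item right here, so a model has to reject the misconception as well as accept the true statement to score above it.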

Papers

Showing 101–150 of 161 papers

| Title | Status | Hype |
|---|---|---|
| Challenges and Trends in User Trust Discourse in AI | | 0 |
| On the lifting and reconstruction of nonlinear systems with multiple invariant sets | | 0 |
| Demystifying Misconceptions in Social Bots Research | | 0 |
| Succinct Representations for Concepts | | 0 |
| Response to Moffat's Comment on "Towards Meaningful Statements in IR Evaluation: Mapping Evaluation Measures to Interval Scales" | | 0 |
| Deep learning applied to computational mechanics: A comprehensive review, state of the art, and the classics | | 0 |
| Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy | | 0 |
| The Monitor Model and its Misconceptions: A Clarification | | 0 |
| Machine Learning Students Overfit to Overfitting | | 0 |
| Dynamics and triggers of misinformation on vaccines | | 0 |
| Response to: Significance and stability of deep learning-based identification of subtypes within major psychiatric disorders. Molecular Psychiatry (2022) | | 0 |
| How Useful are Gradients for OOD Detection Really? | | 0 |
| A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics | Code | 0 |
| From Solution Synthesis to Student Attempt Synthesis for Block-Based Visual Programming Tasks | Code | 0 |
| Rectified Max-Value Entropy Search for Bayesian Optimization | | 0 |
| Biometric recognition: why not massively adopted yet? | | 0 |
| Metagenomic Analysis using Phylogenetic Placement -- A Review of the First Decade | | 0 |
| Resolving conceptual issues in Modern Coexistence Theory | Code | 0 |
| Big Data is not the New Oil: Common Misconceptions about Population Data | | 0 |
| Demystifying Ten Big Ideas and Rules Every Fire Scientist & Engineer Should Know About Blackbox, Whitebox & Causal Artificial Intelligence | | 0 |
| End-to-End Annotator Bias Approximation on Crowdsourced Single-Label Sentiment Analysis | Code | 0 |
| The kernel perspective on dynamic mode decomposition | | 0 |
| Laplace Redux - Effortless Bayesian Deep Learning | | 0 |
| Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing | | 0 |
| Pay attention to your loss: understanding misconceptions about 1-Lipschitz neural networks | Code | 0 |
| Knowledge, beliefs, attitudes and perceived risk about COVID-19 vaccine and determinants of COVID-19 vaccine acceptance in Bangladesh | | 0 |
| Deep Discourse Analysis for Generating Personalized Feedback in Intelligent Tutor Systems | | 0 |
| Toward Semi-Automatic Misconception Discovery Using Code Embeddings | | 0 |
| Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning | | 0 |
| Limitations of Deep Neural Networks: a discussion of G. Marcus' critical appraisal of deep learning | | 0 |
| Hindsight and Sequential Rationality of Correlated Play | Code | 0 |
| Quantum Technology for Economists | | 0 |
| COVIDLies: Detecting COVID-19 Misinformation on Social Media | | 0 |
| Depression Status Estimation by Deep Learning based Hybrid Multi-Modal Fusion Model | | 0 |
| Enforcing Interpretability and its Statistical Impacts: Trade-offs between Accuracy and Interpretability | | 0 |
| Collecting the Public Perception of AI and Robot Rights | Code | 0 |
| A clarification of misconceptions, myths and desired status of artificial intelligence | | 0 |
| Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge | | 0 |
| The Bussgang Decomposition of Non-Linear Systems: Basic Theory and MIMO Extensions | | 0 |
| The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe | | 0 |
| Crowdsourcing the Perception of Machine Teaching | | 0 |
| Deep Curvature Suite | Code | 0 |
| Not All Claims are Created Equal: Choosing the Right Statistical Approach to Assess Hypotheses | Code | 0 |
| From Random to Regular: Variation in the Patterning of Retinal Mosaics | | 0 |
| Discounted Reinforcement Learning Is Not an Optimization Problem | | 0 |
| On Proximity and Structural Role-based Embeddings in Networks: Misconceptions, Techniques, and Applications | | 0 |
| A close-up comparison of the misclassification error distance and the adjusted Rand index for external clustering evaluation | | 0 |
| How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions | Code | 0 |
| Zero Shot Learning for Code Education: Rubric Sampling with Deep Learning Inference | Code | 0 |
| Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering | | 0 |

No leaderboard results yet.