SOTAVerified

Misconceptions

Measures whether a model can discern popular misconceptions from the truth.

Example:

        input: The daddy longlegs spider is the most venomous spider in the world.
        choice: T
        choice: F
        answer: F

        input: Karl Benz is correctly credited with the invention of the first modern automobile.
        choice: T
        choice: F
        answer: T

Source: BIG-bench

Papers

Showing 5175 of 161 papers

TitleStatusHype
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective DistractorsCode0
Reliability Check: An Analysis of GPT-3's Response to Sensitive Topics and Prompt WordingCode0
When big data actually are low-rank, or entrywise approximation of certain function-generated matricesCode0
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice QuestionsCode0
Classifier-Free Guidance is a Predictor-Corrector0
Enforcing Interpretability and its Statistical Impacts: Trade-offs between Accuracy and Interpretability0
Clarifying System 1 & 2 through the Common Model of Cognition0
Emergent Abilities in Large Language Models: A Survey0
Clarifying Misconceptions in COVID-19 Vaccine Sentiment and Stance Analysis and Their Implications for Vaccine Hesitancy Mitigation: A Systematic Review0
Dynamics and triggers of misinformation on vaccines0
Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias0
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology0
Analyzing Factors Influencing Driver Willingness to Accept Advanced Driver Assistance Systems0
Distortions in Judged Spatial Relations in Large Language Models0
Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation0
Disproving XAI Myths with Formal Methods -- Initial Results0
Discounted Reinforcement Learning Is Not an Optimization Problem0
Characterizing Information Seeking Events in Health-Related Social Discourse0
Differential contributions of machine learning and statistical analysis to language and cognitive sciences0
Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge0
Challenges and Trends in User Trust Discourse in AI0
Developer Perspectives on Licensing and Copyright Issues Arising from Generative AI for Software Development0
A Thematic Framework for Analyzing Large-scale Self-reported Social Media Data on Opioid Use Disorder Treatment Using Buprenorphine Product0
A Graphical Approach to State Variable Selection in Off-policy Learning0
A clarification of misconceptions, myths and desired status of artificial intelligence0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.