
Known Unknowns

Language models have a tendency to generate text containing false statements, often referred to as "hallucinations." The primary purpose of this task is to test for this failure case by probing whether a model can correctly identify that the answer to a question is unknown. A common failure mode is for the model to prefer a definite answer of "false" to a question whose truth is unknown, rather than answering that it is unknown.
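The probe described above can be sketched as a small evaluation loop. The dataset format below (question, gold label in {"yes", "no", "unknown"}) and the helper names are illustrative assumptions, not the official BIG-bench schema; it simply measures overall accuracy plus how often the model commits to a definite answer when the gold label is "unknown".

```python
def evaluate_known_unknowns(model, examples):
    """Score `model` on questions whose gold answer may be 'unknown'.

    `model` maps a question string to 'yes', 'no', or 'unknown'.
    Returns (accuracy, hallucination_rate), where hallucination_rate is
    the fraction of unknown-answer questions on which the model
    committed to a definite yes/no answer.
    """
    correct = 0
    unknown_total = 0
    unknown_committed = 0
    for question, gold in examples:
        pred = model(question)
        if pred == gold:
            correct += 1
        if gold == "unknown":
            unknown_total += 1
            if pred != "unknown":
                unknown_committed += 1  # the failure mode described above
    accuracy = correct / len(examples)
    hallucination_rate = (unknown_committed / unknown_total) if unknown_total else 0.0
    return accuracy, hallucination_rate


# Toy model exhibiting the failure mode: it always commits to "no".
always_no = lambda q: "no"

examples = [
    ("Is water wet?", "yes"),
    ("Is the moon made of cheese?", "no"),
    ("What number is your neighbor thinking of right now?", "unknown"),
]
acc, halluc = evaluate_known_unknowns(always_no, examples)
```

On this toy set the always-"no" model scores 1/3 accuracy and a hallucination rate of 1.0, illustrating why a separate unknown-question metric is needed: plain accuracy alone would not distinguish refusing to say "unknown" from ordinary errors.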

Source: BIG-bench

Papers

Showing 11–15 of 15 papers

Title | Status | Hype
Domain Concretization from Examples: Addressing Missing Domain Knowledge via Robust Planning | | 0
Classification Uncertainty of Deep Neural Networks Based on Gradient Information | | 0
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents | | 0
The known unknowns of the Hsp90 chaperone | | 0
Toward Open-Set Face Recognition | | 0

No leaderboard results yet.