SOTAVerified

Fact Checking

Papers

Showing 311320 of 669 papers

TitleStatusHype
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language ModelsCode1
Knowledge Graphs Querying0
Enhancing Large Language Models Against Inductive Instructions with Dual-critique PromptingCode0
ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media0
AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the WebCode1
Multimodal Automated Fact-Checking: A SurveyCode2
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific TablesCode1
Fact-Checking Complex Claims with Program-Guided ReasoningCode1
Complex Claim Verification with Evidence Retrieved in the WildCode1
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing0
Show:102550
← PrevPage 32 of 67Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified