SOTAVerified

Fact Checking

Papers

Showing 101125 of 669 papers

TitleStatusHype
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style AttacksCode1
End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and ModelsCode1
Evidence-based Factual Error CorrectionCode1
CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-CheckingCode1
Evidence-based Fact-Checking of Health-related ClaimsCode1
Benchmarking the Generation of Fact Checking ExplanationsCode1
Predicting Sentence-Level Factuality of News and Bias of Media OutletsCode1
Re2G: Retrieve, Rerank, GenerateCode1
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-CheckingCode1
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked ClaimsCode1
FaVIQ: FAct Verification from Information-seeking QuestionsCode1
Factify 2: A Multimodal Fake News and Satire News DatasetCode1
Fact-checking information from large language models can decrease headline discernmentCode1
Explainable Automated Fact-Checking: A SurveyCode1
A Survey on Automated Fact-CheckingCode1
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language ModelsCode1
CheckThat! at CLEF 2020: Enabling the Automatic Identification and Verification of Claims in Social MediaCode1
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language ModelsCode1
Ask to Know More: Generating Counterfactual Explanations for Fake ClaimsCode1
Team Alex at CLEF CheckThat! 2020: Identifying Check-Worthy Tweets With Transformer ModelsCode1
CREAK: A Dataset for Commonsense Reasoning over Entity KnowledgeCode1
Fact or Fiction: Verifying Scientific ClaimsCode1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World ClaimsCode1
ChartCheck: Explainable Fact-Checking over Real-World Chart ImagesCode1
X-FACT: A New Benchmark Dataset for Multilingual Fact CheckingCode1
Show:102550
← PrevPage 5 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified