SOTAVerified

Fact Checking

Papers

Showing 76100 of 669 papers

TitleStatusHype
Early Rumor Detection Using Neural Hawkes Process with a New Benchmark DatasetCode1
End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and ModelsCode1
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLPCode1
Explainable Automated Fact-Checking: A SurveyCode1
Explainable Automated Fact-Checking for Public Health ClaimsCode1
An Enhanced Fake News Detection System With Fuzzy Deep LearningCode1
Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkersCode1
Factify 2: A Multimodal Fake News and Satire News DatasetCode1
Complex Claim Verification with Evidence Retrieved in the WildCode1
Evidence-based Factual Error CorrectionCode1
Predicting Sentence-Level Factuality of News and Bias of Media OutletsCode1
CREAK: A Dataset for Commonsense Reasoning over Entity KnowledgeCode1
Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation GenerationCode1
Automatic Evaluation of Attribution by Large Language ModelsCode1
Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent CommunitiesCode1
AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection DatasetCode1
Get Your Vitamin C! Robust Fact Verification with Contrastive EvidenceCode1
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-CheckingCode1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World ClaimsCode1
CheckThat! at CLEF 2020: Enabling the Automatic Identification and Verification of Claims in Social MediaCode1
Automatic Fake News Detection: Are Models Learning to Reason?Code1
KILT: a Benchmark for Knowledge Intensive Language TasksCode1
A Survey on Automated Fact-CheckingCode1
``Liar, Liar Pants on Fire'': A New Benchmark Dataset for Fake News DetectionCode1
CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-CheckingCode1
Show:102550
← PrevPage 4 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified