SOTAVerified

Fact Checking

Papers

Showing 5175 of 669 papers

TitleStatusHype
Evidence-based Factual Error CorrectionCode1
Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation GenerationCode1
Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent CommunitiesCode1
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked ClaimsCode1
Claim Check-Worthiness Detection as Positive Unlabelled LearningCode1
Fact-checking information from large language models can decrease headline discernmentCode1
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-CheckingCode1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World ClaimsCode1
Ask to Know More: Generating Counterfactual Explanations for Fake ClaimsCode1
Automatic Evaluation of Attribution by Large Language ModelsCode1
Improving Evidence Retrieval for Automated Explainable Fact-CheckingCode1
Improving Wikipedia Verifiability with AICode1
DialFact: A Benchmark for Fact-Checking in DialogueCode1
Detecting Deepfakes Without Seeing AnyCode1
DirectQuote: A Dataset for Direct Quotation Extraction and Attribution in News ArticlesCode1
Cross-lingual COVID-19 Fake News DetectionCode1
3HAN: A Deep Neural Network for Fake News DetectionCode1
COVE: COntext and VEracity prediction for out-of-context imagesCode1
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal ExpertsCode1
Document-level Claim Extraction and Decontextualisation for Fact-CheckingCode1
Chronocept: Instilling a Sense of Time in MachinesCode1
Attribute First, then Generate: Locally-attributable Grounded Text GenerationCode1
COVID-VTS: Fact Extraction and Verification on Short Video PlatformsCode1
CREAK: A Dataset for Commonsense Reasoning over Entity KnowledgeCode1
CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-CheckingCode1
Show:102550
← PrevPage 3 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified