SOTAVerified

Fact Checking

Papers

Showing 3140 of 669 papers

TitleStatusHype
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World ClaimsCode1
"Image, Tell me your story!" Predicting the original meta-context of visual misinformationCode1
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMsCode1
Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent CommunitiesCode1
Meerkat: Audio-Visual Large Language Model for Grounding in Space and TimeCode1
An Enhanced Fake News Detection System With Fuzzy Deep LearningCode1
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language ModelsCode1
Document-level Claim Extraction and Decontextualisation for Fact-CheckingCode1
RATT: A Thought Structure for Coherent and Correct LLM ReasoningCode1
Attribute First, then Generate: Locally-attributable Grounded Text GenerationCode1
Show:102550
← PrevPage 4 of 67Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified