SOTAVerified

Fact Checking

Papers

Showing 2130 of 669 papers

TitleStatusHype
Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact VerifiersCode1
Chronocept: Instilling a Sense of Time in MachinesCode1
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language ModelsCode1
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-CheckingCode1
HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic ClaimsCode1
COVE: COntext and VEracity prediction for out-of-context imagesCode1
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal ExpertsCode1
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OasisCode1
Belief in the Machine: Investigating Epistemological Blind Spots of Language ModelsCode1
FIRE: Fact-checking with Iterative Retrieval and VerificationCode1
Show:102550
← PrevPage 3 of 67Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified