SOTAVerified

Fact Checking

Papers

Showing 581590 of 669 papers

TitleStatusHype
Verifying the Robustness of Automatic Credibility AssessmentCode0
AraFacts: The First Large Arabic Dataset of Naturally Occurring ClaimsCode0
RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-CheckingCode0
Eating Garlic Prevents COVID-19 Infection: Detecting Misinformation on the Arabic Content of TwitterCode0
Real-time Fake News from Adversarial FeedbackCode0
bgGLUE: A Bulgarian General Language Understanding Evaluation BenchmarkCode0
An Adversarial Benchmark for Fake News Detection ModelsCode0
AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM AnnotatorsCode0
DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact VerificationCode0
RED-DOT: Multimodal Fact-checking via Relevant Evidence DetectionCode0
Show:102550
← PrevPage 59 of 67Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified