SOTAVerified

Fact Checking

Papers

Showing 51-75 of 669 papers

| Title | Status | Hype |
| --- | --- | --- |
| Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models | | 0 |
| AIJIM: A Scalable Model for Real-Time AI in Environmental Journalism | | 0 |
| Entity-aware Cross-lingual Claim Detection for Automated Fact-checking | Code | 0 |
| Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information | | 0 |
| Artificial Intelligence in Deliberation: The AI Penalty and the Emergence of a New Deliberative Divide | | 0 |
| A Graph-based Verification Framework for Fact-Checking | | 0 |
| Evaluating open-source Large Language Models for automated fact-checking | | 0 |
| When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation Edits | Code | 0 |
| Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies | | 0 |
| SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking | Code | 2 |
| GraphCheck: Multi-Path Fact-Checking with Entity-Relationship Graphs | Code | 0 |
| A Causal Lens for Evaluating Faithfulness Metrics | | 0 |
| FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models | Code | 1 |
| Verdict: A Library for Scaling Judge-Time Compute | Code | 3 |
| Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models | | 0 |
| GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking | | 0 |
| BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking | Code | 1 |
| Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals | | 0 |
| Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking | Code | 0 |
| Step-by-Step Fact Verification System for Medical Claims with Explainable Reasoning | Code | 0 |
| Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models | Code | 0 |
| Can Community Notes Replace Professional Fact-Checkers? | | 0 |
| HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims | Code | 1 |
| Towards Effective Extraction and Evaluation of Factual Claims | | 0 |
| Towards Automated Fact-Checking of Real-World Claims: Exploring Task Formulation and Assessment with LLMs | | 0 |

Benchmark Results

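| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | monoT5-3B | nDCG@10 | 0.78 | | Unverified |
| 2 | SGPT-BE-5.8B | nDCG@10 | 0.75 | | Unverified |
| 3 | BM25+CE | nDCG@10 | 0.69 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.68 | | Unverified |
| 5 | ColBERT | nDCG@10 | 0.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SGPT-BE-5.8B | nDCG@10 | 0.31 | | Unverified |
| 2 | monoT5-3B | nDCG@10 | 0.28 | | Unverified |
| 3 | BM25+CE | nDCG@10 | 0.25 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.16 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | monoT5-3B | nDCG@10 | 0.85 | | Unverified |
| 2 | BM25+CE | nDCG@10 | 0.82 | | Unverified |
| 3 | SGPT-BE-5.8B | nDCG@10 | 0.78 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.73 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | HerO | Question Only score | 0.48 | | Unverified |
| 2 | CTU AIC | Question Only score | 0.46 | | Unverified |
| 3 | InFact | Question Only score | 0.45 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Abc | 0..5sec | 2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MA-CIN | Precision | 0.26 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | FDHN | Accuracy (Test) | 0.7 | | Unverified |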