SOTAVerified

Fact Checking

Papers

Showing 176200 of 669 papers

TitleStatusHype
Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information0
A Graph-based Verification Framework for Fact-Checking0
Artificial Intelligence in Deliberation: The AI Penalty and the Emergence of a New Deliberative Divide0
Evaluating open-source Large Language Models for automated fact-checking0
When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation EditsCode0
Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies0
GraphCheck: Multi-Path Fact-Checking with Entity-Relationship GraphsCode0
A Causal Lens for Evaluating Faithfulness Metrics0
Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models0
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking0
Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals0
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-CheckingCode0
Step-by-Step Fact Verification System for Medical Claims with Explainable ReasoningCode0
Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language ModelsCode0
Can Community Notes Replace Professional Fact-Checkers?0
Towards Effective Extraction and Evaluation of Factual Claims0
Towards Automated Fact-Checking of Real-World Claims: Exploring Task Formulation and Assessment with LLMs0
Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking0
CORRECT: Context- and Reference-Augmented Reasoning and Prompting for Fact-CheckingCode0
FlashCheck: Exploration of Efficient Evidence Retrieval for Fast Fact-CheckingCode0
Claim Extraction for Fact-Checking: Data, Models, and Automated Metrics0
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasksCode0
Decoding AI Judgment: How LLMs Assess News Credibility and Bias0
Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation0
Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking0
Show:102550
← PrevPage 8 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified