SOTAVerified

Fact Checking

Papers

Showing 51-100 of 669 papers

Title | Status | Hype
Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models | | 0
AIJIM: A Scalable Model for Real-Time AI in Environmental Journalism | | 0
Entity-aware Cross-lingual Claim Detection for Automated Fact-checking | Code | 0
Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information | | 0
Artificial Intelligence in Deliberation: The AI Penalty and the Emergence of a New Deliberative Divide | | 0
A Graph-based Verification Framework for Fact-Checking | | 0
Evaluating open-source Large Language Models for automated fact-checking | | 0
When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation Edits | Code | 0
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking | Code | 2
Unmasking Digital Falsehoods: A Comparative Analysis of LLM-Based Misinformation Detection Strategies | | 0
GraphCheck: Multi-Path Fact-Checking with Entity-Relationship Graphs | Code | 0
A Causal Lens for Evaluating Faithfulness Metrics | | 0
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models | Code | 1
Verdict: A Library for Scaling Judge-Time Compute | Code | 3
Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models | | 0
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking | | 0
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking | Code | 1
Worse than Zero-shot? A Fact-Checking Dataset for Evaluating the Robustness of RAG Against Misleading Retrievals | | 0
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking | Code | 0
Step-by-Step Fact Verification System for Medical Claims with Explainable Reasoning | Code | 0
Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models | Code | 0
Can Community Notes Replace Professional Fact-Checkers? | | 0
HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims | Code | 1
Towards Effective Extraction and Evaluation of Factual Claims | | 0
Towards Automated Fact-Checking of Real-World Claims: Exploring Task Formulation and Assessment with LLMs | | 0
Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking | | 0
FlashCheck: Exploration of Efficient Evidence Retrieval for Fast Fact-Checking | Code | 0
CORRECT: Context- and Reference-Augmented Reasoning and Prompting for Fact-Checking | Code | 0
Claim Extraction for Fact-Checking: Data, Models, and Automated Metrics | | 0
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks | Code | 0
Decoding AI Judgment: How LLMs Assess News Credibility and Bias | | 0
Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation | | 0
COVE: COntext and VEracity prediction for out-of-context images | Code | 1
Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking | | 0
Automatic Fact-Checking with Frame-Semantics | | 0
Fine-Grained Appropriate Reliance: Human-AI Collaboration with a Multi-Step Transparent Decision Workflow for Complex Task Decomposition | | 0
Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking | | 0
From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs | Code | 0
Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites | Code | 0
Improving Factuality with Explicit Working Memory | | 0
Evaluating the Performance of Large Language Models in Scientific Claim Detection and Classification | | 0
Logical Consistency of Large Language Models in Fact-checking | | 0
ViFactCheck: A New Benchmark Dataset and Methods for Multi-domain News Fact-Checking in Vietnamese | | 0
Face the Facts! Evaluating RAG-based Fact-checking Pipelines in Realistic Settings | Code | 0
Self-Adaptive Paraphrasing and Preference Learning for Improved Claim Verifiability | | 0
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts | Code | 1
Exploring Multidimensional Checkworthiness: Designing AI-assisted Claim Prioritization for Human Fact-checkers | | 0
LLMs as Debate Partners: Utilizing Genetic Algorithms and Adversarial Search for Adaptive Arguments | Code | 0
Multimodal Fact-Checking with Vision Language Models: A Probing Classifier based Solution with Embedding Strategies | Code | 0
Anatomically-Grounded Fact Checking of Automated Chest X-ray Reports | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | monoT5-3B | nDCG@10 | 0.78 | | Unverified
2 | SGPT-BE-5.8B | nDCG@10 | 0.75 | | Unverified
3 | BM25+CE | nDCG@10 | 0.69 | | Unverified
4 | SGPT-CE-6.1B | nDCG@10 | 0.68 | | Unverified
5 | ColBERT | nDCG@10 | 0.67 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SGPT-BE-5.8B | nDCG@10 | 0.31 | | Unverified
2 | monoT5-3B | nDCG@10 | 0.28 | | Unverified
3 | BM25+CE | nDCG@10 | 0.25 | | Unverified
4 | SGPT-CE-6.1B | nDCG@10 | 0.16 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | monoT5-3B | nDCG@10 | 0.85 | | Unverified
2 | BM25+CE | nDCG@10 | 0.82 | | Unverified
3 | SGPT-BE-5.8B | nDCG@10 | 0.78 | | Unverified
4 | SGPT-CE-6.1B | nDCG@10 | 0.73 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HerO | Question Only score | 0.48 | | Unverified
2 | CTU AIC | Question Only score | 0.46 | | Unverified
3 | InFact | Question Only score | 0.45 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Abc | 0..5sec | 2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MA-CIN | Precision | 0.26 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FDHN | Accuracy (Test) | 0.7 | | Unverified