SOTAVerified

Fact Verification

Fact verification, also called "fact checking", is a process of verifying facts in natural text against a database of facts.

Papers

Showing 51100 of 216 papers

TitleStatusHype
Get Your Vitamin C! Robust Fact Verification with Contrastive EvidenceCode1
Hierarchical Multi-head Attentive Network for Evidence-aware Fake News DetectionCode1
Explain and Predict, and then Predict AgainCode1
Evidence-based Factual Error CorrectionCode1
A Paragraph-level Multi-task Learning Model for Scientific Fact-VerificationCode1
LOREN: Logic-Regularized Reasoning for Interpretable Fact VerificationCode1
Where Are the Facts? Searching for Fact-checked Information to Alleviate the Spread of Fake NewsCode1
KILT: a Benchmark for Knowledge Intensive Language TasksCode1
Transformer-XH: Multi-Evidence Reasoning with eXtra Hop AttentionCode1
Revealing the Importance of Semantic Retrieval for Machine Reading at ScaleCode1
GEAR: Graph-based Evidence Aggregating and Reasoning for Fact VerificationCode1
DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact VerificationCode0
ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific ChartsCode0
Improving the fact-checking performance of language models by relying on their entailment ability0
Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion0
Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning0
Synthetic News Generation for Fake News Classification0
Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models0
Step-by-Step Fact Verification System for Medical Claims with Explainable ReasoningCode0
CMQCIC-Bench: A Chinese Benchmark for Evaluating Large Language Models in Medical Quality Control Indicator Calculation0
FlashCheck: Exploration of Efficient Evidence Retrieval for Fast Fact-CheckingCode0
Fine-Grained Appropriate Reliance: Human-AI Collaboration with a Multi-Step Transparent Decision Workflow for Complex Task Decomposition0
Learning to Verify Summary Facts with Fine-Grained LLM FeedbackCode0
ZeFaV: Boosting Large Language Models for Zero-shot Fact VerificationCode0
FactLens: Benchmarking Fine-Grained Fact Verification0
AMREx: AMR for Explainable Fact Verification0
TabVer: Tabular Fact Verification with Natural LogicCode0
Augmenting the Veracity and Explanations of Complex Fact Checking via Iterative Self-Revision with LLMs0
ChronoFact: Timeline-based Temporal Fact Verification0
A Little Human Data Goes A Long WayCode0
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison FeedbackCode0
Overview of Factify5WQA: Fact Verification through 5W Question-Answering0
Take It Easy: Label-Adaptive Self-Rationalization for Fact Verification and Explanation GenerationCode0
Zero-Shot Fact Verification via Natural Logic and Large Language ModelsCode0
Claim Verification in the Age of Large Language Models: A Survey0
Fact or Fiction? Improving Fact Verification with Knowledge Graphs through Simplified Subgraph RetrievalsCode0
Improving Retrieval Augmented Language Model with Self-Reasoning0
LookupForensics: A Large-Scale Multi-Task Dataset for Multi-Phase Image-Based Fact Verification0
Evidence-Based Temporal Fact Verification0
Multimodal Misinformation Detection using Large Vision-Language Models0
Scalable and Domain-General Abstractive Proposition Segmentation0
Molecular Facts: Desiderata for Decontextualization in LLM Fact VerificationCode0
Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments0
FactGenius: Combining Zero-Shot Prompting and Fuzzy Relation Mining to Improve Fact Verification with Knowledge GraphsCode0
Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction0
Multi-Evidence based Fact Verification via A Confidential Graph Neural NetworkCode0
ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source0
Hypothesis Testing Prompting Improves Deductive Reasoning in Large Language Models0
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization0
Towards a Search Engine for Machines: Unified Ranking for Multiple Retrieval-Augmented Large Language ModelsCode0
Show:102550
← PrevPage 2 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Re2GKILT-AC78.53Unverified
2intersectKILT-AC71.28Unverified
3WikipediaKILT-AC65.68Unverified
4KGIKILT-AC64.41Unverified
5Multitask DPR + BARTKILT-AC63.94Unverified
6BERT + DPRKILT-AC58.58Unverified
7RAGKILT-AC53.45Unverified
8BART + DPRKILT-AC47.68Unverified
9NSMNKILT-AC41.88Unverified
10T5-baseKILT-AC0Unverified
#ModelMetricClaimedVerifiedStatus
1ProoFVer-SBAccuracy79.47Unverified
2DREAMAccuracy76.85Unverified
3RoBERTa-Base Joint MSPP FlexibleAccuracy75.36Unverified
4RoBERTa-Base Joint MSPPAccuracy74.39Unverified
5KGATAccuracy74.1Unverified
6RAGAccuracy72.5Unverified
7GEARAccuracy71.6Unverified
#ModelMetricClaimedVerifiedStatus
1DanFEVER XLM-RoBERTa LargeF10.9Unverified