SOTAVerified

Fact Verification

Fact verification, also called "fact checking", is a process of verifying facts in natural text against a database of facts.

Papers

Showing 125 of 216 papers

TitleStatusHype
DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact VerificationCode0
Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact VerifiersCode1
ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific ChartsCode0
Reasoning-Table: Exploring Reinforcement Learning for Table ReasoningCode2
Table-R1: Inference-Time Scaling for Table ReasoningCode1
Improving the fact-checking performance of language models by relying on their entailment ability0
Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion0
Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning0
Synthetic News Generation for Fake News Classification0
Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models0
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-CheckingCode2
HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference OptimizationCode1
Benchmarking Retrieval-Augmented Generation in Multi-Modal ContextsCode2
Step-by-Step Fact Verification System for Medical Claims with Explainable ReasoningCode0
CMQCIC-Bench: A Chinese Benchmark for Evaluating Large Language Models in Medical Quality Control Indicator Calculation0
FlashCheck: Exploration of Efficient Evidence Retrieval for Fast Fact-CheckingCode0
Fine-Grained Appropriate Reliance: Human-AI Collaboration with a Multi-Step Transparent Decision Workflow for Complex Task Decomposition0
SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented GenerationCode2
Assessing the Limitations of Large Language Models in Clinical Fact DecompositionCode1
Learning to Verify Summary Facts with Fine-Grained LLM FeedbackCode0
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OasisCode1
ZeFaV: Boosting Large Language Models for Zero-shot Fact VerificationCode0
FactLens: Benchmarking Fine-Grained Fact Verification0
TabVer: Tabular Fact Verification with Natural LogicCode0
AMREx: AMR for Explainable Fact Verification0
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Re2GKILT-AC78.53Unverified
2intersectKILT-AC71.28Unverified
3WikipediaKILT-AC65.68Unverified
4KGIKILT-AC64.41Unverified
5Multitask DPR + BARTKILT-AC63.94Unverified
6BERT + DPRKILT-AC58.58Unverified
7RAGKILT-AC53.45Unverified
8BART + DPRKILT-AC47.68Unverified
9NSMNKILT-AC41.88Unverified
10T5-baseKILT-AC0Unverified
#ModelMetricClaimedVerifiedStatus
1ProoFVer-SBAccuracy79.47Unverified
2DREAMAccuracy76.85Unverified
3RoBERTa-Base Joint MSPP FlexibleAccuracy75.36Unverified
4RoBERTa-Base Joint MSPPAccuracy74.39Unverified
5KGATAccuracy74.1Unverified
6RAGAccuracy72.5Unverified
7GEARAccuracy71.6Unverified
#ModelMetricClaimedVerifiedStatus
1DanFEVER XLM-RoBERTa LargeF10.9Unverified