SOTAVerified

Fact Verification

Fact verification, also called "fact checking", is a process of verifying facts in natural text against a database of facts.

Papers

Showing 150 of 216 papers

TitleStatusHype
Loki: An Open-Source Tool for Fact VerificationCode5
Retrieval-Augmented Generation for Knowledge-Intensive NLP TasksCode4
AlignScore: Evaluating Factual Consistency with a Unified Alignment FunctionCode4
ReAct: Synergizing Reasoning and Acting in Language ModelsCode4
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-ReflectionCode4
Learning to Filter Context for Retrieval-Augmented GenerationCode2
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table UnderstandingCode2
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMsCode2
Reasoning-Table: Exploring Reinforcement Learning for Table ReasoningCode2
SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented GenerationCode2
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-CheckingCode2
Precise Zero-Shot Dense Retrieval without Relevance LabelsCode2
TART: An Open-Source Tool-Augmented Framework for Explainable Table-based ReasoningCode2
Benchmarking Retrieval-Augmented Generation in Multi-Modal ContextsCode2
Revealing the Importance of Semantic Retrieval for Machine Reading at ScaleCode1
EX-FEVER: A Dataset for Multi-hop Explainable Fact VerificationCode1
Assessing the Limitations of Large Language Models in Clinical Fact DecompositionCode1
Re2G: Retrieve, Rerank, GenerateCode1
Evidentiality-guided Generation for Knowledge-Intensive NLP TasksCode1
ProoFVer: Natural Logic Theorem Proving for Fact VerificationCode1
ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning ExamplesCode1
CREAK: A Dataset for Commonsense Reasoning over Entity KnowledgeCode1
Evidence-based Factual Error CorrectionCode1
Explain and Predict, and then Predict AgainCode1
Large Language Models are few(1)-shot Table ReasonersCode1
LOREN: Logic-Regularized Reasoning for Interpretable Fact VerificationCode1
KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language ModelsCode1
HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference OptimizationCode1
KILT: a Benchmark for Knowledge Intensive Language TasksCode1
Paragraph-based Transformer Pre-training for Multi-Sentence InferenceCode1
GEAR: Graph-based Evidence Aggregating and Reasoning for Fact VerificationCode1
Decker: Double Check with Heterogeneous Knowledge for Commonsense Fact VerificationCode1
Evidence-based Factual Error CorrectionCode1
GERE: Generative Evidence Retrieval for Fact VerificationCode1
Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature PerspectiveCode1
Factual Confidence of LLMs: on Reliability and Robustness of Current EstimatorsCode1
FaVIQ: FAct Verification from Information-seeking QuestionsCode1
FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured informationCode1
A Paragraph-level Multi-task Learning Model for Scientific Fact-VerificationCode1
Hierarchical Multi-head Attentive Network for Evidence-aware Fake News DetectionCode1
Attribute First, then Generate: Locally-attributable Grounded Text GenerationCode1
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on TablesCode1
AdaptKeyBERT: An Attention-Based approach towards Few-Shot & Zero-Shot Domain Adaptation of KeyBERTCode1
COVID-VTS: Fact Extraction and Verification on Short Video PlatformsCode1
Factify 2: A Multimodal Fake News and Satire News DatasetCode1
CHECKWHY: Causal Fact Verification via Argument StructureCode1
ChartCheck: Explainable Fact-Checking over Real-World Chart ImagesCode1
FactKG: Fact Verification via Reasoning on Knowledge GraphsCode1
Get Your Vitamin C! Robust Fact Verification with Contrastive EvidenceCode1
PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-trainingCode1
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Re2GKILT-AC78.53Unverified
2intersectKILT-AC71.28Unverified
3WikipediaKILT-AC65.68Unverified
4KGIKILT-AC64.41Unverified
5Multitask DPR + BARTKILT-AC63.94Unverified
6BERT + DPRKILT-AC58.58Unverified
7RAGKILT-AC53.45Unverified
8BART + DPRKILT-AC47.68Unverified
9NSMNKILT-AC41.88Unverified
10T5-baseKILT-AC0Unverified
#ModelMetricClaimedVerifiedStatus
1ProoFVer-SBAccuracy79.47Unverified
2DREAMAccuracy76.85Unverified
3RoBERTa-Base Joint MSPP FlexibleAccuracy75.36Unverified
4RoBERTa-Base Joint MSPPAccuracy74.39Unverified
5KGATAccuracy74.1Unverified
6RAGAccuracy72.5Unverified
7GEARAccuracy71.6Unverified
#ModelMetricClaimedVerifiedStatus
1DanFEVER XLM-RoBERTa LargeF10.9Unverified