Fact Checking

Papers

Showing 1–50 of 669 papers

Title	Date	Tasks	Status	Hype
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents	Apr 16, 2024	Fact Checking, Retrieval-augmented Generation	Code Available	7
Loki: An Open-Source Tool for Fact Verification	Oct 2, 2024	Claim Verification, Fact Checking	Code Available	5
Semantic Operators: A Declarative Model for Rich, AI-based Data Processing	Jul 16, 2024	Extreme Multi-Label Classification, Fact Checking	Code Available	5
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation	Aug 8, 2024	Chunking, Fact Checking	Code Available	4
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain	Sep 8, 2023	Fact Checking, Knowledge Graphs	Code Available	4
Verdict: A Library for Scaling Judge-Time Compute	Feb 25, 2025	Fact Checking, Hallucination	Code Available	3
Search Arena: Analyzing Search-Augmented LLMs	Jun 5, 2025	Fact Checking	Code Available	2
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking	Mar 2, 2025	Fact Checking, Fact Verification	Code Available	2
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild	Jul 4, 2024	Chart Understanding, Decision Making	Code Available	2
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs	May 9, 2024	Benchmarking, Fact Checking	Code Available	2
KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking	Apr 3, 2024	Fact Checking, Form	Code Available	2
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios	Jul 25, 2023	Code Generation, Fact Checking	Code Available	2
RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit	Jun 8, 2023	Answer Generation, Fact Checking	Code Available	2
Multimodal Automated Fact-Checking: A Survey	May 22, 2023	Fact Checking, Misinformation	Code Available	2
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models	Mar 15, 2023	Fact Checking, Hallucination	Code Available	2
Atlas: Few-shot Learning with Retrieval Augmented Language Models	Aug 5, 2022	Fact Checking, Few-Shot Learning	Code Available	2
SGPT: GPT Sentence Embeddings for Semantic Search	Feb 17, 2022	Argument Retrieval, Biomedical Information Retrieval	Code Available	2
Scaling Language Models: Methods, Analysis & Insights from Training Gopher	Dec 8, 2021	Abstract Algebra, Anachronisms	Code Available	2
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models	Apr 17, 2021	Argument Retrieval, Benchmarking	Code Available	2
The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability	Jan 28, 2020	BIG-bench Machine Learning, Fact Checking	Code Available	2
Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers	Jun 16, 2025	Fact Checking, Fact Verification	Code Available	1
Chronocept: Instilling a Sense of Time in Machines	May 12, 2025	Fact Checking, RAG	Code Available	1
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models	Feb 25, 2025	Fact Checking	Code Available	1
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking	Feb 22, 2025	Fact Checking	Code Available	1
HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims	Feb 17, 2025	Benchmarking, Fact Checking	Code Available	1
COVE: COntext and VEracity prediction for out-of-context images	Feb 3, 2025	Fact Checking, Misinformation	Code Available	1
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts	Dec 13, 2024	Claim Verification, Fact Checking	Code Available	1
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis	Nov 29, 2024	Benchmarking, Claim Verification	Code Available	1
Belief in the Machine: Investigating Epistemological Blind Spots of Language Models	Oct 28, 2024	Epistemic Reasoning, Fact Checking	Code Available	1
FIRE: Fact-checking with Iterative Retrieval and Verification	Oct 17, 2024	Claim Verification, Fact Checking	Code Available	1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims	Oct 16, 2024	Fact Checking, Language Modeling	Code Available	1
"Image, Tell me your story!" Predicting the original meta-context of visual misinformation	Aug 19, 2024	Fact Checking, Misinformation	Code Available	1
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs	Aug 6, 2024	Fact Checking	Code Available	1
Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities	Jul 10, 2024	counterfactual, Fact Checking	Code Available	1
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time	Jul 1, 2024	AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0), Fact Checking	Code Available	1
An Enhanced Fake News Detection System With Fuzzy Deep Learning	Jun 24, 2024	Deep Learning, Fact Checking	Code Available	1
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models	Jun 17, 2024	Benchmarking, Fact Checking	Code Available	1
Document-level Claim Extraction and Decontextualisation for Fact-Checking	Jun 5, 2024	Extractive Summarization, Fact Checking	Code Available	1
RATT: A Thought Structure for Coherent and Correct LLM Reasoning	Jun 4, 2024	Decision Making, Fact Checking	Code Available	1
Attribute First, then Generate: Locally-attributable Grounded Text Generation	Mar 25, 2024	Attribute, Document Summarization	Code Available	1
Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables	Feb 20, 2024	Fact Checking, Graph Neural Network	Code Available	1
LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations	Jan 23, 2024	counterfactual, Fact Checking	Code Available	1
Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers	Nov 15, 2023	Fact Checking, Sentence	Code Available	1
ChartCheck: Explainable Fact-Checking over Real-World Chart Images	Nov 13, 2023	Fact Checking, Fact Verification	Code Available	1
Massive Editing for Large Language Models via Meta Learning	Nov 8, 2023	Fact Checking, Language Modeling	Code Available	1
Detecting Deepfakes Without Seeing Any	Nov 2, 2023	DeepFake Detection, Face Swapping	Code Available	1
Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media	Oct 27, 2023	Cross-Lingual Transfer, Fact Checking	Code Available	1
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks	Oct 16, 2023	Articles, Fact Checking	Code Available	1
QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking	Oct 11, 2023	Decision Making, Fact Checking	Code Available	1
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking	Sep 15, 2023	Claim Verification, Explanation Generation	Code Available	1

Datasets: SciFact (BEIR), CLIMATE-FEVER (BEIR), FEVER (BEIR), AVeriTeC, .CDCD, LIAR2

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	monoT5-3B	nDCG@10	0.78	—	Unverified
2	SGPT-BE-5.8B	nDCG@10	0.75	—	Unverified
3	BM25+CE	nDCG@10	0.69	—	Unverified
4	SGPT-CE-6.1B	nDCG@10	0.68	—	Unverified
5	ColBERT	nDCG@10	0.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SGPT-BE-5.8B	nDCG@10	0.31	—	Unverified
2	monoT5-3B	nDCG@10	0.28	—	Unverified
3	BM25+CE	nDCG@10	0.25	—	Unverified
4	SGPT-CE-6.1B	nDCG@10	0.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	monoT5-3B	nDCG@10	0.85	—	Unverified
2	BM25+CE	nDCG@10	0.82	—	Unverified
3	SGPT-BE-5.8B	nDCG@10	0.78	—	Unverified
4	SGPT-CE-6.1B	nDCG@10	0.73	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HerO	Question Only score	0.48	—	Unverified
2	CTU AIC	Question Only score	0.46	—	Unverified
3	InFact	Question Only score	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Abc	0..5sec	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MA-CIN	Precision	0.26	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FDHN	Accuracy (Test)	0.7	—	Unverified