Fact Checking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 669 papers

Title	Date	Tasks	Status	Hype	Score
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents	Apr 16, 2024	Fact CheckingRetrieval-augmented Generation	CodeCode Available	7	5
Loki: An Open-Source Tool for Fact Verification	Oct 2, 2024	Claim VerificationFact Checking	CodeCode Available	5	5
Semantic Operators: A Declarative Model for Rich, AI-based Data Processing	Jul 16, 2024	Extreme Multi-Label ClassificationFact Checking	CodeCode Available	5	5
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation	Aug 8, 2024	ChunkingFact Checking	CodeCode Available	4	5
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain	Sep 8, 2023	Fact CheckingKnowledge Graphs	CodeCode Available	4	5
Verdict: A Library for Scaling Judge-Time Compute	Feb 25, 2025	Fact CheckingHallucination	CodeCode Available	3	5
Search Arena: Analyzing Search-Augmented LLMs	Jun 5, 2025	Fact Checking	CodeCode Available	2	5
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild	Jul 4, 2024	Chart UnderstandingDecision Making	CodeCode Available	2	5
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models	Mar 15, 2023	Fact CheckingHallucination	CodeCode Available	2	5
RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit	Jun 8, 2023	Answer GenerationFact Checking	CodeCode Available	2	5
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios	Jul 25, 2023	Code GenerationFact Checking	CodeCode Available	2	5
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking	Mar 2, 2025	Fact CheckingFact Verification	CodeCode Available	2	5
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models	Apr 17, 2021	Argument RetrievalBenchmarking	CodeCode Available	2	5
Scaling Language Models: Methods, Analysis & Insights from Training Gopher	Dec 8, 2021	Abstract AlgebraAnachronisms	CodeCode Available	2	5
KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking	Apr 3, 2024	Fact CheckingForm	CodeCode Available	2	5
Atlas: Few-shot Learning with Retrieval Augmented Language Models	Aug 5, 2022	Fact CheckingFew-Shot Learning	CodeCode Available	2	5
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs	May 9, 2024	BenchmarkingFact Checking	CodeCode Available	2	5
SGPT: GPT Sentence Embeddings for Semantic Search	Feb 17, 2022	Argument RetrievalBiomedical Information Retrieval	CodeCode Available	2	5
Multimodal Automated Fact-Checking: A Survey	May 22, 2023	Fact CheckingMisinformation	CodeCode Available	2	5
The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability	Jan 28, 2020	BIG-bench Machine LearningFact Checking	CodeCode Available	2	5
Claim Check-Worthiness Detection as Positive Unlabelled Learning	Mar 5, 2020	Fact CheckingRumour Detection	CodeCode Available	1	5
ChartCheck: Explainable Fact-Checking over Real-World Chart Images	Nov 13, 2023	Fact CheckingFact Verification	CodeCode Available	1	5
Factify 2: A Multimodal Fake News and Satire News Dataset	Apr 8, 2023	ArticlesClaim Verification	CodeCode Available	1	5
Explainable Automated Fact-Checking: A Survey	Nov 7, 2020	Fact CheckingSurvey	CodeCode Available	1	5
AmbiFC: Fact-Checking Ambiguous Claims with Evidence	Apr 1, 2021	Claim VerificationEvidence Selection	CodeCode Available	1	5
Explainable Automated Fact-Checking for Public Health Claims	Oct 19, 2020	Explanation GenerationFact Checking	CodeCode Available	1	5
Evidence-based Fact-Checking of Health-related Claims	Nov 1, 2021	ArticlesFact Checking	CodeCode Available	1	5
CheckThat! at CLEF 2020: Enabling the Automatic Identification and Verification of Claims in Social Media	Jan 21, 2020	Fact CheckingTask 2	CodeCode Available	1	5
Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers	Nov 15, 2023	Fact CheckingSentence	CodeCode Available	1	5
Fact-Checking Complex Claims with Program-Guided Reasoning	May 22, 2023	Fact CheckingIn-Context Learning	CodeCode Available	1	5
Evaluating the Factual Consistency of Abstractive Text Summarization	Oct 28, 2019	Abstractive Text SummarizationFact Checking	CodeCode Available	1	5
Evidence-based Factual Error Correction	Jun 2, 2021	Fact CheckingFact Verification	CodeCode Available	1	5
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models	Feb 25, 2025	Fact Checking	CodeCode Available	1	5
Early Rumor Detection Using Neural Hawkes Process with a New Benchmark Dataset	Jun 5, 2023	Fact Checking	CodeCode Available	1	5
Automatic Fake News Detection: Are Models Learning to Reason?	May 17, 2021	Fact CheckingFake News Detection	CodeCode Available	1	5
Editing Factual Knowledge in Language Models	Apr 16, 2021	Fact CheckingMeta-Learning	CodeCode Available	1	5
DialFact: A Benchmark for Fact-Checking in Dialogue	Oct 15, 2021	Claim VerificationFact Checking	CodeCode Available	1	5
An Enhanced Fake News Detection System With Fuzzy Deep Learning	Jun 24, 2024	Deep LearningFact Checking	CodeCode Available	1	5
Attribute First, then Generate: Locally-attributable Grounded Text Generation	Mar 25, 2024	AttributeDocument Summarization	CodeCode Available	1	5
Belief in the Machine: Investigating Epistemological Blind Spots of Language Models	Oct 28, 2024	Epistemic ReasoningFact Checking	CodeCode Available	1	5
Detecting Deepfakes Without Seeing Any	Nov 2, 2023	DeepFake DetectionFace Swapping	CodeCode Available	1	5
Automatic Evaluation of Attribution by Large Language Models	May 10, 2023	Fact CheckingLanguage Modeling	CodeCode Available	1	5
DirectQuote: A Dataset for Direct Quotation Extraction and Attribution in News Articles	Oct 15, 2021	ArticlesAttribute	CodeCode Available	1	5
AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web	May 22, 2023	Claim VerificationFact Checking	CodeCode Available	1	5
AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset	May 7, 2021	ArticlesDialect Identification	CodeCode Available	1	5
Document-level Claim Extraction and Decontextualisation for Fact-Checking	Jun 5, 2024	Extractive SummarizationFact Checking	CodeCode Available	1	5
End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and Models	May 25, 2022	ArticlesClaim Verification	CodeCode Available	1	5
Benchmarking the Generation of Fact Checking Explanations	Aug 29, 2023	Abstractive Text SummarizationArticles	CodeCode Available	1	5
BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking	Feb 22, 2025	Fact Checking	CodeCode Available	1	5
COVID-VTS: Fact Extraction and Verification on Short Video Platforms	Feb 15, 2023	Fact CheckingFact Selection	CodeCode Available	1	5

Show:10 25 50

← PrevPage 1 of 14Next →

All datasets SciFact (BEIR)CLIMATE-FEVER (BEIR)FEVER (BEIR)AVeriTeC .CDCD LIAR2

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	monoT5-3B	nDCG@10	0.78	—	Unverified
2	SGPT-BE-5.8B	nDCG@10	0.75	—	Unverified
3	BM25+CE	nDCG@10	0.69	—	Unverified
4	SGPT-CE-6.1B	nDCG@10	0.68	—	Unverified
5	ColBERT	nDCG@10	0.67	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SGPT-BE-5.8B	nDCG@10	0.31	—	Unverified
2	monoT5-3B	nDCG@10	0.28	—	Unverified
3	BM25+CE	nDCG@10	0.25	—	Unverified
4	SGPT-CE-6.1B	nDCG@10	0.16	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	monoT5-3B	nDCG@10	0.85	—	Unverified
2	BM25+CE	nDCG@10	0.82	—	Unverified
3	SGPT-BE-5.8B	nDCG@10	0.78	—	Unverified
4	SGPT-CE-6.1B	nDCG@10	0.73	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HerO	Question Only score	0.48	—	Unverified
2	CTU AIC	Question Only score	0.46	—	Unverified
3	InFact	Question Only score	0.45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Abc	0..5sec	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MA-CIN	Precision	0.26	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FDHN	Accuracy (Test)	0.7	—	Unverified