| MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Apr 16, 2024 | Fact CheckingRetrieval-augmented Generation | CodeCode Available | 7 |
| Loki: An Open-Source Tool for Fact Verification | Oct 2, 2024 | Claim VerificationFact Checking | CodeCode Available | 5 |
| Semantic Operators: A Declarative Model for Rich, AI-based Data Processing | Jul 16, 2024 | Extreme Multi-Label ClassificationFact Checking | CodeCode Available | 5 |
| Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Aug 8, 2024 | ChunkingFact Checking | CodeCode Available | 4 |
| Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain | Sep 8, 2023 | Fact CheckingKnowledge Graphs | CodeCode Available | 4 |
| Verdict: A Library for Scaling Judge-Time Compute | Feb 25, 2025 | Fact CheckingHallucination | CodeCode Available | 3 |
| Search Arena: Analyzing Search-Augmented LLMs | Jun 5, 2025 | Fact Checking | CodeCode Available | 2 |
| SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking | Mar 2, 2025 | Fact CheckingFact Verification | CodeCode Available | 2 |
| ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Jul 4, 2024 | Chart UnderstandingDecision Making | CodeCode Available | 2 |
| OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs | May 9, 2024 | BenchmarkingFact Checking | CodeCode Available | 2 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Apr 3, 2024 | Fact CheckingForm | CodeCode Available | 2 |
| FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios | Jul 25, 2023 | Code GenerationFact Checking | CodeCode Available | 2 |
| RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Jun 8, 2023 | Answer GenerationFact Checking | CodeCode Available | 2 |
| Multimodal Automated Fact-Checking: A Survey | May 22, 2023 | Fact CheckingMisinformation | CodeCode Available | 2 |
| SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models | Mar 15, 2023 | Fact CheckingHallucination | CodeCode Available | 2 |
| Atlas: Few-shot Learning with Retrieval Augmented Language Models | Aug 5, 2022 | Fact CheckingFew-Shot Learning | CodeCode Available | 2 |
| SGPT: GPT Sentence Embeddings for Semantic Search | Feb 17, 2022 | Argument RetrievalBiomedical Information Retrieval | CodeCode Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract AlgebraAnachronisms | CodeCode Available | 2 |
| BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models | Apr 17, 2021 | Argument RetrievalBenchmarking | CodeCode Available | 2 |
| The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability | Jan 28, 2020 | BIG-bench Machine LearningFact Checking | CodeCode Available | 2 |
| Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers | Jun 16, 2025 | Fact CheckingFact Verification | CodeCode Available | 1 |
| Chronocept: Instilling a Sense of Time in Machines | May 12, 2025 | Fact CheckingRAG | CodeCode Available | 1 |
| FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models | Feb 25, 2025 | Fact Checking | CodeCode Available | 1 |
| BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking | Feb 22, 2025 | Fact Checking | CodeCode Available | 1 |
| HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims | Feb 17, 2025 | BenchmarkingFact Checking | CodeCode Available | 1 |
| COVE: COntext and VEracity prediction for out-of-context images | Feb 3, 2025 | Fact CheckingMisinformation | CodeCode Available | 1 |
| DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts | Dec 13, 2024 | Claim VerificationFact Checking | CodeCode Available | 1 |
| Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis | Nov 29, 2024 | BenchmarkingClaim Verification | CodeCode Available | 1 |
| Belief in the Machine: Investigating Epistemological Blind Spots of Language Models | Oct 28, 2024 | Epistemic ReasoningFact Checking | CodeCode Available | 1 |
| FIRE: Fact-checking with Iterative Retrieval and Verification | Oct 17, 2024 | Claim VerificationFact Checking | CodeCode Available | 1 |
| HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Oct 16, 2024 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| "Image, Tell me your story!" Predicting the original meta-context of visual misinformation | Aug 19, 2024 | Fact CheckingMisinformation | CodeCode Available | 1 |
| OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs | Aug 6, 2024 | Fact Checking | CodeCode Available | 1 |
| Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Jul 10, 2024 | counterfactualFact Checking | CodeCode Available | 1 |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0)Fact Checking | CodeCode Available | 1 |
| An Enhanced Fake News Detection System With Fuzzy Deep Learning | Jun 24, 2024 | Deep LearningFact Checking | CodeCode Available | 1 |
| MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models | Jun 17, 2024 | BenchmarkingFact Checking | CodeCode Available | 1 |
| Document-level Claim Extraction and Decontextualisation for Fact-Checking | Jun 5, 2024 | Extractive SummarizationFact Checking | CodeCode Available | 1 |
| RATT: A Thought Structure for Coherent and Correct LLM Reasoning | Jun 4, 2024 | Decision MakingFact Checking | CodeCode Available | 1 |
| Attribute First, then Generate: Locally-attributable Grounded Text Generation | Mar 25, 2024 | AttributeDocument Summarization | CodeCode Available | 1 |
| Heterogeneous Graph Reasoning for Fact Checking over Texts and Tables | Feb 20, 2024 | Fact CheckingGraph Neural Network | CodeCode Available | 1 |
| LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations | Jan 23, 2024 | counterfactualFact Checking | CodeCode Available | 1 |
| Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers | Nov 15, 2023 | Fact CheckingSentence | CodeCode Available | 1 |
| ChartCheck: Explainable Fact-Checking over Real-World Chart Images | Nov 13, 2023 | Fact CheckingFact Verification | CodeCode Available | 1 |
| Massive Editing for Large Language Models via Meta Learning | Nov 8, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| Detecting Deepfakes Without Seeing Any | Nov 2, 2023 | DeepFake DetectionFace Swapping | CodeCode Available | 1 |
| Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media | Oct 27, 2023 | Cross-Lingual TransferFact Checking | CodeCode Available | 1 |
| Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks | Oct 16, 2023 | ArticlesFact Checking | CodeCode Available | 1 |
| QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking | Oct 11, 2023 | Decision MakingFact Checking | CodeCode Available | 1 |
| HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking | Sep 15, 2023 | Claim VerificationExplanation Generation | CodeCode Available | 1 |