| MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Apr 16, 2024 | Fact Checking, Retrieval-augmented Generation | Code Available | 7 |
| Loki: An Open-Source Tool for Fact Verification | Oct 2, 2024 | Claim Verification, Fact Checking | Code Available | 5 |
| Semantic Operators: A Declarative Model for Rich, AI-based Data Processing | Jul 16, 2024 | Extreme Multi-Label Classification, Fact Checking | Code Available | 5 |
| Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Aug 8, 2024 | Chunking, Fact Checking | Code Available | 4 |
| Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain | Sep 8, 2023 | Fact Checking, Knowledge Graphs | Code Available | 4 |
| Verdict: A Library for Scaling Judge-Time Compute | Feb 25, 2025 | Fact Checking, Hallucination | Code Available | 3 |
| Search Arena: Analyzing Search-Augmented LLMs | Jun 5, 2025 | Fact Checking | Code Available | 2 |
| SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking | Mar 2, 2025 | Fact Checking, Fact Verification | Code Available | 2 |
| ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Jul 4, 2024 | Chart Understanding, Decision Making | Code Available | 2 |
| OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs | May 9, 2024 | Benchmarking, Fact Checking | Code Available | 2 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Apr 3, 2024 | Fact Checking, Form | Code Available | 2 |
| FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios | Jul 25, 2023 | Code Generation, Fact Checking | Code Available | 2 |
| RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Jun 8, 2023 | Answer Generation, Fact Checking | Code Available | 2 |
| Multimodal Automated Fact-Checking: A Survey | May 22, 2023 | Fact Checking, Misinformation | Code Available | 2 |
| SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models | Mar 15, 2023 | Fact Checking, Hallucination | Code Available | 2 |
| Atlas: Few-shot Learning with Retrieval Augmented Language Models | Aug 5, 2022 | Fact Checking, Few-Shot Learning | Code Available | 2 |
| SGPT: GPT Sentence Embeddings for Semantic Search | Feb 17, 2022 | Argument Retrieval, Biomedical Information Retrieval | Code Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 |
| BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models | Apr 17, 2021 | Argument Retrieval, Benchmarking | Code Available | 2 |
| The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability | Jan 28, 2020 | BIG-bench Machine Learning, Fact Checking | Code Available | 2 |
| Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers | Jun 16, 2025 | Fact Checking, Fact Verification | Code Available | 1 |
| Chronocept: Instilling a Sense of Time in Machines | May 12, 2025 | Fact Checking, RAG | Code Available | 1 |
| FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models | Feb 25, 2025 | Fact Checking | Code Available | 1 |
| BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking | Feb 22, 2025 | Fact Checking | Code Available | 1 |
| HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims | Feb 17, 2025 | Benchmarking, Fact Checking | Code Available | 1 |