| PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants | Jul 21, 2025 | Fact Checking | Unverified | 0 |
| DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact Verification | Jul 8, 2025 | ARC, Arithmetic Reasoning | Code Available | 0 |
| Recon, Answer, Verify: Agents in Search of Truth | Jul 4, 2025 | Fact Checking | Unverified | 0 |
| Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine | Jun 25, 2025 | Fact Checking, Navigate | Code Available | 0 |
| The Next Phase of Scientific Fact-Checking: Advanced Evidence Retrieval from Complex Structured Academic Papers | Jun 25, 2025 | Fact Checking, Retrieval | Unverified | 0 |
| Veracity: An Open-Source AI Fact-Checking System | Jun 18, 2025 | Fact Checking, Misinformation | Unverified | 0 |
| Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers | Jun 16, 2025 | Fact Checking, Fact Verification | Code Available | 1 |
| SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists | Jun 16, 2025 | Fact Checking, TAG | Unverified | 0 |
| RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking | Jun 14, 2025 | Explanation Generation, Fact Checking | Code Available | 0 |
| In Crowd Veritas: Leveraging Human Intelligence To Fight Misinformation | Jun 10, 2025 | Fact Checking, Misinformation | Unverified | 0 |
| ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific Charts | Jun 10, 2025 | Fact Checking, Fact Verification | Code Available | 0 |
| Combating Misinformation in the Arab World: Challenges & Opportunities | Jun 5, 2025 | Diversity, Fact Checking | Unverified | 0 |
| Search Arena: Analyzing Search-Augmented LLMs | Jun 5, 2025 | Fact Checking | Code Available | 2 |
| SUCEA: Reasoning-Intensive Retrieval for Adversarial Fact-checking through Claim Decomposition and Editing | Jun 5, 2025 | Fact Checking, Misinformation | Code Available | 0 |
| Facts are Harder Than Opinions -- A Multilingual, Comparative Analysis of LLM-Based Fact-Checking Reliability | Jun 4, 2025 | Diversity, Fact Checking | Unverified | 0 |
| Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs | May 30, 2025 | Fact Checking, Hallucination | Unverified | 0 |
| Verify-in-the-Graph: Entity Disambiguation Enhancement for Complex Claim Verification with Interactive Graph Representation | May 29, 2025 | Claim Verification, Entity Disambiguation | Unverified | 0 |
| Community Moderation and the New Epistemology of Fact Checking on Social Media | May 26, 2025 | Fact Checking, Misinformation | Unverified | 0 |
| From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation | May 24, 2025 | Articles, Benchmarking | Unverified | 0 |
| Social Good or Scientific Curiosity? Uncovering the Research Framing Behind NLP Artefacts | May 24, 2025 | Fact Checking, Hate Speech Detection | Unverified | 0 |
| Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models | May 24, 2025 | Binary Classification, Ethics | Unverified | 0 |
| Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection | May 23, 2025 | Fact Checking, Hallucination | Unverified | 0 |
| Resolving Conflicting Evidence in Automated Fact-Checking: A Study on Retrieval-Augmented LLMs | May 23, 2025 | Fact Checking, RAG | Code Available | 0 |
| EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions | May 22, 2025 | Claim Verification, Fact Checking | Code Available | 0 |
| CUB: Benchmarking Context Utilisation Techniques for Language Models | May 22, 2025 | Benchmarking, Fact Checking | Unverified | 0 |
| Improving the fact-checking performance of language models by relying on their entailment ability | May 21, 2025 | Fact Checking, Fact Verification | Unverified | 0 |
| UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking | May 21, 2025 | Benchmarking, Claim Verification | Code Available | 0 |
| MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | May 20, 2025 | Fact Checking, Hallucination | Code Available | 0 |
| Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation | May 18, 2025 | Fact Checking, Form | Unverified | 0 |
| SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval | May 15, 2025 | Fact Checking, Retrieval | Unverified | 0 |
| FACTors: A New Dataset for Studying the Fact-checking Ecosystem | May 14, 2025 | Fact Checking | Code Available | 0 |
| Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reasoning? | May 13, 2025 | Chart Question Answering, Fact Checking | Code Available | 0 |
| Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information | May 13, 2025 | Articles, Fact Checking | Unverified | 0 |
| SciCom Wiki: Fact-Checking and FAIR Knowledge Distribution for Scientific Videos and Podcasts | May 12, 2025 | Fact Checking, Knowledge Graphs | Unverified | 0 |
| Computational Fact-Checking of Online Discourse: Scoring scientific accuracy in climate change related news articles | May 12, 2025 | Articles, Fact Checking | Unverified | 0 |
| Chronocept: Instilling a Sense of Time in Machines | May 12, 2025 | Fact Checking, RAG | Code Available | 1 |
| TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking | May 11, 2025 | Fact Checking, Few-Shot Learning | Unverified | 0 |
| Holmes: Automated Fact Check with Large Language Models | May 6, 2025 | Fact Checking, Retrieval | Unverified | 0 |
| A Generative-AI-Driven Claim Retrieval System Capable of Detecting and Retrieving Claims from Social Media Platforms in Multiple Languages | Apr 29, 2025 | Fact Checking | Code Available | 0 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim Verification, Fact Checking | Code Available | 0 |
| Pushing the boundary on Natural Language Inference | Apr 25, 2025 | Fact Checking, Information Retrieval | Unverified | 0 |
| Assessing the Potential of Generative Agents in Crowdsourced Fact-Checking | Apr 24, 2025 | Decision Making, Fact Checking | Unverified | 0 |
| PASS-FC: Progressive and Adaptive Search Scheme for Fact Checking of Comprehensive Claims | Apr 14, 2025 | Fact Checking, General Knowledge | Unverified | 0 |
| OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens | Apr 9, 2025 | Fact Checking, Hallucination | Unverified | 0 |
| BOOST: Bootstrapping Strategy-Driven Reasoning Programs for Program-Guided Fact-Checking | Apr 3, 2025 | Claim Verification, Diversity | Unverified | 0 |
| If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs | Mar 30, 2025 | Fact Checking, Lifelong learning | Unverified | 0 |
| Understanding Inequality of LLM Fact-Checking over Geographic Regions with Agent and Retrieval models | Mar 28, 2025 | Fact Checking, General Knowledge | Unverified | 0 |
| MultiClaimNet: A Massively Multilingual Dataset of Fact-Checked Claim Clusters | Mar 28, 2025 | Clustering, Fact Checking | Unverified | 0 |
| Fact-checking AI-generated news reports: Can LLMs catch their own lies? | Mar 24, 2025 | Diagnostic, Fact Checking | Unverified | 0 |
| Can LLMs Automate Fact-Checking Article Writing? | Mar 22, 2025 | Articles, Fact Checking | Unverified | 0 |