| Debate-to-Detect: Reformulating Misinformation Detection as a Real-World Debate with Large Language Models | May 24, 2025 | Binary Classification, Ethics | Unverified | 0 |
| Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection | May 23, 2025 | Fact Checking, Hallucination | Unverified | 0 |
| Resolving Conflicting Evidence in Automated Fact-Checking: A Study on Retrieval-Augmented LLMs | May 23, 2025 | Fact Checking, RAG | Code Available | 0 |
| EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions | May 22, 2025 | Claim Verification, Fact Checking | Code Available | 0 |
| CUB: Benchmarking Context Utilisation Techniques for Language Models | May 22, 2025 | Benchmarking, Fact Checking | Unverified | 0 |
| Improving the fact-checking performance of language models by relying on their entailment ability | May 21, 2025 | Fact Checking, Fact Verification | Unverified | 0 |
| UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking | May 21, 2025 | Benchmarking, Claim Verification | Code Available | 0 |
| MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | May 20, 2025 | Fact Checking, Hallucination | Code Available | 0 |
| Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation | May 18, 2025 | Fact Checking, Form | Unverified | 0 |
| SemEval-2025 Task 7: Multilingual and Crosslingual Fact-Checked Claim Retrieval | May 15, 2025 | Fact Checking, Retrieval | Unverified | 0 |