SOTAVerified

Fact Checking

Papers

Showing 1-10 of 669 papers

| Title | Status | Hype |
|---|---|---|
| MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Code | 7 |
| Semantic Operators: A Declarative Model for Rich, AI-based Data Processing | Code | 5 |
| Loki: An Open-Source Tool for Fact Verification | Code | 5 |
| Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Code | 4 |
| Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain | Code | 4 |
| Verdict: A Library for Scaling Judge-Time Compute | Code | 3 |
| SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking | Code | 2 |
| Atlas: Few-shot Learning with Retrieval Augmented Language Models | Code | 2 |
| KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking | Code | 2 |
| ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Code | 2 |

Benchmark Results

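| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | monoT5-3B | nDCG@10 | 0.78 | | Unverified |
| 2 | SGPT-BE-5.8B | nDCG@10 | 0.75 | | Unverified |
| 3 | BM25+CE | nDCG@10 | 0.69 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.68 | | Unverified |
| 5 | ColBERT | nDCG@10 | 0.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SGPT-BE-5.8B | nDCG@10 | 0.31 | | Unverified |
| 2 | monoT5-3B | nDCG@10 | 0.28 | | Unverified |
| 3 | BM25+CE | nDCG@10 | 0.25 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.16 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | monoT5-3B | nDCG@10 | 0.85 | | Unverified |
| 2 | BM25+CE | nDCG@10 | 0.82 | | Unverified |
| 3 | SGPT-BE-5.8B | nDCG@10 | 0.78 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.73 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HerO | Question Only score | 0.48 | | Unverified |
| 2 | CTU AIC | Question Only score | 0.46 | | Unverified |
| 3 | InFact | Question Only score | 0.45 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Abc | 0..5sec | 2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MA-CIN | Precision | 0.26 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FDHN | Accuracy (Test) | 0.7 | | Unverified |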