SOTAVerified

Fact Checking

Papers

Showing 351375 of 669 papers

TitleStatusHype
MultiFC: A Real-World Multi-Domain Dataset for Evidence-Based Fact Checking of Claims0
Multimodal Large Language Models to Support Real-World Fact-Checking0
Multimodal Misinformation Detection by Learning from Synthetic Data with Multimodal LLMs0
Multimodal Misinformation Detection using Large Vision-Language Models0
Multi-task Retrieval for Knowledge-Intensive Tasks0
Natural Language Deduction through Search over Statement Compositions0
Neural Check-Worthiness Ranking with Weak Supervision: Finding Sentences for Fact-Checking0
Neural Machine Translation for Fact-checking Temporal Claims0
Neural Re-rankers for Evidence Retrieval in the FEVEROUS Task0
New contexts, old heuristics: How young people in India and the US trust online content in the age of generative AI0
News Verifiers Showdown: A Comparative Performance Evaluation of ChatGPT 3.5, ChatGPT 4.0, Bing AI, and Bard in News Fact-Checking0
Numerically Grounded Language Models for Semantic Error Correction0
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens0
Online Continual Knowledge Learning for Language Models0
On Representation Learning for Scientific News Articles Using Heterogeneous Knowledge Graphs0
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources0
Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports0
Overview of the CLEF-2019 CheckThat!: Automatic Identification and Verification of Claims0
Overview of the CLEF--2021 CheckThat! Lab on Detecting Check-Worthy Claims, Previously Fact-Checked Claims, and Fake News0
Overview of the GermEval 2021 Shared Task on the Identification of Toxic, Engaging, and Fact-Claiming Comments0
PANACEA: An Automated Misinformation Detection System on COVID-190
PASS-FC: Progressive and Adaptive Search Scheme for Fact Checking of Comprehensive Claims0
PerCQA: Persian Community Question Answering Dataset0
PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants0
Poly-FEVER: A Multilingual Fact Verification Benchmark for Hallucination Detection in Large Language Models0
Show:102550
← PrevPage 15 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified