SOTAVerified

Fact Checking

Papers

Showing 301325 of 669 papers

TitleStatusHype
Evaluating the Performance of Large Language Models in Scientific Claim Detection and Classification0
Catching Chameleons: Detecting Evolving Disinformation Generated using Large Language Models0
Evaluating open-source Large Language Models for automated fact-checking0
Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation0
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting0
Can We Spot the "Fake News" Before It Was Even Written?0
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate0
A High Precision Pipeline for Financial Knowledge Graph Construction0
Ev2R: Evaluating Evidence Retrieval in Automated Fact-Checking0
Can LLMs Automate Fact-Checking Article Writing?0
Fact-Checking Generative AI: Ontology-Driven Biological Graphs for Disease-Gene Link Verification0
Entity-based Claim Representation Improves Fact-Checking of Medical Content in Tweets0
Can Knowledge Graph Embeddings Tell Us What Fact-checked Claims Are About?0
Can Community Notes Replace Professional Fact-Checkers?0
Entanglement: Balancing Punishment and Compensation, Repeated Dilemma Game-Theoretic Analysis of Maximum Compensation Problem for Bypass and Least Cost Paths in Fact-Checking, Case of Fake News with Weak Wallace's Law0
Bridging History with AI A Comparative Evaluation of GPT 3.5, GPT4, and GoogleBARD in Predictive Accuracy and Fact Checking0
A Semantics-Aware Approach to Automated Claim Verification0
A Graph-based Verification Framework for Fact-Checking0
Surprising Efficacy of Fine-Tuned Transformers for Fact-Checking over Larger Language Models0
BRENDA: Browser Extension for Fake News Detection0
BREAKING! Presenting Fake News Corpus for Automated Fact Checking0
BOOST: Bootstrapping Strategy-Driven Reasoning Programs for Program-Guided Fact-Checking0
Artificial Intelligence in Deliberation: The AI Penalty and the Emergence of a New Deliberative Divide0
Aggregating Pairwise Semantic Differences for Few-Shot Claim Veracity Classification0
A Context-Aware Approach for Detecting Check-Worthy Claims in Political Debates0
Show:102550
← PrevPage 13 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified