SOTAVerified

Fact Checking

Papers

Showing 301350 of 669 papers

TitleStatusHype
News Verifiers Showdown: A Comparative Performance Evaluation of ChatGPT 3.5, ChatGPT 4.0, Bing AI, and Bard in News Fact-Checking0
RETA-LLM: A Retrieval-Augmented Large Language Model ToolkitCode2
Early Rumor Detection Using Neural Hawkes Process with a New Benchmark DatasetCode1
bgGLUE: A Bulgarian General Language Understanding Evaluation BenchmarkCode0
Check-COVID: Fact-Checking COVID-19 News Claims with Scientific EvidenceCode0
Scientific Fact-Checking: A Survey of Resources and Approaches0
Give Me More Details: Improving Fact-Checking with Latent Retrieval0
Detecting Check-Worthy Claims in Political Debates, Speeches, and Interviews Using Audio DataCode0
OverPrompt: Enhancing ChatGPT through Efficient In-Context LearningCode0
SAIL: Search-Augmented Instruction Learning0
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language ModelsCode1
Knowledge Graphs Querying0
Enhancing Large Language Models Against Inductive Instructions with Dual-critique PromptingCode0
ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media0
AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the WebCode1
Multimodal Automated Fact-Checking: A SurveyCode2
Fact-Checking Complex Claims with Program-Guided ReasoningCode1
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific TablesCode1
Complex Claim Verification with Evidence Retrieved in the WildCode1
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing0
Bridging History with AI A Comparative Evaluation of GPT 3.5, GPT4, and GoogleBARD in Predictive Accuracy and Fact Checking0
aedFaCT: Scientific Fact-Checking Made Easier via Semi-Automatic Discovery of Relevant Expert OpinionsCode0
Automatic Evaluation of Attribution by Large Language ModelsCode1
FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering0
NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-CheckingCode0
Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive TasksCode1
The Intended Uses of Automated Fact-Checking Artefacts: Why, How and WhoCode0
Toxic comments reduce the activity of volunteer editors on Wikipedia0
Using Multiple RDF Knowledge Graphs for Enriching ChatGPT Responses0
An Entity-based Claim Extraction Pipeline for Real-world Biomedical Fact-checking0
Factify 2: A Multimodal Fake News and Satire News DatasetCode1
Interpretable Unified Language CheckingCode1
Accuracy and Political Bias of News Source Credibility Ratings by Large Language ModelsCode1
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language ModelsCode2
Verifying the Robustness of Automatic Credibility AssessmentCode0
Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine MisinformationCode1
WiCE: Real-World Entailment for Claims in WikipediaCode1
PANACEA: An Automated Misinformation Detection System on COVID-190
Implicit Temporal Reasoning for Evidence-Based Fact-CheckingCode0
COVID-VTS: Fact Extraction and Verification on Short Video PlatformsCode1
Reading and Reasoning over Chart Images for Evidence-based Automated Fact-Checking0
Predicting Sentence-Level Factuality of News and Bias of Media OutletsCode1
Logically at Factify 2: A Multi-Modal Fact Checking System Based on Evidence Retrieval techniques and Transformer Encoder Architecture0
The State of Human-centered NLP Technology for Fact-checking0
Human-in-the-loop Evaluation for Early Misinformation Detection: A Case Study of COVID-19 TreatmentsCode0
Check-worthy Claim Detection across Topics for Automated Fact-checking0
A Modality-level Explainable Framework for Misinformation Checking in Social Networks0
CliMedBERT: A Pre-trained Language Model for Climate and Health-related Text0
Autonomation, not Automation: Activities and Needs of Fact-checkers as a Basis for Designing Human-Centered AI Systems0
Did They Really Tweet That? Querying Fact-Checking Sites and Politwoops to Determine Tweet Misattribution0
Show:102550
← PrevPage 7 of 14Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified