SOTAVerified

Fact Checking

Papers

Showing 376400 of 669 papers

TitleStatusHype
Fraunhofer SIT at CheckThat! 2023: Tackling Classification Uncertainty Using Model Souping on the Example of Check-Worthiness Classification0
Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets0
Hallucination is the last thing you need0
News Verifiers Showdown: A Comparative Performance Evaluation of ChatGPT 3.5, ChatGPT 4.0, Bing AI, and Bard in News Fact-Checking0
bgGLUE: A Bulgarian General Language Understanding Evaluation BenchmarkCode0
Check-COVID: Fact-Checking COVID-19 News Claims with Scientific EvidenceCode0
Scientific Fact-Checking: A Survey of Resources and Approaches0
Give Me More Details: Improving Fact-Checking with Latent Retrieval0
OverPrompt: Enhancing ChatGPT through Efficient In-Context LearningCode0
Detecting Check-Worthy Claims in Political Debates, Speeches, and Interviews Using Audio DataCode0
SAIL: Search-Augmented Instruction Learning0
ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media0
Knowledge Graphs Querying0
Enhancing Large Language Models Against Inductive Instructions with Dual-critique PromptingCode0
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing0
Bridging History with AI A Comparative Evaluation of GPT 3.5, GPT4, and GoogleBARD in Predictive Accuracy and Fact Checking0
aedFaCT: Scientific Fact-Checking Made Easier via Semi-Automatic Discovery of Relevant Expert OpinionsCode0
FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering0
NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-CheckingCode0
The Intended Uses of Automated Fact-Checking Artefacts: Why, How and WhoCode0
Toxic comments reduce the activity of volunteer editors on Wikipedia0
Using Multiple RDF Knowledge Graphs for Enriching ChatGPT Responses0
An Entity-based Claim Extraction Pipeline for Real-world Biomedical Fact-checking0
Verifying the Robustness of Automatic Credibility AssessmentCode0
PANACEA: An Automated Misinformation Detection System on COVID-190
Show:102550
← PrevPage 16 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified