SOTAVerified

Fact Checking

Papers

Showing 276300 of 669 papers

TitleStatusHype
Learning Source Biases: Multisource Misspecifications and Their Impact on Predictions0
Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation GenerationCode1
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-CheckingCode1
Zero-shot Audio Topic Reranking using Large Language Models0
Analysis of Disinformation and Fake News Detection Using Fine-Tuned Large Language Model0
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical DomainCode4
FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking0
Detecting Out-of-Context Image-Caption Pairs in News: A Counter-Intuitive MethodCode0
Benchmarking the Generation of Fact Checking ExplanationsCode1
Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge NeuronsCode1
Fact-checking information from large language models can decrease headline discernmentCode1
Designing and Evaluating Presentation Strategies for Fact-Checked ContentCode0
Position: Key Claims in LLM Research Have a Long Tail of Footnotes0
Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI0
Breaking Language Barriers with MMTweets: Advancing Cross-Lingual Debunked Narrative Retrieval for Fact-Checking0
Fact-Checking Generative AI: Ontology-Driven Biological Graphs for Disease-Gene Link Verification0
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from TextCode0
Fact-Checking of AI-Generated Reports0
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain ScenariosCode2
MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question AnsweringCode0
Fraunhofer SIT at CheckThat! 2023: Tackling Classification Uncertainty Using Model Souping on the Example of Check-Worthiness Classification0
ChatGPT vs. Google: A Comparative Study of Search Performance and User Experience0
Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets0
3HAN: A Deep Neural Network for Fake News DetectionCode1
Hallucination is the last thing you need0
Show:102550
← PrevPage 12 of 27Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified