SOTAVerified

Fact Checking

Papers

Showing 251300 of 669 papers

TitleStatusHype
Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkersCode1
Automated Fact-Checking in Dialogue: Are Specialized Models Needed?0
Fine-tuning Language Models for Factuality0
ChartCheck: Explainable Fact-Checking over Real-World Chart ImagesCode1
Trusted Source Alignment in Large Language Models0
Massive Editing for Large Language Models via Meta LearningCode1
Causal Question Answering with Reinforcement LearningCode0
Detecting Deepfakes Without Seeing AnyCode1
Lost in Translation -- Multilingual Misinformation and its Evolution0
Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social MediaCode1
From Chaos to Clarity: Claim Normalization to Empower Fact-CheckingCode0
Right, No Matter Why: AI Fact-checking and AI Authority in Health-related Inquiry Settings0
Ask To The Point: Open-Domain Entity-Centric Question GenerationCode0
Optimizing Retrieval-augmented Reader Models via Token EliminationCode0
The Perils & Promises of Fact-checking with Large Language Models0
Explaining Interactions Between Text SpansCode0
Large Language Models Help Humans Verify Truthfulness -- Except When They Are Convincingly Wrong0
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style AttacksCode1
Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation0
Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking0
QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-CheckingCode1
AutoHall: Automated Hallucination Dataset Generation for Large Language Models0
Prompt, Condition, and Generate: Classification of Unsupported Claims with In-Context Learning0
Leveraging Social Discourse to Measure Check-worthiness of Claims for Fact-checking0
X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across ParagraphsCode0
Learning Source Biases: Multisource Misspecifications and Their Impact on Predictions0
Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation GenerationCode1
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-CheckingCode1
Zero-shot Audio Topic Reranking using Large Language Models0
Analysis of Disinformation and Fake News Detection Using Fine-Tuned Large Language Model0
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical DomainCode4
FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking0
Detecting Out-of-Context Image-Caption Pairs in News: A Counter-Intuitive MethodCode0
Benchmarking the Generation of Fact Checking ExplanationsCode1
Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge NeuronsCode1
Fact-checking information from large language models can decrease headline discernmentCode1
Designing and Evaluating Presentation Strategies for Fact-Checked ContentCode0
Position: Key Claims in LLM Research Have a Long Tail of Footnotes0
Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI0
Breaking Language Barriers with MMTweets: Advancing Cross-Lingual Debunked Narrative Retrieval for Fact-Checking0
Fact-Checking Generative AI: Ontology-Driven Biological Graphs for Disease-Gene Link Verification0
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from TextCode0
Fact-Checking of AI-Generated Reports0
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain ScenariosCode2
MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question AnsweringCode0
Fraunhofer SIT at CheckThat! 2023: Tackling Classification Uncertainty Using Model Souping on the Example of Check-Worthiness Classification0
ChatGPT vs. Google: A Comparative Study of Search Performance and User Experience0
Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets0
3HAN: A Deep Neural Network for Fake News DetectionCode1
Hallucination is the last thing you need0
Show:102550
← PrevPage 6 of 14Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified