SOTAVerified

Fact Checking

Papers

Showing 351-375 of 669 papers

| Title | Status | Hype |
|---|---|---|
| Ask To The Point: Open-Domain Entity-Centric Question Generation | Code | 0 |
| Explaining Interactions Between Text Spans | Code | 0 |
| The Perils & Promises of Fact-checking with Large Language Models | | 0 |
| Optimizing Retrieval-augmented Reader Models via Token Elimination | Code | 0 |
| Large Language Models Help Humans Verify Truthfulness -- Except When They Are Convincingly Wrong | | 0 |
| Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation | | 0 |
| Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking | | 0 |
| AutoHall: Automated Hallucination Dataset Generation for Large Language Models | | 0 |
| Prompt, Condition, and Generate: Classification of Unsupported Claims with In-Context Learning | | 0 |
| Leveraging Social Discourse to Measure Check-worthiness of Claims for Fact-checking | | 0 |
| X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs | Code | 0 |
| Learning Source Biases: Multisource Misspecifications and Their Impact on Predictions | | 0 |
| Zero-shot Audio Topic Reranking using Large Language Models | | 0 |
| Analysis of Disinformation and Fake News Detection Using Fine-Tuned Large Language Model | | 0 |
| FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking | | 0 |
| Detecting Out-of-Context Image-Caption Pairs in News: A Counter-Intuitive Method | Code | 0 |
| Designing and Evaluating Presentation Strategies for Fact-Checked Content | Code | 0 |
| Position: Key Claims in LLM Research Have a Long Tail of Footnotes | | 0 |
| Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI | | 0 |
| Breaking Language Barriers with MMTweets: Advancing Cross-Lingual Debunked Narrative Retrieval for Fact-Checking | | 0 |
| Fact-Checking Generative AI: Ontology-Driven Biological Graphs for Disease-Gene Link Verification | | 0 |
| Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text | Code | 0 |
| Fact-Checking of AI-Generated Reports | | 0 |
| MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question Answering | Code | 0 |
| ChatGPT vs. Google: A Comparative Study of Search Performance and User Experience | | 0 |

Benchmark Results

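| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | monoT5-3B | nDCG@10 | 0.78 | | Unverified |
| 2 | SGPT-BE-5.8B | nDCG@10 | 0.75 | | Unverified |
| 3 | BM25+CE | nDCG@10 | 0.69 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.68 | | Unverified |
| 5 | ColBERT | nDCG@10 | 0.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SGPT-BE-5.8B | nDCG@10 | 0.31 | | Unverified |
| 2 | monoT5-3B | nDCG@10 | 0.28 | | Unverified |
| 3 | BM25+CE | nDCG@10 | 0.25 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.16 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | monoT5-3B | nDCG@10 | 0.85 | | Unverified |
| 2 | BM25+CE | nDCG@10 | 0.82 | | Unverified |
| 3 | SGPT-BE-5.8B | nDCG@10 | 0.78 | | Unverified |
| 4 | SGPT-CE-6.1B | nDCG@10 | 0.73 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HerO | Question Only score | 0.48 | | Unverified |
| 2 | CTU AIC | Question Only score | 0.46 | | Unverified |
| 3 | InFact | Question Only score | 0.45 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Abc | 0..5sec | 2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MA-CIN | Precision | 0.26 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FDHN | Accuracy (Test) | 0.7 | | Unverified |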