SOTAVerified

Fact Checking

Papers

Showing 351400 of 669 papers

TitleStatusHype
Ask To The Point: Open-Domain Entity-Centric Question GenerationCode0
Explaining Interactions Between Text SpansCode0
The Perils & Promises of Fact-checking with Large Language Models0
Optimizing Retrieval-augmented Reader Models via Token EliminationCode0
Large Language Models Help Humans Verify Truthfulness -- Except When They Are Convincingly Wrong0
Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation0
Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking0
AutoHall: Automated Hallucination Dataset Generation for Large Language Models0
Prompt, Condition, and Generate: Classification of Unsupported Claims with In-Context Learning0
Leveraging Social Discourse to Measure Check-worthiness of Claims for Fact-checking0
X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across ParagraphsCode0
Learning Source Biases: Multisource Misspecifications and Their Impact on Predictions0
Zero-shot Audio Topic Reranking using Large Language Models0
Analysis of Disinformation and Fake News Detection Using Fine-Tuned Large Language Model0
FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking0
Detecting Out-of-Context Image-Caption Pairs in News: A Counter-Intuitive MethodCode0
Designing and Evaluating Presentation Strategies for Fact-Checked ContentCode0
Position: Key Claims in LLM Research Have a Long Tail of Footnotes0
Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI0
Breaking Language Barriers with MMTweets: Advancing Cross-Lingual Debunked Narrative Retrieval for Fact-Checking0
Fact-Checking Generative AI: Ontology-Driven Biological Graphs for Disease-Gene Link Verification0
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from TextCode0
Fact-Checking of AI-Generated Reports0
MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question AnsweringCode0
ChatGPT vs. Google: A Comparative Study of Search Performance and User Experience0
Fraunhofer SIT at CheckThat! 2023: Tackling Classification Uncertainty Using Model Souping on the Example of Check-Worthiness Classification0
Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets0
Hallucination is the last thing you need0
News Verifiers Showdown: A Comparative Performance Evaluation of ChatGPT 3.5, ChatGPT 4.0, Bing AI, and Bard in News Fact-Checking0
bgGLUE: A Bulgarian General Language Understanding Evaluation BenchmarkCode0
Check-COVID: Fact-Checking COVID-19 News Claims with Scientific EvidenceCode0
Scientific Fact-Checking: A Survey of Resources and Approaches0
Give Me More Details: Improving Fact-Checking with Latent Retrieval0
OverPrompt: Enhancing ChatGPT through Efficient In-Context LearningCode0
Detecting Check-Worthy Claims in Political Debates, Speeches, and Interviews Using Audio DataCode0
SAIL: Search-Augmented Instruction Learning0
ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media0
Knowledge Graphs Querying0
Enhancing Large Language Models Against Inductive Instructions with Dual-critique PromptingCode0
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing0
Bridging History with AI A Comparative Evaluation of GPT 3.5, GPT4, and GoogleBARD in Predictive Accuracy and Fact Checking0
aedFaCT: Scientific Fact-Checking Made Easier via Semi-Automatic Discovery of Relevant Expert OpinionsCode0
FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering0
NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-CheckingCode0
The Intended Uses of Automated Fact-Checking Artefacts: Why, How and WhoCode0
Toxic comments reduce the activity of volunteer editors on Wikipedia0
Using Multiple RDF Knowledge Graphs for Enriching ChatGPT Responses0
An Entity-based Claim Extraction Pipeline for Real-world Biomedical Fact-checking0
Verifying the Robustness of Automatic Credibility AssessmentCode0
PANACEA: An Automated Misinformation Detection System on COVID-190
Show:102550
← PrevPage 8 of 14Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.78Unverified
2SGPT-BE-5.8BnDCG@100.75Unverified
3BM25+CEnDCG@100.69Unverified
4SGPT-CE-6.1BnDCG@100.68Unverified
5ColBERTnDCG@100.67Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BnDCG@100.31Unverified
2monoT5-3BnDCG@100.28Unverified
3BM25+CEnDCG@100.25Unverified
4SGPT-CE-6.1BnDCG@100.16Unverified
#ModelMetricClaimedVerifiedStatus
1monoT5-3BnDCG@100.85Unverified
2BM25+CEnDCG@100.82Unverified
3SGPT-BE-5.8BnDCG@100.78Unverified
4SGPT-CE-6.1BnDCG@100.73Unverified
#ModelMetricClaimedVerifiedStatus
1HerOQuestion Only score0.48Unverified
2CTU AICQuestion Only score0.46Unverified
3InFactQuestion Only score0.45Unverified
#ModelMetricClaimedVerifiedStatus
1Abc0..5sec2Unverified
#ModelMetricClaimedVerifiedStatus
1MA-CINPrecision0.26Unverified
#ModelMetricClaimedVerifiedStatus
1FDHNAccuracy (Test)0.7Unverified