SOTAVerified

Misinformation

Papers

Showing 171180 of 1282 papers

TitleStatusHype
GenAI vs. Human Fact-Checkers: Accurate Ratings, Flawed Rationales0
Local Differences, Global Lessons: Insights from Organisation Policies for International Legislation0
Can Community Notes Replace Professional Fact-Checkers?0
A Baseline Method for Removing Invisible Image Watermarks using Deep Image Prior0
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the WildCode0
LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic DataCode0
Implicit Repair with Reinforcement Learning in Emergent Communication0
HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic ClaimsCode1
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities0
Competing LLM Agents in a Non-Cooperative Game of Opinion Polarisation0
Show:102550
← PrevPage 18 of 129Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TOKOFOUAverage F189.7Unverified