SOTAVerified

Misinformation

Papers

Showing 176200 of 1282 papers

TitleStatusHype
Robustness of Misinformation Classification Systems to Adversarial Examples Through BeamAttackCode0
Persona-Assigned Large Language Models Exhibit Human-Like Motivated ReasoningCode0
Social Media Can Reduce Misinformation When Public Scrutiny is High0
Veracity: An Open-Source AI Fact-Checking System0
PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning0
RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-CheckingCode0
Step-by-Step Reasoning Attack: Revealing 'Erased' Knowledge in Large Language Models0
Dataset of News Articles with Provenance Metadata for Media Relevance Assessment0
Can LLMs Ground when they (Don't) Know: A Study on Direct and Loaded Political Questions0
In Crowd Veritas: Leveraging Human Intelligence To Fight Misinformation0
Evaluation empirique de la sécurisation et de l'alignement de ChatGPT et Gemini: analyse comparative des vulnérabilités par expérimentations de jailbreaks0
Societal AI Research Has Become Less Interdisciplinary0
Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework0
DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection0
Intentionally Unintentional: GenAI Exceptionalism and the First Amendment0
SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media PlatformsCode0
Combating Misinformation in the Arab World: Challenges & Opportunities0
When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models0
SUCEA: Reasoning-Intensive Retrieval for Adversarial Fact-checking through Claim Decomposition and EditingCode0
Is Perturbation-Based Image Protection Disruptive to Image Editing?0
Facts are Harder Than Opinions -- A Multilingual, Comparative Analysis of LLM-Based Fact-Checking Reliability0
Weak Supervision for Real World Graphs0
Goal-Aware Identification and Rectification of Misinformation in Multi-Agent SystemsCode0
XMAD-Bench: Cross-Domain Multilingual Audio Deepfake BenchmarkCode0
Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation0
Show:102550
← PrevPage 8 of 52Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TOKOFOUAverage F189.7Unverified