| OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems | Feb 21, 2024 | Logical Fallacies | Code Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 |
| Robust and Explainable Identification of Logical Fallacies in Natural Language Arguments | Dec 12, 2022 | Data Augmentation, Logical Fallacies | Code Available | 1 |
| Logical Fallacy Detection | Feb 28, 2022 | Language Modeling | Code Available | 1 |
| Leveraging Context for Multimodal Fallacy Classification in Political Debates | Jul 21, 2025 | Argument Mining, Logical Fallacies | Code Available | 0 |
| Are Large Language Models Good at Detecting Propaganda? | May 19, 2025 | Articles, Logical Fallacies | Unverified | 0 |
| SLURG: Investigating the Feasibility of Generating Synthetic Online Fallacious Discourse | Apr 16, 2025 | Diversity, Logical Fallacies | Unverified | 0 |
| Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-based Test Oracles | Apr 9, 2025 | Logical Fallacies, Logical Reasoning | Code Available | 0 |
| Large Language Models Are Better Logical Fallacy Reasoners with Counterargument, Explanation, and Goal-Aware Prompt Formulation | Mar 30, 2025 | Logical Fallacies, Logical Fallacy Detection | Code Available | 0 |
| RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises | Feb 18, 2025 | Logical Fallacies | Code Available | 0 |