| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem SolvingLarge Language Model | CodeCode Available | 1 | 5 |
| ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models | Jul 7, 2024 | FairnessGeneral Knowledge | CodeCode Available | 1 | 5 |
| Deductive Verification of Chain-of-Thought Reasoning | Jun 6, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation | Dec 25, 2023 | Knowledge GraphsLogical Reasoning | CodeCode Available | 1 | 5 |
| ExAIS: Executable AI Semantics | Feb 20, 2022 | Logical Reasoningvalid | CodeCode Available | 1 | 5 |
| Discriminative Reasoning for Document-level Relation Extraction | Jun 3, 2021 | Document-level Relation ExtractionLogical Reasoning | CodeCode Available | 1 | 5 |
| FaiRR: Faithful and Robust Deductive Reasoning over Natural Language | Mar 19, 2022 | Fact SelectionLogical Reasoning | CodeCode Available | 1 | 5 |
| Explicit Planning Helps Language Models in Logical Reasoning | Mar 28, 2023 | Logical ReasoningMultiple-choice | CodeCode Available | 1 | 5 |
| Large Language Models are Better Reasoners with Self-Verification | Dec 19, 2022 | Arithmetic ReasoningCommon Sense Reasoning | CodeCode Available | 1 | 5 |
| Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Feb 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles | Aug 21, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers | Oct 23, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| Certified Deductive Reasoning with Language Models | Jun 6, 2023 | Logical Reasoningvalid | CodeCode Available | 1 | 5 |
| Chain of Images for Intuitively Reasoning | Nov 9, 2023 | Common Sense ReasoningLanguage Modelling | CodeCode Available | 1 | 5 |
| AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension | Mar 16, 2022 | Logical ReasoningMachine Reading Comprehension | CodeCode Available | 1 | 5 |
| Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Oct 10, 2024 | HallucinationLogical Reasoning | CodeCode Available | 1 | 5 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 | 5 |
| Counterfactual reasoning: Do language models need world knowledge for causal understanding? | Dec 6, 2022 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 | 5 |
| A Neuro-vector-symbolic Architecture for Solving Raven's Progressive Matrices | Mar 9, 2022 | Logical Reasoning | CodeCode Available | 1 | 5 |
| Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios | May 26, 2023 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 | 5 |
| AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models | Feb 24, 2025 | Logical ReasoningMultiple-choice | CodeCode Available | 1 | 5 |
| ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models | Feb 14, 2023 | Decision MakingLesion Segmentation | CodeCode Available | 1 | 5 |
| Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning | May 21, 2023 | Abstract Meaning RepresentationContrastive Learning | CodeCode Available | 1 | 5 |
| CHECKWHY: Causal Fact Verification via Argument Structure | Aug 20, 2024 | Fact VerificationLogical Reasoning | CodeCode Available | 1 | 5 |
| AbductionRules: Training Transformers to Explain Unexpected Inputs | Mar 23, 2022 | Common Sense ReasoningLogical Reasoning | CodeCode Available | 1 | 5 |