| GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models | Oct 7, 2024 | GSM8K, Logical Reasoning | Code Available | 1 |
| Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios | May 26, 2023 | counterfactual, Counterfactual Reasoning | Code Available | 1 |
| Discriminative Reasoning for Document-level Relation Extraction | Jun 3, 2021 | Document-level Relation Extraction, Logical Reasoning | Code Available | 1 |
| ExAIS: Executable AI Semantics | Feb 20, 2022 | Logical Reasoning, valid | Code Available | 1 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language Model, Logical Reasoning | Code Available | 1 |
| Explicit Planning Helps Language Models in Logical Reasoning | Mar 28, 2023 | Logical Reasoning, Multiple-choice | Code Available | 1 |
| AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension | Mar 16, 2022 | Logical Reasoning, Machine Reading Comprehension | Code Available | 1 |
| Large Language Models for Planning: A Comprehensive and Systematic Survey | May 26, 2025 | Logical Reasoning, Navigate | Code Available | 1 |
| COLLIE: Systematic Construction of Constrained Text Generation Tasks | Jul 17, 2023 | Logical Reasoning, Sentence | Code Available | 1 |
| Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4 | Apr 7, 2023 | Logical Reasoning, Natural Language Inference | Code Available | 1 |
| Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace | Oct 30, 2023 | Code Generation, Logical Reasoning | Code Available | 1 |
| Learning to Reason via Mixture-of-Thought for Logical Reasoning | May 21, 2025 | Logical Reasoning, Natural Language Inference | Code Available | 1 |
| ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Dec 4, 2024 | 2k, Logical Reasoning | Code Available | 1 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem Solving, Large Language Model | Code Available | 1 |
| Logical Message Passing Networks with One-hop Inference on Atomic Formulas | Jan 21, 2023 | Complex Query Answering, Graph Representation Learning | Code Available | 1 |
| Logical Neural Networks | Jun 23, 2020 | Automated Theorem Proving, Logical Reasoning | Code Available | 1 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Do PLMs Know and Understand Ontological Knowledge? | Sep 12, 2023 | Logical Reasoning, Memorization | Code Available | 1 |
| Deductive Verification of Chain-of-Thought Reasoning | Jun 6, 2023 | Logical Reasoning | Code Available | 1 |
| Domain Specific Question Answering Over Knowledge Graphs Using Logical Programming and Large Language Models | Mar 3, 2023 | Knowledge Graphs, Logical Reasoning | Code Available | 1 |
| A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners | Jun 16, 2024 | Logical Reasoning | Code Available | 1 |
| LogiCoT: Logical Chain-of-Thought Instruction-Tuning | May 20, 2023 | Logical Reasoning, Text Generation | Code Available | 1 |
| Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs | Oct 22, 2020 | Complex Query Answering, Knowledge Graphs | Code Available | 1 |
| LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts | Jul 6, 2024 | Logical Reasoning, Mathematical Reasoning | Code Available | 1 |
| Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Feb 16, 2025 | Language Modeling, Language Modelling | Code Available | 1 |