| GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models | Oct 7, 2024 | GSM8K, Logical Reasoning | Code Available | 1 |
| HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models | Sep 6, 2023 | General Knowledge, Logical Reasoning | Code Available | 1 |
| DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy | Oct 28, 2023 | Logical Reasoning | Code Available | 1 |
| FOLIO: Natural Language Reasoning with First-Order Logic | Sep 2, 2022 | Language Modeling | Code Available | 1 |
| From LSAT: The Progress and Challenges of Complex Reasoning | Aug 2, 2021 | Few-Shot Learning, Logical Reasoning | Code Available | 1 |
| AbductionRules: Training Transformers to Explain Unexpected Inputs | Mar 23, 2022 | Common Sense Reasoning, Logical Reasoning | Code Available | 1 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language Model, Logical Reasoning | Code Available | 1 |
| Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning | Oct 13, 2023 | Data Augmentation, Logical Reasoning | Code Available | 1 |
| Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation | Dec 24, 2024 | Graph Question Answering, Hallucination | Code Available | 1 |
| ExAIS: Executable AI Semantics | Feb 20, 2022 | Logical Reasoning, valid | Code Available | 1 |
| ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers | Oct 13, 2021 | Logical Reasoning, Question Answering | Code Available | 1 |
| Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4 | Apr 7, 2023 | Logical Reasoning, Natural Language Inference | Code Available | 1 |
| Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Feb 16, 2025 | Language Modeling | Code Available | 1 |
| Conditional and Modal Reasoning in Large Language Models | Jan 30, 2024 | Logical Reasoning | Code Available | 1 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem Solving, Large Language Model | Code Available | 1 |
| Complex Logical Reasoning over Knowledge Graphs using Large Language Models | May 2, 2023 | Knowledge Graphs, Logical Reasoning | Code Available | 1 |
| Explicit Planning Helps Language Models in Logical Reasoning | Mar 28, 2023 | Logical Reasoning, Multiple-choice | Code Available | 1 |
| Counterfactual reasoning: Do language models need world knowledge for causal understanding? | Dec 6, 2022 | counterfactual, Counterfactual Reasoning | Code Available | 1 |
| Fact-driven Logical Reasoning for Machine Reading Comprehension | May 21, 2021 | Logical Reasoning, Machine Reading Comprehension | Code Available | 1 |
| ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models | Jul 7, 2024 | Fairness, General Knowledge | Code Available | 1 |
| Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning | May 21, 2023 | Abstract Meaning Representation, Contrastive Learning | Code Available | 1 |
| COLLIE: Systematic Construction of Constrained Text Generation Tasks | Jul 17, 2023 | Logical Reasoning, Sentence | Code Available | 1 |
| Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace | Oct 30, 2023 | Code Generation, Logical Reasoning | Code Available | 1 |
| AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models | Feb 24, 2025 | Logical Reasoning, Multiple-choice | Code Available | 1 |
| End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking | Feb 11, 2022 | Logical Reasoning | Code Available | 1 |