| Unveiling the Magic of Code Reasoning through Hypothesis Decomposition and Amendment | Feb 17, 2025 | HallucinationLogical Reasoning | CodeCode Available | 2 | 5 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto DebuggingCode Generation | CodeCode Available | 2 | 5 |
| A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity | Feb 8, 2023 | Code GenerationHallucination | CodeCode Available | 1 | 5 |
| Learning to Reason via Mixture-of-Thought for Logical Reasoning | May 21, 2025 | Logical ReasoningNatural Language Inference | CodeCode Available | 1 | 5 |
| LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers | Oct 23, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| LeanReasoner: Boosting Complex Logical Reasoning with Lean | Mar 20, 2024 | Automated Theorem ProvingLogical Reasoning | CodeCode Available | 1 | 5 |
| LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles | Aug 21, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking | Feb 11, 2022 | Logical Reasoning | CodeCode Available | 1 | 5 |
| LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models | Jan 1, 2024 | Code GenerationIn-Context Learning | CodeCode Available | 1 | 5 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem SolvingLarge Language Model | CodeCode Available | 1 | 5 |
| Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic | Aug 11, 2023 | Formal LogicLogical Reasoning | CodeCode Available | 1 | 5 |
| Improving Large Language Models in Event Relation Logical Prediction | Oct 13, 2023 | counterfactualEvent Relation Extraction | CodeCode Available | 1 | 5 |
| Large Language Models are Better Reasoners with Self-Verification | Dec 19, 2022 | Arithmetic ReasoningCommon Sense Reasoning | CodeCode Available | 1 | 5 |
| Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization | Apr 9, 2025 | Logical ReasoningMathematical Reasoning | CodeCode Available | 1 | 5 |
| Large Language Models for Planning: A Comprehensive and Systematic Survey | May 26, 2025 | Logical ReasoningNavigate | CodeCode Available | 1 | 5 |
| AI Descartes: Combining Data and Theory for Derivable Scientific Discovery | Sep 3, 2021 | Automated Theorem ProvingBIG-bench Machine Learning | CodeCode Available | 1 | 5 |
| Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning | Sep 29, 2022 | Logical ReasoningMath | CodeCode Available | 1 | 5 |
| Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples | Nov 22, 2021 | Knowledge GraphsLanguage Modeling | CodeCode Available | 1 | 5 |
| Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming | May 5, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace | Oct 30, 2023 | Code GenerationLogical Reasoning | CodeCode Available | 1 | 5 |
| Evolving Scientific Discovery by Unifying Data and Background Knowledge with AI Hilbert | Aug 18, 2023 | Equation DiscoveryLogical Reasoning | CodeCode Available | 1 | 5 |
| ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models | Jul 7, 2024 | FairnessGeneral Knowledge | CodeCode Available | 1 | 5 |
| Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation | Feb 10, 2025 | Logical Reasoning | CodeCode Available | 1 | 5 |
| HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models | Sep 6, 2023 | General KnowledgeLogical Reasoning | CodeCode Available | 1 | 5 |
| Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World | Oct 16, 2023 | Few-Shot LearningForm | CodeCode Available | 1 | 5 |
| Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation | Dec 24, 2024 | Graph Question AnsweringHallucination | CodeCode Available | 1 | 5 |
| GLoRE: Evaluating Logical Reasoning of Large Language Models | Oct 13, 2023 | Logical ReasoningNatural Language Understanding | CodeCode Available | 1 | 5 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 | 5 |
| Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios | May 26, 2023 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 | 5 |
| GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models | Oct 7, 2024 | GSM8KLogical Reasoning | CodeCode Available | 1 | 5 |
| A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners | Jun 16, 2024 | Logical Reasoning | CodeCode Available | 1 | 5 |
| Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning | May 21, 2023 | Abstract Meaning RepresentationContrastive Learning | CodeCode Available | 1 | 5 |
| Domain Specific Question Answering Over Knowledge Graphs Using Logical Programming and Large Language Models | Mar 3, 2023 | Knowledge GraphsLogical Reasoning | CodeCode Available | 1 | 5 |
| BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs | May 18, 2025 | Logical Reasoning | CodeCode Available | 1 | 5 |
| FOLIO: Natural Language Reasoning with First-Order Logic | Sep 2, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs | Oct 22, 2020 | Complex Query AnsweringKnowledge Graphs | CodeCode Available | 1 | 5 |
| DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy | Oct 28, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| Discriminative Reasoning for Document-level Relation Extraction | Jun 3, 2021 | Document-level Relation ExtractionLogical Reasoning | CodeCode Available | 1 | 5 |
| Counterfactual reasoning: Do language models need world knowledge for causal understanding? | Dec 6, 2022 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 | 5 |
| DAGN: Discourse-Aware Graph Network for Logical Reasoning | Mar 26, 2021 | Logical ReasoningSentence | CodeCode Available | 1 | 5 |
| ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers | Oct 13, 2021 | Logical ReasoningQuestion Answering | CodeCode Available | 1 | 5 |
| Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation | Dec 25, 2023 | Knowledge GraphsLogical Reasoning | CodeCode Available | 1 | 5 |
| Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond | Jun 16, 2023 | BenchmarkingEvidence Selection | CodeCode Available | 1 | 5 |
| Deductive Verification of Chain-of-Thought Reasoning | Jun 6, 2023 | Logical Reasoning | CodeCode Available | 1 | 5 |
| From LSAT: The Progress and Challenges of Complex Reasoning | Aug 2, 2021 | Few-Shot LearningLogical Reasoning | CodeCode Available | 1 | 5 |
| Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Jul 11, 2024 | EEGLanguage Modeling | CodeCode Available | 1 | 5 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension | Mar 16, 2022 | Logical ReasoningMachine Reading Comprehension | CodeCode Available | 1 | 5 |
| Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? | May 22, 2025 | Logical Reasoning | CodeCode Available | 1 | 5 |
| Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Oct 10, 2024 | HallucinationLogical Reasoning | CodeCode Available | 1 | 5 |