| Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World | Oct 16, 2023 | Few-Shot Learning, Form | Code Available | 1 |
| DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy | Oct 28, 2023 | Logical Reasoning | Code Available | 1 |
| FaiRR: Faithful and Robust Deductive Reasoning over Natural Language | Mar 19, 2022 | Fact Selection, Logical Reasoning | Code Available | 1 |
| Explicit Planning Helps Language Models in Logical Reasoning | Mar 28, 2023 | Logical Reasoning, Multiple-choice | Code Available | 1 |
| ExAIS: Executable AI Semantics | Feb 20, 2022 | Logical Reasoning, valid | Code Available | 1 |
| From LSAT: The Progress and Challenges of Complex Reasoning | Aug 2, 2021 | Few-Shot Learning, Logical Reasoning | Code Available | 1 |
| Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs | Oct 22, 2020 | Complex Query Answering, Knowledge Graphs | Code Available | 1 |
| A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners | Jun 16, 2024 | Logical Reasoning | Code Available | 1 |
| Domain Specific Question Answering Over Knowledge Graphs Using Logical Programming and Large Language Models | Mar 3, 2023 | Knowledge Graphs, Logical Reasoning | Code Available | 1 |
| BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs | May 18, 2025 | Logical Reasoning | Code Available | 1 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem Solving, Large Language Model | Code Available | 1 |
| Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4 | Apr 7, 2023 | Logical Reasoning, Natural Language Inference | Code Available | 1 |
| End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking | Feb 11, 2022 | Logical Reasoning | Code Available | 1 |
| Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models | Feb 16, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Fact-driven Logical Reasoning for Machine Reading Comprehension | May 21, 2021 | Logical Reasoning, Machine Reading Comprehension | Code Available | 1 |
| ElecBench: a Power Dispatch Evaluation Benchmark for Large Language Models | Jul 7, 2024 | Fairness, General Knowledge | Code Available | 1 |
| Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs | Feb 18, 2024 | Logical Reasoning | Code Available | 1 |
| Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond | Jun 16, 2023 | Benchmarking, Evidence Selection | Code Available | 1 |
| Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation | Dec 25, 2023 | Knowledge Graphs, Logical Reasoning | Code Available | 1 |
| AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading Comprehension | Mar 16, 2022 | Logical Reasoning, Machine Reading Comprehension | Code Available | 1 |
| Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Oct 10, 2024 | Hallucination, Logical Reasoning | Code Available | 1 |
| Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? | May 22, 2025 | Logical Reasoning | Code Available | 1 |
| HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models | Sep 6, 2023 | General Knowledge, Logical Reasoning | Code Available | 1 |
| Discriminative Reasoning for Document-level Relation Extraction | Jun 3, 2021 | Document-level Relation Extraction, Logical Reasoning | Code Available | 1 |
| A Neuro-vector-symbolic Architecture for Solving Raven's Progressive Matrices | Mar 9, 2022 | Logical Reasoning | Code Available | 1 |