| On the Potential of CLIP for Compositional Logical Reasoning | Aug 30, 2023 | Logical Reasoning, Visual Reasoning | —Unverified | 0 |
| OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving? | Nov 9, 2024 | Logical Reasoning, Math | —Unverified | 0 |
| Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation | Feb 27, 2025 | Data Augmentation, Logical Reasoning | —Unverified | 0 |
| P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for data pruning in LLM Training | Aug 10, 2024 | Diversity, Logical Reasoning | —Unverified | 0 |
| Pathformer: Recursive Path Query Encoding for Complex Logical Query Answering | Jun 21, 2024 | Knowledge Graphs, Logical Reasoning | —Unverified | 0 |
| PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering | May 29, 2024 | Diversity, Logical Reasoning | —Unverified | 0 |
| P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | Oct 11, 2024 | Large Language Model, Logical Reasoning | —Unverified | 0 |
| Physics of Language Models: Part 3.2, Knowledge Manipulation | Sep 25, 2023 | Attribute, Language Modelling | —Unverified | 0 |
| PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs | Feb 12, 2024 | Instruction Following, Logical Reasoning | —Unverified | 0 |
| POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications | Apr 21, 2025 | Hallucination, Logical Reasoning | —Unverified | 0 |
| Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment | Jun 17, 2024 | Logical Reasoning, Math | —Unverified | 0 |
| ProSLM : A Prolog Synergized Language Model for explainable Domain Specific Knowledge Based Question Answering | Sep 17, 2024 | Formal Logic, Language Modeling | —Unverified | 0 |
| Provable Failure of Language Models in Learning Majority Boolean Logic via Gradient Descent | Apr 7, 2025 | Logical Reasoning | —Unverified | 0 |
| Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling | Mar 5, 2025 | In-Context Learning, Logical Reasoning | —Unverified | 0 |
| PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving | Apr 15, 2025 | Logical Reasoning, Visual Question Answering (VQA) | —Unverified | 0 |
| Puzzle Solving using Reasoning of Large Language Models: A Survey | Feb 17, 2024 | Logical Reasoning, Survey | —Unverified | 0 |
| Quantifying Adaptability in Pre-trained Language Models with 500 Tasks | Jan 16, 2022 | Language Modeling, Language Modelling | —Unverified | 0 |
| Quantifying Logical Consistency in Transformers via Query-Key Alignment | Feb 24, 2025 | Logical Reasoning, valid | —Unverified | 0 |
| Teaching Pretrained Models with Commonsense Reasoning: A Preliminary KB-Based Approach | Sep 20, 2019 | Few-Shot Learning, Logical Reasoning | —Unverified | 0 |
| Quantum Structure in Cognition and the Foundations of Human Reasoning | Dec 30, 2014 | Decision Making, Logical Reasoning | —Unverified | 0 |
| Quantum Structure of Negation and Conjunction in Human Thought | Mar 14, 2015 | Logical Reasoning, Negation | —Unverified | 0 |
| Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding | Apr 4, 2024 | Logical Fallacies, Logical Reasoning | —Unverified | 0 |
| Reasoning Algorithmically in Graph Neural Networks | Feb 21, 2024 | Combinatorial Optimization, Edge Classification | —Unverified | 0 |
| Reasoning-Aware Query-Focused Summarization over Multi-Table Data | Dec 12, 2024 | Logical Reasoning, Query-focused Summarization | —Unverified | 0 |
| Reasoning in Neurosymbolic AI | May 22, 2025 | Fairness, Logical Reasoning | —Unverified | 0 |
| Reasoning in Transformers - Mitigating Spurious Correlations and Reasoning Shortcuts | Mar 17, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Reasoning in Vector Space: An Exploratory Study of Question Answering | Nov 19, 2015 | Common Sense Reasoning, Logical Reasoning | —Unverified | 0 |
| Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation | Mar 12, 2025 | All, counterfactual | —Unverified | 0 |
| Reasoning Like Program Executors | Nov 16, 2021 | Logical Reasoning, Math | —Unverified | 0 |
| Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification | Apr 7, 2025 | Logical Reasoning, Math | —Unverified | 0 |
| Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs | Oct 26, 2024 | Diagnostic, Logical Reasoning | —Unverified | 0 |
| Reasoning over Logically Interacted Conditions for Question Answering | May 25, 2022 | Logical Reasoning, Question Answering | —Unverified | 0 |
| Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning | Jan 14, 2025 | Logical Reasoning, Multi-hop Question Answering | —Unverified | 0 |
| Reasoning with Multi-Structure Commonsense Knowledge in Visual Dialog | Apr 10, 2022 | Logical Reasoning, Sentence | —Unverified | 0 |
| Reduced Implication-bias Logic Loss for Neuro-Symbolic Learning | Aug 14, 2022 | Logical Reasoning | —Unverified | 0 |
| Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring | Oct 20, 2023 | Logical Reasoning, Response Generation | —Unverified | 0 |
| Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions | Feb 25, 2025 | Inductive Bias, Logical Reasoning | —Unverified | 0 |
| Reverse Thinking Makes LLMs Stronger Reasoners | Nov 29, 2024 | Data Augmentation, Knowledge Distillation | —Unverified | 0 |
| RLSF: Reinforcement Learning via Symbolic Feedback | May 26, 2024 | Logical Reasoning, Natural Language Understanding | —Unverified | 0 |
| Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning | Mar 25, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| S^2-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency | Feb 7, 2025 | Logical Reasoning | —Unverified | 0 |
| SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas | May 20, 2025 | Benchmarking, Logical Reasoning | —Unverified | 0 |
| Scales and Hedges in a Logic with Analogous Semantics | Jan 21, 2022 | Decision Making, Logical Reasoning | —Unverified | 0 |
| Scallop: A Language for Neurosymbolic Programming | Apr 10, 2023 | Logical Reasoning, Negation | —Unverified | 0 |
| Scallop: From Probabilistic Deductive Databases to Scalable Differentiable Reasoning | Dec 1, 2021 | Logical Reasoning, Question Answering | —Unverified | 0 |
| SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity | Dec 30, 2024 | Benchmarking, Code Generation | —Unverified | 0 |
| Selection-Inference: Exploiting Large Language Models for Interpretable Logical Reasoning | May 19, 2022 | Logical Reasoning | —Unverified | 0 |
| Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs | May 19, 2023 | Arithmetic Reasoning, GSM8K | —Unverified | 0 |
| SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment | Nov 27, 2024 | Classification, Decision Making | —Unverified | 0 |
| ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning | Mar 26, 2025 | Logical Reasoning | —Unverified | 0 |