| FEVO: Financial Knowledge Expansion and Reasoning Evolution for Large Language Models | Jul 8, 2025 | Logical ReasoningReinforcement Learning (RL) | —Unverified | 0 |
| MiCo: Multi-image Contrast for Reinforcement Visual Reasoning | Jun 27, 2025 | Logical ReasoningRepresentation Learning | —Unverified | 0 |
| Discrete JEPA: Learning Discrete Token Representations without Reconstruction | Jun 17, 2025 | Logical Reasoning | —Unverified | 0 |
| CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making | Jun 15, 2025 | Answer GenerationDecision Making | —Unverified | 0 |
| SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models | Jun 15, 2025 | Logical ReasoningReinforcement Learning (RL) | CodeCode Available | 5 |
| Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation | Jun 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving | Jun 12, 2025 | Logical ReasoningMathematical Problem-Solving | —Unverified | 0 |
| TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games | Jun 11, 2025 | Logical ReasoningMath | —Unverified | 0 |
| EviNet: Evidential Reasoning Network for Resilient Graph Learning in the Open and Noisy Environments | Jun 8, 2025 | Graph LearningLogical Reasoning | CodeCode Available | 0 |
| Are LLMs Reliable Translators of Logical Reasoning Across Lexically Diversified Contexts? | Jun 5, 2025 | Formal LogicIn-Context Learning | CodeCode Available | 0 |
| Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study | Jun 5, 2025 | Logical Reasoning | CodeCode Available | 0 |
| Towards Geometry Problem Solving in the Large Model Era: A Survey | Jun 3, 2025 | Geometry Problem SolvingLogical Reasoning | —Unverified | 0 |
| VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL | May 29, 2025 | Arithmetic ReasoningImage Generation | —Unverified | 0 |
| Continuous Chain of Thought Enables Parallel Exploration and Reasoning | May 29, 2025 | Logical Reasoning | —Unverified | 0 |
| Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models | May 29, 2025 | Logical ReasoningMath | —Unverified | 0 |
| Climate Finance Bench | May 28, 2025 | Logical ReasoningQuantization | CodeCode Available | 0 |
| MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs | May 27, 2025 | Logical ReasoningMME | —Unverified | 0 |
| A Structured Unplugged Approach for Foundational AI Literacy in Primary Education | May 27, 2025 | Logical ReasoningMisconceptions | CodeCode Available | 0 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 |
| SV-TrustEval-C: Evaluating Structure and Semantic Reasoning in Large Language Models for Source Code Vulnerability Analysis | May 27, 2025 | Logical ReasoningVulnerability Detection | CodeCode Available | 0 |
| Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects | May 26, 2025 | Autonomous DrivingLogical Reasoning | CodeCode Available | 2 |
| Large Language Models for Planning: A Comprehensive and Systematic Survey | May 26, 2025 | Logical ReasoningNavigate | CodeCode Available | 1 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | May 26, 2025 | ARCLogical Reasoning | —Unverified | 0 |
| SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond | May 26, 2025 | Logical ReasoningReinforcement Learning (RL) | CodeCode Available | 2 |
| Interleaved Reasoning for Large Language Models via Reinforcement Learning | May 26, 2025 | Logical ReasoningMath | —Unverified | 0 |
| CP-Router: An Uncertainty-Aware Router Between LLM and LRM | May 26, 2025 | Conformal PredictionLogical Reasoning | —Unverified | 0 |
| Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers | May 26, 2025 | Logical ReasoningMathematical Problem-Solving | CodeCode Available | 0 |
| ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding | May 25, 2025 | Chart UnderstandingLogical Reasoning | CodeCode Available | 0 |
| MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning | May 23, 2025 | Logical Reasoning | —Unverified | 0 |
| Towards Competent AI for Fundamental Analysis in Finance: A Benchmark Dataset and Evaluation | May 22, 2025 | Financial AnalysisLogical Reasoning | —Unverified | 0 |
| Reasoning in Neurosymbolic AI | May 22, 2025 | FairnessLogical Reasoning | —Unverified | 0 |
| Sudoku-Bench: Evaluating creative reasoning with Sudoku variants | May 22, 2025 | DiversityLogical Reasoning | CodeCode Available | 0 |
| Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? | May 22, 2025 | Logical Reasoning | CodeCode Available | 1 |
| NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | May 21, 2025 | General Reinforcement LearningLogical Reasoning | CodeCode Available | 1 |
| Learning to Reason via Mixture-of-Thought for Logical Reasoning | May 21, 2025 | Logical ReasoningNatural Language Inference | CodeCode Available | 1 |
| Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning | May 20, 2025 | Logical ReasoningMathematical Reasoning | —Unverified | 0 |
| SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas | May 20, 2025 | BenchmarkingLogical Reasoning | —Unverified | 0 |
| Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues? | May 19, 2025 | Logical ReasoningOptical Character Recognition | CodeCode Available | 1 |
| BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs | May 18, 2025 | Logical Reasoning | CodeCode Available | 1 |
| Curriculum Abductive Learning | May 18, 2025 | Logical Reasoning | —Unverified | 0 |
| LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images? | May 18, 2025 | Logical ReasoningMultimodal Reasoning | CodeCode Available | 1 |
| System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection | May 10, 2025 | Logical ReasoningRAG | —Unverified | 0 |
| Learning Symbolic Persistent Macro-Actions for POMDP Solving Over Time | May 6, 2025 | Computational EfficiencyDecision Making | —Unverified | 0 |
| HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking | May 5, 2025 | Logical Reasoning | —Unverified | 0 |
| Reasoning Capabilities and Invariability of Large Language Models | May 1, 2025 | Logical Reasoning | CodeCode Available | 0 |
| A Report on the llms evaluating the high school questions | Apr 30, 2025 | Logical Reasoning | —Unverified | 0 |
| LR-IAD:Mask-Free Industrial Anomaly Detection with Logical Reasoning | Apr 28, 2025 | Anomaly DetectionLogical Reasoning | CodeCode Available | 0 |
| POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications | Apr 21, 2025 | HallucinationLogical Reasoning | —Unverified | 0 |
| CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs | Apr 21, 2025 | Claim VerificationLogical Reasoning | CodeCode Available | 0 |
| InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners | Apr 19, 2025 | Action GenerationLogical Reasoning | CodeCode Available | 2 |