| FEVO: Financial Knowledge Expansion and Reasoning Evolution for Large Language Models | Jul 8, 2025 | Logical Reasoning, Reinforcement Learning (RL) | Unverified | 0 |
| MiCo: Multi-image Contrast for Reinforcement Visual Reasoning | Jun 27, 2025 | Logical Reasoning, Representation Learning | Unverified | 0 |
| Discrete JEPA: Learning Discrete Token Representations without Reconstruction | Jun 17, 2025 | Logical Reasoning | Unverified | 0 |
| CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making | Jun 15, 2025 | Answer Generation, Decision Making | Unverified | 0 |
| SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models | Jun 15, 2025 | Logical Reasoning, Reinforcement Learning (RL) | Code Available | 5 |
| Motion-R1: Chain-of-Thought Reasoning and Reinforcement Learning for Human Motion Generation | Jun 12, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving | Jun 12, 2025 | Logical Reasoning, Mathematical Problem-Solving | Unverified | 0 |
| TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games | Jun 11, 2025 | Logical Reasoning, Math | Unverified | 0 |
| EviNet: Evidential Reasoning Network for Resilient Graph Learning in the Open and Noisy Environments | Jun 8, 2025 | Graph Learning, Logical Reasoning | Code Available | 0 |
| Are LLMs Reliable Translators of Logical Reasoning Across Lexically Diversified Contexts? | Jun 5, 2025 | Formal Logic, In-Context Learning | Code Available | 0 |
| Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study | Jun 5, 2025 | Logical Reasoning | Code Available | 0 |
| Towards Geometry Problem Solving in the Large Model Era: A Survey | Jun 3, 2025 | Geometry Problem Solving, Logical Reasoning | Unverified | 0 |
| VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL | May 29, 2025 | Arithmetic Reasoning, Image Generation | Unverified | 0 |
| Continuous Chain of Thought Enables Parallel Exploration and Reasoning | May 29, 2025 | Logical Reasoning | Unverified | 0 |
| Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models | May 29, 2025 | Logical Reasoning, Math | Unverified | 0 |
| Climate Finance Bench | May 28, 2025 | Logical Reasoning, Quantization | Code Available | 0 |
| MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs | May 27, 2025 | Logical Reasoning, MME | Unverified | 0 |
| A Structured Unplugged Approach for Foundational AI Literacy in Primary Education | May 27, 2025 | Logical Reasoning, Misconceptions | Code Available | 0 |
| SV-TrustEval-C: Evaluating Structure and Semantic Reasoning in Large Language Models for Source Code Vulnerability Analysis | May 27, 2025 | Logical Reasoning, Vulnerability Detection | Code Available | 0 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language Model, Logical Reasoning | Code Available | 1 |
| Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects | May 26, 2025 | Autonomous Driving, Logical Reasoning | Code Available | 2 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | May 26, 2025 | ARC, Logical Reasoning | Unverified | 0 |
| Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers | May 26, 2025 | Logical Reasoning, Mathematical Problem-Solving | Code Available | 0 |
| CP-Router: An Uncertainty-Aware Router Between LLM and LRM | May 26, 2025 | Conformal Prediction, Logical Reasoning | Unverified | 0 |
| Interleaved Reasoning for Large Language Models via Reinforcement Learning | May 26, 2025 | Logical Reasoning, Math | Unverified | 0 |