| Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs | Jun 13, 2024 | Arithmetic ReasoningFact Verification | CodeCode Available | 2 | 5 |
| PaLM: Scaling Language Modeling with Pathways | Apr 5, 2022 | Auto DebuggingCode Generation | CodeCode Available | 2 | 5 |
| ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning | Mar 19, 2022 | Chart Question AnsweringLogical Reasoning | CodeCode Available | 2 | 5 |
| Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects | May 26, 2025 | Autonomous DrivingLogical Reasoning | CodeCode Available | 2 | 5 |
| Easy Problems That LLMs Get Wrong | May 30, 2024 | Common Sense ReasoningLogical Reasoning | CodeCode Available | 2 | 5 |
| Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation | Feb 26, 2025 | Code GenerationHumanEval | CodeCode Available | 2 | 5 |
| Ontology Embedding: A Survey of Methods, Applications and Resources | Jun 16, 2024 | Logical ReasoningOntology Embedding | CodeCode Available | 2 | 5 |
| MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | Apr 6, 2024 | Logical ReasoningMath | CodeCode Available | 2 | 5 |
| Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning | May 20, 2023 | Logical Reasoning | CodeCode Available | 2 | 5 |
| LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Jan 14, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 | 5 |
| Cumulative Reasoning with Large Language Models | Aug 8, 2023 | Decision MakingLogical Reasoning | CodeCode Available | 2 | 5 |
| Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation | Dec 5, 2023 | Logical Reasoning | CodeCode Available | 2 | 5 |
| MedAgent-Pro: Towards Evidence-based Multi-modal Medical Diagnosis via Reasoning Agentic Workflow | Mar 21, 2025 | DiagnosticLogical Reasoning | CodeCode Available | 2 | 5 |
| Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples | Jun 9, 2024 | ARCDiversity | CodeCode Available | 2 | 5 |
| InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners | Apr 19, 2025 | Action GenerationLogical Reasoning | CodeCode Available | 2 | 5 |
| LangBridge: Multilingual Reasoning Without Multilingual Supervision | Jan 19, 2024 | Code CompletionLogical Reasoning | CodeCode Available | 2 | 5 |
| Large Language Models are Zero-Shot Reasoners | May 24, 2022 | Arithmetic ReasoningCommon Sense Reasoning | CodeCode Available | 2 | 5 |
| FlashRNN: Optimizing Traditional RNNs on Modern Hardware | Dec 10, 2024 | GPULogical Reasoning | CodeCode Available | 2 | 5 |
| Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing | Apr 3, 2025 | BenchmarkingLogical Reasoning | CodeCode Available | 2 | 5 |
| Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus | Nov 19, 2024 | Formal LogicLogical Reasoning | CodeCode Available | 2 | 5 |
| LTNtorch: PyTorch Implementation of Logic Tensor Networks | Sep 24, 2024 | Binary ClassificationLogical Reasoning | CodeCode Available | 2 | 5 |
| Evaluating the World Model Implicit in a Generative Model | Jun 6, 2024 | Logical Reasoningmodel | CodeCode Available | 2 | 5 |
| Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | May 24, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 | 5 |
| Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review | Oct 4, 2024 | Knowledge DistillationLogical Reasoning | CodeCode Available | 2 | 5 |
| Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 | Mar 31, 2025 | Logical ReasoningMultiple-choice | CodeCode Available | 2 | 5 |