| Logical forms complement probability in understanding language model (and human) performance | Feb 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DMWM: Dual-Mind World Model with Long-Term Imagination | Feb 11, 2025 | Logical Reasoning | —Unverified | 0 |
| Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation | Feb 10, 2025 | Logical Reasoning | CodeCode Available | 1 |
| Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation | Feb 10, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| S^2-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency | Feb 7, 2025 | Logical Reasoning | —Unverified | 0 |
| SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs | Feb 5, 2025 | Knowledge GraphsLogical Reasoning | —Unverified | 0 |
| Automating Mathematical Proof Generation Using Large Language Model Agents and Knowledge Graphs | Feb 4, 2025 | Formal LogicKnowledge Graphs | —Unverified | 0 |
| Standard Neural Computation Alone Is Insufficient for Logical Intelligence | Feb 4, 2025 | Inductive LearningLogical Reasoning | —Unverified | 0 |
| ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning | Feb 3, 2025 | Logical Reasoning | —Unverified | 0 |
| Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability | Jan 30, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Town Hall Debate Prompting: Enhancing Logical Reasoning in LLMs through Multi-Persona Interaction | Jan 28, 2025 | Logical ReasoningMultiple-choice | —Unverified | 0 |
| Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers | Jan 28, 2025 | Logical Reasoning | —Unverified | 0 |
| DBRouting: Routing End User Queries to Databases for Answerability | Jan 27, 2025 | Logical ReasoningSemantic Parsing | —Unverified | 0 |
| SedarEval: Automated Evaluation using Self-Adaptive Rubrics | Jan 26, 2025 | Logical Reasoning | CodeCode Available | 0 |
| A Causality-aware Paradigm for Evaluating Creativity of Multimodal Large Language Models | Jan 25, 2025 | Logical Reasoning | —Unverified | 0 |
| JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning in Large Language Models | Jan 24, 2025 | Logical Reasoning | CodeCode Available | 0 |
| VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning | Jan 24, 2025 | Logical Reasoning | —Unverified | 0 |
| PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation | Jan 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| Assessing the Alignment of FOL Closeness Metrics with Human Judgement | Jan 15, 2025 | Logical ReasoningSensitivity | CodeCode Available | 0 |
| LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process Thinking | Jan 14, 2025 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning | Jan 14, 2025 | Logical ReasoningMulti-hop Question Answering | —Unverified | 0 |
| TimeLogic: A Temporal Logic Benchmark for Video QA | Jan 13, 2025 | 2kAction Segmentation | —Unverified | 0 |
| Neural Probabilistic Circuits: Enabling Compositional and Interpretable Predictions through Logical Reasoning | Jan 13, 2025 | Attributecounterfactual | —Unverified | 0 |
| Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization | Jan 9, 2025 | Information RetrievalLogical Reasoning | —Unverified | 0 |
| Enhancing Transformers for Generalizable First-Order Logical Entailment | Jan 1, 2025 | Logical ReasoningOut-of-Distribution Generalization | —Unverified | 0 |