| STAR: A Benchmark for Situated Reasoning in Real-World Videos | May 15, 2024 | DiagnosticLogical Reasoning | —Unverified | 0 |
| MetaReflection: Learning Instructions for Language Agents using Past Reflections | May 13, 2024 | Logical ReasoningQuestion Answering | —Unverified | 0 |
| MathDivide: Improved mathematical reasoning by large language models | May 12, 2024 | GSM8KLogical Reasoning | —Unverified | 0 |
| Logical Negation Augmenting and Debiasing for Prompt-based Methods | May 8, 2024 | Logical ReasoningNegation | —Unverified | 0 |
| Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics | May 7, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning | May 2, 2024 | Knowledge GraphsLogical Reasoning | —Unverified | 0 |
| SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and Applications | Apr 29, 2024 | Computational EfficiencyLogical Reasoning | —Unverified | 0 |
| Cantor: Inspiring Multimodal Chain-of-Thought of MLLM | Apr 24, 2024 | Decision MakingLogical Reasoning | —Unverified | 0 |
| Aligning Knowledge Graphs Provided by Humans and Generated from Neural Networks in Specific Tasks | Apr 23, 2024 | Knowledge GraphsLogical Reasoning | CodeCode Available | 0 |
| LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models | Apr 23, 2024 | Logical ReasoningQuestion Answering | CodeCode Available | 1 |
| Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs | Apr 15, 2024 | Bias DetectionLogical Reasoning | —Unverified | 0 |
| MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | Apr 6, 2024 | Logical ReasoningMath | CodeCode Available | 2 |
| CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues | Apr 4, 2024 | ChatbotInstruction Following | —Unverified | 0 |
| Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding | Apr 4, 2024 | Logical FallaciesLogical Reasoning | —Unverified | 0 |
| I-Design: Personalized LLM Interior Designer | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Guided Interpretable Video Action Reasoning | Apr 2, 2024 | Action RecognitionDecision Making | CodeCode Available | 0 |
| Advancing LLM Reasoning Generalists with Preference Trees | Apr 2, 2024 | BenchmarkingCode Generation | CodeCode Available | 3 |
| Classifying Conspiratorial Narratives At Scale: False Alarms and Erroneous Connections | Mar 29, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Sphere Neural-Networks for Rational Reasoning | Mar 22, 2024 | HallucinationLogical Reasoning | —Unverified | 0 |
| LeanReasoner: Boosting Complex Logical Reasoning with Lean | Mar 20, 2024 | Automated Theorem ProvingLogical Reasoning | CodeCode Available | 1 |
| Natural Language as Policies: Reasoning for Coordinate-Level Embodied Control with LLMs | Mar 20, 2024 | Logical ReasoningPrompt Engineering | —Unverified | 0 |
| Reasoning in Transformers - Mitigating Spurious Correlations and Reasoning Shortcuts | Mar 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations | Mar 12, 2024 | Decision MakingLogical Reasoning | CodeCode Available | 0 |
| Learning Guided Automated Reasoning: A Brief Survey | Mar 6, 2024 | Automated Theorem ProvingLogical Reasoning | —Unverified | 0 |
| Fuzzy Datalog^ over Arbitrary t-Norms | Mar 5, 2024 | Knowledge GraphsLogical Reasoning | —Unverified | 0 |