| Why should we ever automate moral decision making? | Jul 10, 2024 | Decision MakingEthics | —Unverified | 0 |
| Analyzing Large language models chatbots: An experimental approach using a probability test | Jul 10, 2024 | ChatbotLogical Reasoning | —Unverified | 0 |
| Are Large Language Models Strategic Decision Makers? A Study of Performance and Bias in Two-Player Non-Zero-Sum Games | Jul 5, 2024 | Logical Reasoning | —Unverified | 0 |
| Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring | Jul 4, 2024 | Logical Reasoning | —Unverified | 0 |
| FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | Jun 27, 2024 | Decision MakingLogical Reasoning | —Unverified | 0 |
| Categorical Syllogisms Revisited: A Review of the Logical Reasoning Abilities of LLMs for Analyzing Categorical Syllogism | Jun 26, 2024 | Logical Reasoning | —Unverified | 0 |
| LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic | Jun 25, 2024 | ARCLogical Reasoning | —Unverified | 0 |
| Large Language Models Are Cross-Lingual Knowledge-Free Reasoners | Jun 24, 2024 | Cross-Lingual TransferLogical Reasoning | CodeCode Available | 0 |
| Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models | Jun 24, 2024 | Logical ReasoningNatural Language Understanding | CodeCode Available | 0 |
| Imperative Learning: A Self-supervised Neuro-Symbolic Learning Framework for Robot Autonomy | Jun 23, 2024 | Bilevel OptimizationImitation Learning | —Unverified | 0 |
| Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference | Jun 21, 2024 | Logical Reasoning | —Unverified | 0 |
| Pathformer: Recursive Path Query Encoding for Complex Logical Query Answering | Jun 21, 2024 | Knowledge GraphsLogical Reasoning | —Unverified | 0 |
| The neural correlates of logical-mathematical symbol systems processing resemble that of spatial cognition more than natural language processing | Jun 20, 2024 | Logical Reasoning | —Unverified | 0 |
| Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models | Jun 18, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment | Jun 17, 2024 | Logical ReasoningMath | —Unverified | 0 |
| Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars | Jun 16, 2024 | Automated Theorem ProvingLogical Reasoning | CodeCode Available | 0 |
| City-LEO: Toward Transparent City Management Using LLM with End-to-End Optimization | Jun 16, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Evaluating ChatGPT-4 Vision on Brazil's National Undergraduate Computer Science Exam | Jun 14, 2024 | FairnessLogical Reasoning | CodeCode Available | 0 |
| Dual Thinking and Logical Processing -- Are Multi-modal Large Language Models Closing the Gap with Human Vision ? | Jun 11, 2024 | Autonomous DrivingDeep Learning | CodeCode Available | 0 |
| Large Language Models are Limited in Out-of-Context Knowledge Reasoning | Jun 11, 2024 | AttributeLogical Reasoning | CodeCode Available | 0 |
| Improving Multi-hop Logical Reasoning in Knowledge Graphs with Context-Aware Query Representation Learning | Jun 11, 2024 | Knowledge GraphsLogical Reasoning | CodeCode Available | 0 |
| LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low-Resource and Extinct Languages | Jun 10, 2024 | Logical Reasoning | CodeCode Available | 0 |
| LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning | Jun 9, 2024 | Code GenerationHierarchical Reinforcement Learning | —Unverified | 0 |
| Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation | Jun 8, 2024 | Abstractive Text SummarizationDialogue Generation | —Unverified | 0 |
| On the Hardness of Probabilistic Neurosymbolic Learning | Jun 6, 2024 | Logical Reasoning | CodeCode Available | 0 |
| How Truncating Weights Improves Reasoning in Language Models | Jun 5, 2024 | Logical Reasoning | —Unverified | 0 |
| Bi-Chainer: Automated Large Language Models Reasoning with Bidirectional Chaining | Jun 5, 2024 | Logical Reasoning | —Unverified | 0 |
| Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks | Jun 4, 2024 | Code GenerationLogical Reasoning | —Unverified | 0 |
| Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities | Jun 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Synergistic Approach In Network Intrusion Detection By Neurosymbolic AI | Jun 3, 2024 | Intrusion DetectionLogical Reasoning | —Unverified | 0 |
| Logical Reasoning with Relation Network for Inductive Knowledge Graph Completion | Jun 3, 2024 | Inductive knowledge graph completionKnowledge Graph Completion | —Unverified | 0 |
| Brainstorming Brings Power to Large Language Models of Knowledge Reasoning | Jun 2, 2024 | Logical ReasoningReading Comprehension | —Unverified | 0 |
| A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters | Jun 1, 2024 | Logical ReasoningTranslation | CodeCode Available | 0 |
| PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering | May 29, 2024 | DiversityLogical Reasoning | —Unverified | 0 |
| Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving? | May 28, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| RLSF: Reinforcement Learning via Symbolic Feedback | May 26, 2024 | Logical ReasoningNatural Language Understanding | —Unverified | 0 |
| Distilling Instruction-following Abilities of Large Language Models with Task-aware Curriculum Planning | May 22, 2024 | Code GenerationInstruction Following | —Unverified | 0 |
| LLM+Reasoning+Planning for supporting incomplete user queries in presence of APIs | May 21, 2024 | Logical Reasoning | —Unverified | 0 |
| STAR: A Benchmark for Situated Reasoning in Real-World Videos | May 15, 2024 | DiagnosticLogical Reasoning | —Unverified | 0 |
| MetaReflection: Learning Instructions for Language Agents using Past Reflections | May 13, 2024 | Logical ReasoningQuestion Answering | —Unverified | 0 |
| MathDivide: Improved mathematical reasoning by large language models | May 12, 2024 | GSM8KLogical Reasoning | —Unverified | 0 |
| Logical Negation Augmenting and Debiasing for Prompt-based Methods | May 8, 2024 | Logical ReasoningNegation | —Unverified | 0 |
| Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics | May 7, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning | May 2, 2024 | Knowledge GraphsLogical Reasoning | —Unverified | 0 |
| SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and Applications | Apr 29, 2024 | Computational EfficiencyLogical Reasoning | —Unverified | 0 |
| Cantor: Inspiring Multimodal Chain-of-Thought of MLLM | Apr 24, 2024 | Decision MakingLogical Reasoning | —Unverified | 0 |
| Aligning Knowledge Graphs Provided by Humans and Generated from Neural Networks in Specific Tasks | Apr 23, 2024 | Knowledge GraphsLogical Reasoning | CodeCode Available | 0 |
| Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs | Apr 15, 2024 | Bias DetectionLogical Reasoning | —Unverified | 0 |
| Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding | Apr 4, 2024 | Logical FallaciesLogical Reasoning | —Unverified | 0 |
| CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues | Apr 4, 2024 | ChatbotInstruction Following | —Unverified | 0 |