| Interactive Visual Assessment for Text-to-Image Generation Models | Nov 23, 2024 | Image GenerationLogical Reasoning | —Unverified | 0 |
| XAgents: A Framework for Interpretable Rule-Based Multi-Agents Cooperation | Nov 21, 2024 | Feature CorrelationLogical Reasoning | —Unverified | 0 |
| Testing Uncertainty of Large Language Models for Physics Knowledge and Reasoning | Nov 18, 2024 | Logical ReasoningMultiple-choice | —Unverified | 0 |
| Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm | Nov 16, 2024 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash | Nov 15, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Symbolic-AI-Fusion Deep Learning (SAIF-DL): Encoding Knowledge into Training with Answer Set Programming Loss Penalties by a Novel Loss Function Approach | Nov 13, 2024 | Logical Reasoning | —Unverified | 0 |
| Building Trustworthy AI: Transparent AI Systems via Large Language Models, Ontologies, and Logical Reasoning (TranspNet) | Nov 13, 2024 | Logical ReasoningRAG | —Unverified | 0 |
| OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving? | Nov 9, 2024 | Logical ReasoningMath | —Unverified | 0 |
| Knowledge Authoring with Factual English, Rules, and Actions | Nov 9, 2024 | Logical Reasoning | —Unverified | 0 |
| How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis | Nov 6, 2024 | Logical Reasoning | —Unverified | 0 |
| Formal Logic-guided Robust Federated Learning against Poisoning Attacks | Nov 5, 2024 | Federated LearningFormal Logic | —Unverified | 0 |
| On Memorization of Large Language Models in Logical Reasoning | Oct 30, 2024 | Logical ReasoningMemorization | —Unverified | 0 |
| Leveraging LLMs for Hypothetical Deduction in Logical Inference: A Neuro-Symbolic Approach | Oct 29, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Combining Domain-Specific Models and LLMs for Automated Disease Phenotyping from Survey Data | Oct 28, 2024 | Logical Reasoningnamed-entity-recognition | —Unverified | 0 |
| Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs | Oct 26, 2024 | DiagnosticLogical Reasoning | —Unverified | 0 |
| Aligning CodeLLMs with Direct Preference Optimization | Oct 24, 2024 | Decision MakingHumanEval | —Unverified | 0 |
| LLM-Aided Efficient Hardware Design Automation | Oct 24, 2024 | Code RepairLogical Reasoning | —Unverified | 0 |
| Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks | Oct 24, 2024 | Logical ReasoningMathematical Problem-Solving | —Unverified | 0 |
| MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures | Oct 20, 2024 | Answer GenerationInformativeness | CodeCode Available | 0 |
| Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From Cognitive Psychology | Oct 19, 2024 | Logical ReasoningMath | —Unverified | 0 |
| Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation | Oct 19, 2024 | FormLogical Reasoning | CodeCode Available | 0 |
| From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition | Oct 17, 2024 | Language AcquisitionLogical Reasoning | CodeCode Available | 0 |
| Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval | Oct 16, 2024 | Information RetrievalLogical Reasoning | —Unverified | 0 |
| "Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities | Oct 16, 2024 | Knowledge ProbingLogical Reasoning | —Unverified | 0 |
| Transformer-based Language Models for Reasoning in the Description Logic ALCQ | Oct 12, 2024 | Logical Reasoning | —Unverified | 0 |