| LLM-Aided Efficient Hardware Design Automation | Oct 24, 2024 | Code RepairLogical Reasoning | —Unverified | 0 |
| Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks | Oct 24, 2024 | Logical ReasoningMathematical Problem-Solving | —Unverified | 0 |
| Aligning CodeLLMs with Direct Preference Optimization | Oct 24, 2024 | Decision MakingHumanEval | —Unverified | 0 |
| MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures | Oct 20, 2024 | Answer GenerationInformativeness | CodeCode Available | 0 |
| Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From Cognitive Psychology | Oct 19, 2024 | Logical ReasoningMath | —Unverified | 0 |
| Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation | Oct 19, 2024 | FormLogical Reasoning | CodeCode Available | 0 |
| From Babbling to Fluency: Evaluating the Evolution of Language Models in Terms of Human Language Acquisition | Oct 17, 2024 | Language AcquisitionLogical Reasoning | CodeCode Available | 0 |
| Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval | Oct 16, 2024 | Information RetrievalLogical Reasoning | —Unverified | 0 |
| "Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities | Oct 16, 2024 | Knowledge ProbingLogical Reasoning | —Unverified | 0 |
| Boosting Deductive Reasoning with Step Signals In RLHF | Oct 12, 2024 | Formal LogicLogical Reasoning | —Unverified | 0 |
| Transformer-based Language Models for Reasoning in the Description Logic ALCQ | Oct 12, 2024 | Logical Reasoning | —Unverified | 0 |
| A Systematic Assessment of OpenAI o1-Preview for Higher Order Thinking in Education | Oct 11, 2024 | Logical Reasoning | —Unverified | 0 |
| P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | Oct 11, 2024 | Large Language ModelLogical Reasoning | —Unverified | 0 |
| uto\!L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks | Oct 11, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction | Oct 10, 2024 | Binary ClassificationCitation Prediction | CodeCode Available | 0 |
| KnowGraph: Knowledge-Enabled Anomaly Detection via Logical Reasoning on Graph Data | Oct 10, 2024 | Anomaly DetectionFraud Detection | —Unverified | 0 |
| Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Oct 10, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Oct 10, 2024 | HallucinationLogical Reasoning | CodeCode Available | 1 |
| Think Beyond Size: Adaptive Prompting for More Effective Reasoning | Oct 10, 2024 | Arithmetic ReasoningComputational Efficiency | —Unverified | 0 |
| Can Transformers Reason Logically? A Study in SAT Solving | Oct 9, 2024 | DecoderLogical Reasoning | —Unverified | 0 |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Oct 9, 2024 | In-Context LearningLogical Reasoning | CodeCode Available | 0 |
| TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles | Oct 7, 2024 | Logical Reasoning | CodeCode Available | 2 |
| GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models | Oct 7, 2024 | GSM8KLogical Reasoning | CodeCode Available | 1 |
| Latent Feature Mining for Predictive Model Enhancement with Large Language Models | Oct 6, 2024 | Logical Reasoning | —Unverified | 0 |
| Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification | Oct 6, 2024 | ClassificationDomain Generalization | CodeCode Available | 0 |
| Deliberate Reasoning for LLMs as Structure-aware Planning with Accurate World Model | Oct 4, 2024 | DiversityLogical Reasoning | —Unverified | 0 |
| Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review | Oct 4, 2024 | Knowledge DistillationLogical Reasoning | CodeCode Available | 2 |
| GraphIC: A Graph-Based In-Context Example Retrieval Model for Multi-Step Reasoning | Oct 3, 2024 | Code GenerationIn-Context Learning | —Unverified | 0 |
| CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning | Oct 3, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| RATIONALYST: Pre-training Process-Supervision for Improving Reasoning | Oct 1, 2024 | Logical Reasoning | CodeCode Available | 1 |
| BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data | Oct 1, 2024 | Code GenerationLogical Reasoning | CodeCode Available | 0 |
| Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation | Sep 30, 2024 | Logical ReasoningMisinformation | —Unverified | 0 |
| Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models | Sep 26, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models | Sep 25, 2024 | Fake News DetectionLanguage Modeling | —Unverified | 0 |
| Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and Verification | Sep 24, 2024 | Data AugmentationLogical Reasoning | CodeCode Available | 0 |
| LTNtorch: PyTorch Implementation of Logic Tensor Networks | Sep 24, 2024 | Binary ClassificationLogical Reasoning | CodeCode Available | 2 |
| Thought-Path Contrastive Learning via Premise-Oriented Data Augmentation for Logical Reading Comprehension | Sep 22, 2024 | Contrastive Learningcounterfactual | CodeCode Available | 0 |
| GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion | Sep 21, 2024 | Logical Reasoning | —Unverified | 0 |
| Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Sep 19, 2024 | Logical ReasoningSpatial Reasoning | CodeCode Available | 0 |
| LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning | Sep 19, 2024 | GSM8KLogical Reasoning | CodeCode Available | 0 |
| ProSLM : A Prolog Synergized Language Model for explainable Domain Specific Knowledge Based Question Answering | Sep 17, 2024 | Formal LogicLanguage Modeling | —Unverified | 0 |
| Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving | Sep 16, 2024 | Autonomous DrivingLogical Reasoning | —Unverified | 0 |
| Unleash LLMs Potential for Recommendation by Coordinating Twin-Tower Dynamic Semantic Token Generator | Sep 14, 2024 | Logical ReasoningRecommendation Systems | —Unverified | 0 |
| KARGEN: Knowledge-enhanced Automated Radiology Report Generation Using Large Language Models | Sep 9, 2024 | Common Sense ReasoningLogical Reasoning | —Unverified | 0 |
| CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning | Sep 9, 2024 | Common Sense ReasoningLogical Reasoning | —Unverified | 0 |
| Action is the primary key: a categorical framework for episode description and logical reasoning | Sep 7, 2024 | Logical Reasoning | —Unverified | 0 |
| VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning | Sep 3, 2024 | Chart Question AnsweringData Visualization | CodeCode Available | 1 |
| Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness | Aug 31, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments | Aug 28, 2024 | knowledge editingKnowledge Graphs | —Unverified | 0 |
| LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models | Aug 28, 2024 | BenchmarkingLogical Reasoning | CodeCode Available | 1 |