| Boosting Deductive Reasoning with Step Signals In RLHF | Oct 12, 2024 | Formal LogicLogical Reasoning | —Unverified | 0 |
| uto\!L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks | Oct 11, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| A Systematic Assessment of OpenAI o1-Preview for Higher Order Thinking in Education | Oct 11, 2024 | Logical Reasoning | —Unverified | 0 |
| P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | Oct 11, 2024 | Large Language ModelLogical Reasoning | —Unverified | 0 |
| KnowGraph: Knowledge-Enabled Anomaly Detection via Logical Reasoning on Graph Data | Oct 10, 2024 | Anomaly DetectionFraud Detection | —Unverified | 0 |
| HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction | Oct 10, 2024 | Binary ClassificationCitation Prediction | CodeCode Available | 0 |
| Think Beyond Size: Adaptive Prompting for More Effective Reasoning | Oct 10, 2024 | Arithmetic ReasoningComputational Efficiency | —Unverified | 0 |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Oct 9, 2024 | In-Context LearningLogical Reasoning | CodeCode Available | 0 |
| Can Transformers Reason Logically? A Study in SAT Solving | Oct 9, 2024 | DecoderLogical Reasoning | —Unverified | 0 |
| Latent Feature Mining for Predictive Model Enhancement with Large Language Models | Oct 6, 2024 | Logical Reasoning | —Unverified | 0 |
| Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification | Oct 6, 2024 | ClassificationDomain Generalization | CodeCode Available | 0 |
| Deliberate Reasoning for LLMs as Structure-aware Planning with Accurate World Model | Oct 4, 2024 | DiversityLogical Reasoning | —Unverified | 0 |
| CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning | Oct 3, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| GraphIC: A Graph-Based In-Context Example Retrieval Model for Multi-Step Reasoning | Oct 3, 2024 | Code GenerationIn-Context Learning | —Unverified | 0 |
| BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data | Oct 1, 2024 | Code GenerationLogical Reasoning | CodeCode Available | 0 |
| Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation | Sep 30, 2024 | Logical ReasoningMisinformation | —Unverified | 0 |
| Logic-of-Thought: Injecting Logic into Contexts for Full Reasoning in Large Language Models | Sep 26, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models | Sep 25, 2024 | Fake News DetectionLanguage Modeling | —Unverified | 0 |
| Strategies for Improving NL-to-FOL Translation with LLMs: Data Generation, Incremental Fine-Tuning, and Verification | Sep 24, 2024 | Data AugmentationLogical Reasoning | CodeCode Available | 0 |
| Thought-Path Contrastive Learning via Premise-Oriented Data Augmentation for Logical Reading Comprehension | Sep 22, 2024 | Contrastive Learningcounterfactual | CodeCode Available | 0 |
| GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion | Sep 21, 2024 | Logical Reasoning | —Unverified | 0 |
| LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning | Sep 19, 2024 | GSM8KLogical Reasoning | CodeCode Available | 0 |
| Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Sep 19, 2024 | Logical ReasoningSpatial Reasoning | CodeCode Available | 0 |
| ProSLM : A Prolog Synergized Language Model for explainable Domain Specific Knowledge Based Question Answering | Sep 17, 2024 | Formal LogicLanguage Modeling | —Unverified | 0 |
| Video Token Sparsification for Efficient Multimodal LLMs in Autonomous Driving | Sep 16, 2024 | Autonomous DrivingLogical Reasoning | —Unverified | 0 |
| Unleash LLMs Potential for Recommendation by Coordinating Twin-Tower Dynamic Semantic Token Generator | Sep 14, 2024 | Logical ReasoningRecommendation Systems | —Unverified | 0 |
| KARGEN: Knowledge-enhanced Automated Radiology Report Generation Using Large Language Models | Sep 9, 2024 | Common Sense ReasoningLogical Reasoning | —Unverified | 0 |
| CauseJudger: Identifying the Cause with LLMs for Abductive Logical Reasoning | Sep 9, 2024 | Common Sense ReasoningLogical Reasoning | —Unverified | 0 |
| Action is the primary key: a categorical framework for episode description and logical reasoning | Sep 7, 2024 | Logical Reasoning | —Unverified | 0 |
| Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness | Aug 31, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments | Aug 28, 2024 | knowledge editingKnowledge Graphs | —Unverified | 0 |
| SarcasmBench: Towards Evaluating Large Language Models on Sarcasm Understanding | Aug 21, 2024 | Logical ReasoningMathematical Reasoning | —Unverified | 0 |
| Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models | Aug 21, 2024 | Logical ReasoningMotion Synthesis | —Unverified | 0 |
| Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | Aug 16, 2024 | DescriptiveHallucination | —Unverified | 0 |
| A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models | Aug 16, 2024 | Logical Reasoningvalid | —Unverified | 0 |
| LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image | Aug 14, 2024 | Autonomous DrivingLogical Reasoning | —Unverified | 0 |
| Can Large Language Models Reason? A Characterization via 3-SAT | Aug 13, 2024 | Logical Reasoning | —Unverified | 0 |
| P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for data pruning in LLM Training | Aug 10, 2024 | DiversityLogical Reasoning | —Unverified | 0 |
| Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset | Aug 8, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Automated Theorem Provers Help Improve Large Language Model Reasoning | Aug 7, 2024 | Formal LogicLanguage Modeling | —Unverified | 0 |
| Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation | Aug 7, 2024 | Logical ReasoningRecommendation Systems | —Unverified | 0 |
| Leveraging Large Language Models with Chain-of-Thought and Prompt Engineering for Traffic Crash Severity Analysis and Inference | Aug 4, 2024 | Logical ReasoningPrompt Engineering | —Unverified | 0 |
| Deceptive AI systems that give explanations are more convincing than honest AI systems and can amplify belief in misinformation | Jul 31, 2024 | Logical ReasoningMisinformation | —Unverified | 0 |
| CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge | Jul 30, 2024 | In-Context LearningKnowledge Graphs | —Unverified | 0 |
| Take A Step Back: Rethinking the Two Stages in Visual Reasoning | Jul 29, 2024 | Logical ReasoningQuestion Answering | —Unverified | 0 |
| Logic Distillation: Learning from Code Function by Function for Planning and Decision-making | Jul 28, 2024 | Decision MakingKnowledge Distillation | —Unverified | 0 |
| An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought | Jul 22, 2024 | FormLogical Reasoning | —Unverified | 0 |
| An Explainable Fast Deep Neural Network for Emotion Recognition | Jul 20, 2024 | AttributeEmotion Classification | —Unverified | 0 |
| Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? | Jul 20, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Leveraging large language models for nano synthesis mechanism explanation: solid foundations or mere conjectures? | Jul 12, 2024 | Logical ReasoningMultiple-choice | CodeCode Available | 0 |