| Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models | Aug 21, 2024 | Logical ReasoningMotion Synthesis | —Unverified | 0 |
| SarcasmBench: Towards Evaluating Large Language Models on Sarcasm Understanding | Aug 21, 2024 | Logical ReasoningMathematical Reasoning | —Unverified | 0 |
| CHECKWHY: Causal Fact Verification via Argument Structure | Aug 20, 2024 | Fact VerificationLogical Reasoning | CodeCode Available | 1 |
| A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models | Aug 16, 2024 | Logical Reasoningvalid | —Unverified | 0 |
| Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | Aug 16, 2024 | DescriptiveHallucination | —Unverified | 0 |
| LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image | Aug 14, 2024 | Autonomous DrivingLogical Reasoning | —Unverified | 0 |
| Can Large Language Models Reason? A Characterization via 3-SAT | Aug 13, 2024 | Logical Reasoning | —Unverified | 0 |
| P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for data pruning in LLM Training | Aug 10, 2024 | DiversityLogical Reasoning | —Unverified | 0 |
| Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset | Aug 8, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation | Aug 7, 2024 | Logical ReasoningRecommendation Systems | —Unverified | 0 |
| Automated Theorem Provers Help Improve Large Language Model Reasoning | Aug 7, 2024 | Formal LogicLanguage Modeling | —Unverified | 0 |
| Leveraging Large Language Models with Chain-of-Thought and Prompt Engineering for Traffic Crash Severity Analysis and Inference | Aug 4, 2024 | Logical ReasoningPrompt Engineering | —Unverified | 0 |
| Deceptive AI systems that give explanations are more convincing than honest AI systems and can amplify belief in misinformation | Jul 31, 2024 | Logical ReasoningMisinformation | —Unverified | 0 |
| CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge | Jul 30, 2024 | In-Context LearningKnowledge Graphs | —Unverified | 0 |
| Take A Step Back: Rethinking the Two Stages in Visual Reasoning | Jul 29, 2024 | Logical ReasoningQuestion Answering | —Unverified | 0 |
| Logic Distillation: Learning from Code Function by Function for Planning and Decision-making | Jul 28, 2024 | Decision MakingKnowledge Distillation | —Unverified | 0 |
| An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought | Jul 22, 2024 | FormLogical Reasoning | —Unverified | 0 |
| An Explainable Fast Deep Neural Network for Emotion Recognition | Jul 20, 2024 | AttributeEmotion Classification | —Unverified | 0 |
| Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? | Jul 20, 2024 | Logical Reasoning | CodeCode Available | 0 |
| NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context? | Jul 16, 2024 | 4k8k | CodeCode Available | 9 |
| Leveraging large language models for nano synthesis mechanism explanation: solid foundations or mere conjectures? | Jul 12, 2024 | Logical ReasoningMultiple-choice | CodeCode Available | 0 |
| Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding | Jul 11, 2024 | EEGLanguage Modeling | CodeCode Available | 1 |
| Analyzing Large language models chatbots: An experimental approach using a probability test | Jul 10, 2024 | ChatbotLogical Reasoning | —Unverified | 0 |
| Why should we ever automate moral decision making? | Jul 10, 2024 | Decision MakingEthics | —Unverified | 0 |
| R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning | Jul 8, 2024 | Logical Reasoning | CodeCode Available | 1 |