| FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models | Nov 16, 2023 | Instruction FollowingLogical Reasoning | —Unverified | 0 |
| A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning | Nov 14, 2023 | Logical FallaciesLogical Reasoning | CodeCode Available | 0 |
| Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study | Nov 13, 2023 | Logical ReasoningPrompt Engineering | CodeCode Available | 0 |
| From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models | Nov 12, 2023 | Language ModellingLogical Reasoning | —Unverified | 0 |
| Are LLMs Rigorous Logical Reasoner? Empowering Natural Language Proof Generation with Contrastive Stepwise Decoding | Nov 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Models can be Logical Solvers | Nov 10, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Let's Reinforce Step by Step | Nov 10, 2023 | GSM8KLogical Reasoning | —Unverified | 0 |
| COOL: A Constraint Object-Oriented Logic Programming Language and its Neural-Symbolic Compilation System | Nov 7, 2023 | Logical Reasoning | —Unverified | 0 |
| Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions | Nov 5, 2023 | Logical ReasoningMultiple-choice | —Unverified | 0 |
| Rule Learning as Machine Translation using the Atomic Knowledge Bank | Nov 5, 2023 | Logical ReasoningMachine Translation | CodeCode Available | 0 |
| Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis | Nov 1, 2023 | Logical ReasoningPrompt Engineering | CodeCode Available | 0 |
| Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings | Oct 26, 2023 | DisentanglementLogical Reasoning | CodeCode Available | 0 |
| POE: Process of Elimination for Multiple Choice Reasoning | Oct 24, 2023 | In-Context LearningLogical Reasoning | CodeCode Available | 0 |
| Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention | Oct 23, 2023 | Logical Reasoning | CodeCode Available | 0 |
| Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism | Oct 23, 2023 | Logical ReasoningNegation | —Unverified | 0 |
| DetectGPT-SC: Improving Detection of Text Generated by Large Language Models through Self-Consistency with Masked Predictions | Oct 23, 2023 | Logical ReasoningText Generation | —Unverified | 0 |
| Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring | Oct 20, 2023 | Logical ReasoningResponse Generation | —Unverified | 0 |
| The potential of large language models for improving probability learning: A study on ChatGPT3.5 and first-year computer engineering students | Oct 9, 2023 | Language ModellingLogical Reasoning | —Unverified | 0 |
| Empower Nested Boolean Logic via Self-Supervised Curriculum Learning | Oct 9, 2023 | Logical ReasoningSelf-Supervised Learning | CodeCode Available | 0 |
| DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers | Oct 5, 2023 | DecoderLogical Reasoning | CodeCode Available | 0 |
| Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance | Oct 3, 2023 | Code GenerationLogical Reasoning | CodeCode Available | 0 |
| Learning Reliable Logical Rules with SATNet | Oct 3, 2023 | Logical Reasoning | —Unverified | 0 |
| Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models | Oct 2, 2023 | Knowledge DistillationLanguage Modelling | —Unverified | 0 |
| DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks | Sep 29, 2023 | Logical Reasoning | —Unverified | 0 |
| Physics of Language Models: Part 3.2, Knowledge Manipulation | Sep 25, 2023 | AttributeLanguage Modelling | —Unverified | 0 |
| EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning | Sep 16, 2023 | Date UnderstandingGSM8K | CodeCode Available | 0 |
| On the Potential of CLIP for Compositional Logical Reasoning | Aug 30, 2023 | Logical ReasoningVisual Reasoning | —Unverified | 0 |
| LR-XFL: Logical Reasoning-based Explainable Federated Learning | Aug 24, 2023 | Federated LearningLogical Reasoning | CodeCode Available | 0 |
| Human Comprehensible Active Learning of Genome-Scale Metabolic Networks | Aug 24, 2023 | Active LearningExperimental Design | —Unverified | 0 |
| Deciphering Raw Data in Neuro-Symbolic Learning with Provable Guarantees | Aug 21, 2023 | Logical Reasoning | CodeCode Available | 0 |
| How susceptible are LLMs to Logical Fallacies? | Aug 18, 2023 | DiagnosticLogical Fallacies | CodeCode Available | 0 |
| Boosting Logical Reasoning in Large Language Models through a New Framework: The Graph of Thought | Aug 16, 2023 | Logical Reasoning | —Unverified | 0 |
| Learning the meanings of function words from grounded language using a visual question answering model | Aug 16, 2023 | Logical ReasoningQuestion Answering | CodeCode Available | 0 |
| Thinking Like an Expert:Multimodal Hypergraph-of-Thought (HoT) Reasoning to boost Foundation Modals | Aug 11, 2023 | Graph LearningLogical Reasoning | —Unverified | 0 |
| Structural Embeddings of Tools for Large Language Models | Aug 1, 2023 | Logical Reasoning | —Unverified | 0 |
| Is ChatGPT a Good Personality Recognizer? A Preliminary Study | Jul 8, 2023 | FairnessLogical Reasoning | —Unverified | 0 |
| Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models | Jun 30, 2023 | Domain GeneralizationIn-Context Learning | CodeCode Available | 0 |
| What is the Title of this Paper? Solving logic puzzles using algorithms | Jun 30, 2023 | Logical Reasoning | —Unverified | 0 |
| Counterfactual Collaborative Reasoning | Jun 30, 2023 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Exploring & Exploiting High-Order Graph Structure for Sparse Knowledge Graph Completion | Jun 29, 2023 | Knowledge Graph CompletionLogical Reasoning | —Unverified | 0 |
| Evaluating Large Language Models with NeuBAROCO: Syllogistic Reasoning Ability and Human-like Biases | Jun 21, 2023 | Logical Reasoning | —Unverified | 0 |
| Language to Rewards for Robotic Skill Synthesis | Jun 14, 2023 | In-Context LearningLogical Reasoning | —Unverified | 0 |
| V-LoL: A Diagnostic Dataset for Visual Logical Learning | Jun 13, 2023 | DiagnosticLogical Reasoning | CodeCode Available | 0 |
| Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence | Jun 12, 2023 | Logical Reasoning | —Unverified | 0 |
| Human-in-the-Loop through Chain-of-Thought | Jun 10, 2023 | Logical Reasoning | —Unverified | 0 |
| LogiQA 2.0—An Improved Dataset for Logical Reasoning in Natural Language Understanding | Jun 6, 2023 | Logical ReasoningLogical Reasoning Reading Comprehension | CodeCode Available | 0 |
| ChatGPT is a Remarkable Tool -- For Experts | Jun 2, 2023 | Logical Reasoning | —Unverified | 0 |
| Knowledge-based Reasoning and Learning under Partial Observability in Ad Hoc Teamwork | Jun 1, 2023 | Logical Reasoning | —Unverified | 0 |
| InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual Illusion | May 28, 2023 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Synthesizing a Progression of Subtasks for Block-Based Visual Programming Tasks | May 27, 2023 | Logical Reasoning | CodeCode Available | 0 |