| FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models | Nov 16, 2023 | Instruction Following, Logical Reasoning | Unverified | 0 |
| A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning | Nov 14, 2023 | Logical Fallacies, Logical Reasoning | Code Available | 0 |
| Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study | Nov 13, 2023 | Logical Reasoning, Prompt Engineering | Code Available | 0 |
| From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models | Nov 12, 2023 | Language Modelling, Logical Reasoning | Unverified | 0 |
| Are LLMs Rigorous Logical Reasoner? Empowering Natural Language Proof Generation with Contrastive Stepwise Decoding | Nov 12, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Language Models can be Logical Solvers | Nov 10, 2023 | Decision Making, Language Modeling | Unverified | 0 |
| Let's Reinforce Step by Step | Nov 10, 2023 | GSM8K, Logical Reasoning | Unverified | 0 |
| COOL: A Constraint Object-Oriented Logic Programming Language and its Neural-Symbolic Compilation System | Nov 7, 2023 | Logical Reasoning | Unverified | 0 |
| Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions | Nov 5, 2023 | Logical Reasoning, Multiple-choice | Unverified | 0 |
| Rule Learning as Machine Translation using the Atomic Knowledge Bank | Nov 5, 2023 | Logical Reasoning, Machine Translation | Code Available | 0 |
| Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis | Nov 1, 2023 | Logical Reasoning, Prompt Engineering | Code Available | 0 |
| Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings | Oct 26, 2023 | Disentanglement, Logical Reasoning | Code Available | 0 |
| POE: Process of Elimination for Multiple Choice Reasoning | Oct 24, 2023 | In-Context Learning, Logical Reasoning | Code Available | 0 |
| Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention | Oct 23, 2023 | Logical Reasoning | Code Available | 0 |
| Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism | Oct 23, 2023 | Logical Reasoning, Negation | Unverified | 0 |
| DetectGPT-SC: Improving Detection of Text Generated by Large Language Models through Self-Consistency with Masked Predictions | Oct 23, 2023 | Logical Reasoning, Text Generation | Unverified | 0 |
| Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring | Oct 20, 2023 | Logical Reasoning, Response Generation | Unverified | 0 |
| The potential of large language models for improving probability learning: A study on ChatGPT3.5 and first-year computer engineering students | Oct 9, 2023 | Language Modelling, Logical Reasoning | Unverified | 0 |
| Empower Nested Boolean Logic via Self-Supervised Curriculum Learning | Oct 9, 2023 | Logical Reasoning, Self-Supervised Learning | Code Available | 0 |
| DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers | Oct 5, 2023 | Decoder, Logical Reasoning | Code Available | 0 |
| Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance | Oct 3, 2023 | Code Generation, Logical Reasoning | Code Available | 0 |
| Learning Reliable Logical Rules with SATNet | Oct 3, 2023 | Logical Reasoning | Unverified | 0 |
| Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models | Oct 2, 2023 | Knowledge Distillation, Language Modelling | Unverified | 0 |
| DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks | Sep 29, 2023 | Logical Reasoning | Unverified | 0 |
| Physics of Language Models: Part 3.2, Knowledge Manipulation | Sep 25, 2023 | Attribute, Language Modelling | Unverified | 0 |