| Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation | Dec 5, 2023 | Logical Reasoning | Code Available | 2 |
| Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games | Dec 1, 2023 | AI Agent, In-Context Learning | Unverified | 0 |
| MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI | Nov 27, 2023 | Complex Query Answering, Logical Reasoning | Code Available | 5 |
| Generation of Explanations for Logic Reasoning | Nov 22, 2023 | Logical Reasoning, Philosophy | Unverified | 0 |
| Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications | Nov 22, 2023 | Fairness, Legal Reasoning | Unverified | 0 |
| De-fine: Decomposing and Refining Visual Programs with Auto-Feedback | Nov 21, 2023 | Logical Reasoning | Unverified | 0 |
| WatME: Towards Lossless Watermarking Through Lexical Redundancy | Nov 16, 2023 | Instruction Following, Language Modelling | Unverified | 0 |
| FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models | Nov 16, 2023 | Instruction Following, Logical Reasoning | Unverified | 0 |
| Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs | Nov 16, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 1 |
| A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning | Nov 14, 2023 | Logical Fallacies, Logical Reasoning | Code Available | 0 |
| Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study | Nov 13, 2023 | Logical Reasoning, Prompt Engineering | Code Available | 0 |
| From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models | Nov 12, 2023 | Language Modelling, Logical Reasoning | Unverified | 0 |
| Are LLMs Rigorous Logical Reasoner? Empowering Natural Language Proof Generation with Contrastive Stepwise Decoding | Nov 12, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Let's Reinforce Step by Step | Nov 10, 2023 | GSM8K, Logical Reasoning | Unverified | 0 |
| Language Models can be Logical Solvers | Nov 10, 2023 | Decision Making, Language Modeling | Unverified | 0 |
| Chain of Images for Intuitively Reasoning | Nov 9, 2023 | Common Sense Reasoning, Language Modelling | Code Available | 1 |
| COOL: A Constraint Object-Oriented Logic Programming Language and its Neural-Symbolic Compilation System | Nov 7, 2023 | Logical Reasoning | Unverified | 0 |
| Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions | Nov 5, 2023 | Logical Reasoning, Multiple-choice | Unverified | 0 |
| Rule Learning as Machine Translation using the Atomic Knowledge Bank | Nov 5, 2023 | Logical Reasoning, Machine Translation | Code Available | 0 |
| LLM4Drive: A Survey of Large Language Models for Autonomous Driving | Nov 2, 2023 | Autonomous Driving, Few-Shot Learning | Code Available | 3 |
| Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis | Nov 1, 2023 | Logical Reasoning, Prompt Engineering | Code Available | 0 |
| Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace | Oct 30, 2023 | Code Generation, Logical Reasoning | Code Available | 1 |
| DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy | Oct 28, 2023 | Logical Reasoning | Code Available | 1 |
| Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings | Oct 26, 2023 | Disentanglement, Logical Reasoning | Code Available | 0 |
| POE: Process of Elimination for Multiple Choice Reasoning | Oct 24, 2023 | In-Context Learning, Logical Reasoning | Code Available | 0 |
| Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention | Oct 23, 2023 | Logical Reasoning | Code Available | 0 |
| DetectGPT-SC: Improving Detection of Text Generated by Large Language Models through Self-Consistency with Masked Predictions | Oct 23, 2023 | Logical Reasoning, Text Generation | Unverified | 0 |
| Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism | Oct 23, 2023 | Logical Reasoning, Negation | Unverified | 0 |
| Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts | Oct 23, 2023 | Logical Reasoning, Math | Code Available | 1 |
| LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers | Oct 23, 2023 | Logical Reasoning | Code Available | 1 |
| Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring | Oct 20, 2023 | Logical Reasoning, Response Generation | Unverified | 0 |
| Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World | Oct 16, 2023 | Few-Shot Learning, Form | Code Available | 1 |
| Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning | Oct 13, 2023 | Data Augmentation, Logical Reasoning | Code Available | 1 |
| Improving Large Language Models in Event Relation Logical Prediction | Oct 13, 2023 | counterfactual, Event Relation Extraction | Code Available | 1 |
| GLoRE: Evaluating Logical Reasoning of Large Language Models | Oct 13, 2023 | Logical Reasoning, Natural Language Understanding | Code Available | 1 |
| The potential of large language models for improving probability learning: A study on ChatGPT3.5 and first-year computer engineering students | Oct 9, 2023 | Language Modelling, Logical Reasoning | Unverified | 0 |
| Empower Nested Boolean Logic via Self-Supervised Curriculum Learning | Oct 9, 2023 | Logical Reasoning, Self-Supervised Learning | Code Available | 0 |
| DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers | Oct 5, 2023 | Decoder, Logical Reasoning | Code Available | 0 |
| Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance | Oct 3, 2023 | Code Generation, Logical Reasoning | Code Available | 0 |
| Learning Reliable Logical Rules with SATNet | Oct 3, 2023 | Logical Reasoning | Unverified | 0 |
| Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models | Oct 2, 2023 | Knowledge Distillation, Language Modelling | Unverified | 0 |
| DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks | Sep 29, 2023 | Logical Reasoning | Unverified | 0 |
| Physics of Language Models: Part 3.2, Knowledge Manipulation | Sep 25, 2023 | Attribute, Language Modelling | Unverified | 0 |
| EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning | Sep 16, 2023 | Date Understanding, GSM8K | Code Available | 0 |
| Do PLMs Know and Understand Ontological Knowledge? | Sep 12, 2023 | Logical Reasoning, Memorization | Code Available | 1 |
| HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models | Sep 6, 2023 | General Knowledge, Logical Reasoning | Code Available | 1 |
| On the Potential of CLIP for Compositional Logical Reasoning | Aug 30, 2023 | Logical Reasoning, Visual Reasoning | Unverified | 0 |
| LR-XFL: Logical Reasoning-based Explainable Federated Learning | Aug 24, 2023 | Federated Learning, Logical Reasoning | Code Available | 0 |
| Human Comprehensible Active Learning of Genome-Scale Metabolic Networks | Aug 24, 2023 | Active Learning, Experimental Design | Unverified | 0 |
| LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking Puzzles | Aug 21, 2023 | Logical Reasoning | Code Available | 1 |