| Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation | Dec 5, 2023 | Logical Reasoning | CodeCode Available | 2 |
| Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games | Dec 1, 2023 | AI AgentIn-Context Learning | —Unverified | 0 |
| MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI | Nov 27, 2023 | Complex Query AnsweringLogical Reasoning | CodeCode Available | 5 |
| Generation of Explanations for Logic Reasoning | Nov 22, 2023 | Logical ReasoningPhilosophy | —Unverified | 0 |
| Enhancing Logical Reasoning in Large Language Models to Facilitate Legal Applications | Nov 22, 2023 | FairnessLegal Reasoning | —Unverified | 0 |
| De-fine: Decomposing and Refining Visual Programs with Auto-Feedback | Nov 21, 2023 | Logical Reasoning | —Unverified | 0 |
| WatME: Towards Lossless Watermarking Through Lexical Redundancy | Nov 16, 2023 | Instruction FollowingLanguage Modelling | —Unverified | 0 |
| FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models | Nov 16, 2023 | Instruction FollowingLogical Reasoning | —Unverified | 0 |
| Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs | Nov 16, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 1 |
| A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning | Nov 14, 2023 | Logical FallaciesLogical Reasoning | CodeCode Available | 0 |
| Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study | Nov 13, 2023 | Logical ReasoningPrompt Engineering | CodeCode Available | 0 |
| From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models | Nov 12, 2023 | Language ModellingLogical Reasoning | —Unverified | 0 |
| Are LLMs Rigorous Logical Reasoner? Empowering Natural Language Proof Generation with Contrastive Stepwise Decoding | Nov 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Let's Reinforce Step by Step | Nov 10, 2023 | GSM8KLogical Reasoning | —Unverified | 0 |
| Language Models can be Logical Solvers | Nov 10, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Chain of Images for Intuitively Reasoning | Nov 9, 2023 | Common Sense ReasoningLanguage Modelling | CodeCode Available | 1 |
| COOL: A Constraint Object-Oriented Logic Programming Language and its Neural-Symbolic Compilation System | Nov 7, 2023 | Logical Reasoning | —Unverified | 0 |
| Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions | Nov 5, 2023 | Logical ReasoningMultiple-choice | —Unverified | 0 |
| Rule Learning as Machine Translation using the Atomic Knowledge Bank | Nov 5, 2023 | Logical ReasoningMachine Translation | CodeCode Available | 0 |
| LLM4Drive: A Survey of Large Language Models for Autonomous Driving | Nov 2, 2023 | Autonomous DrivingFew-Shot Learning | CodeCode Available | 3 |
| Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis | Nov 1, 2023 | Logical ReasoningPrompt Engineering | CodeCode Available | 0 |
| Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace | Oct 30, 2023 | Code GenerationLogical Reasoning | CodeCode Available | 1 |
| DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy | Oct 28, 2023 | Logical Reasoning | CodeCode Available | 1 |
| Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings | Oct 26, 2023 | DisentanglementLogical Reasoning | CodeCode Available | 0 |
| POE: Process of Elimination for Multiple Choice Reasoning | Oct 24, 2023 | In-Context LearningLogical Reasoning | CodeCode Available | 0 |