| TimeLogic: A Temporal Logic Benchmark for Video QA | Jan 13, 2025 | 2kAction Segmentation | —Unverified | 0 |
| Neural Probabilistic Circuits: Enabling Compositional and Interpretable Predictions through Logical Reasoning | Jan 13, 2025 | Attributecounterfactual | —Unverified | 0 |
| Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization | Jan 9, 2025 | Information RetrievalLogical Reasoning | —Unverified | 0 |
| Enhancing Transformers for Generalizable First-Order Logical Entailment | Jan 1, 2025 | Logical ReasoningOut-of-Distribution Generalization | —Unverified | 0 |
| KnowRA: Knowledge Retrieval Augmented Method for Document-level Relation Extraction with Comprehensive Reasoning Abilities | Dec 31, 2024 | Common Sense ReasoningDocument-level Relation Extraction | —Unverified | 0 |
| SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity | Dec 30, 2024 | BenchmarkingCode Generation | —Unverified | 0 |
| StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs | Dec 23, 2024 | BenchmarkingLogical Reasoning | —Unverified | 0 |
| Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework | Dec 22, 2024 | Logical Reasoning | CodeCode Available | 0 |
| Formal Language Knowledge Corpus for Retrieval Augmented Generation | Dec 21, 2024 | Logical ReasoningMathematical Proofs | —Unverified | 0 |
| Logical Consistency of Large Language Models in Fact-checking | Dec 20, 2024 | Fact CheckingHallucination | —Unverified | 0 |
| SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models | Dec 17, 2024 | Logical ReasoningSpatial Reasoning | CodeCode Available | 0 |
| Reasoning-Aware Query-Focused Summarization over Multi-Table Data | Dec 12, 2024 | Logical ReasoningQuery-focused Summarization | —Unverified | 0 |
| Federated In-Context LLM Agent Learning | Dec 11, 2024 | Federated LearningIn-Context Learning | —Unverified | 0 |
| Algorithmic Phase Transitions in Language Models: A Mechanistic Case Study of Arithmetic | Dec 10, 2024 | Logical Reasoning | —Unverified | 0 |
| Can OpenAI o1 outperform humans in higher-order cognitive thinking? | Dec 7, 2024 | Logical Reasoning | —Unverified | 0 |
| Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games | Dec 6, 2024 | Logical Reasoning | CodeCode Available | 0 |
| MTMT: Consolidating Multiple Thinking Modes to Form a Thought Tree for Strengthening LLM | Dec 5, 2024 | counterfactualForm | —Unverified | 0 |
| Guidance is All You Need: Temperature-Guided Reasoning in Large Language Models | Dec 5, 2024 | AllComputational Efficiency | —Unverified | 0 |
| Reverse Thinking Makes LLMs Stronger Reasoners | Nov 29, 2024 | Data AugmentationKnowledge Distillation | —Unverified | 0 |
| SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment | Nov 27, 2024 | ClassificationDecision Making | —Unverified | 0 |
| Dspy-based Neural-Symbolic Pipeline to Enhance Spatial Reasoning in LLMs | Nov 27, 2024 | Logical ReasoningSemantic Parsing | —Unverified | 0 |
| Learning for Long-Horizon Planning via Neuro-Symbolic Abductive Imitation | Nov 27, 2024 | Imitation LearningLogical Reasoning | CodeCode Available | 0 |
| Object-centric proto-symbolic behavioural reasoning from pixels | Nov 26, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Meaningless is better: hashing bias-inducing words in LLM prompts improves performance in logical reasoning and statistical learning | Nov 26, 2024 | HallucinationLogical Reasoning | —Unverified | 0 |
| HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator | Nov 26, 2024 | Common Sense ReasoningLogical Reasoning | —Unverified | 0 |