| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choiceTriplet | CodeCode Available | 0 |
| Penetrative AI: Making LLMs Comprehend the Physical World | Oct 14, 2023 | Common Sense ReasoningWorld Knowledge | —Unverified | 0 |
| Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection | Oct 12, 2023 | DescriptiveOut-of-Distribution Detection | —Unverified | 0 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information RetrievalInformativeness | CodeCode Available | 1 |
| How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances | Oct 11, 2023 | World Knowledge | CodeCode Available | 1 |
| Mistral 7B | Oct 10, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Self-Knowledge Guided Retrieval Augmentation for Large Language Models | Oct 8, 2023 | Question AnsweringRetrieval | —Unverified | 0 |
| Compositional Semantics for Open Vocabulary Spatio-semantic Representations | Oct 8, 2023 | World Knowledge | —Unverified | 0 |
| Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU | Oct 7, 2023 | Multi-task Language UnderstandingWorld Knowledge | CodeCode Available | 1 |
| FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation | Oct 5, 2023 | HallucinationWorld Knowledge | CodeCode Available | 2 |