| BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering | May 25, 2023 | Binary ClassificationKnowledge Graphs | CodeCode Available | 0 |
| ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind | May 24, 2023 | Multiple-choiceQuestion Answering | CodeCode Available | 0 |
| Have Large Language Models Developed a Personality?: Applicability of Self-Assessment Tests in Measuring Personality in LLMs | May 24, 2023 | Multiple-choice | —Unverified | 0 |
| This Land is Your, My Land: Evaluating Geopolitical Biases in Language Models | May 24, 2023 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy | May 24, 2023 | In-Context LearningMultiple-choice | CodeCode Available | 0 |
| Make a Choice! Knowledge Base Question Answering with In-Context Learning | May 23, 2023 | In-Context LearningKnowledge Base Question Answering | —Unverified | 0 |
| Query Rewriting for Retrieval-Augmented Large Language Models | May 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NarrativeXL: A Large-scale Dataset For Long-Term Memory Models | May 23, 2023 | Multiple-choiceReading Comprehension | CodeCode Available | 1 |
| Iterative Forward Tuning Boosts In-Context Learning in Language Models | May 22, 2023 | Decision MakingIn-Context Learning | CodeCode Available | 0 |
| VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models | May 20, 2023 | Multiple-choiceQuestion Answering | CodeCode Available | 1 |