| Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation | May 24, 2025 | Image GenerationText to Image Generation | CodeCode Available | 0 | 5 |
| CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment | Mar 11, 2022 | Natural Language UnderstandingWorld Knowledge | CodeCode Available | 0 | 5 |
| Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models | May 7, 2021 | Coherence EvaluationLanguage Modelling | CodeCode Available | 0 | 5 |
| Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Jul 22, 2024 | DisentanglementQuestion Answering | CodeCode Available | 0 | 5 |
| Interweaving Memories of a Siamese Large Language Model | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Intrinsic Knowledge Evaluation on Chinese Language Models | Nov 29, 2020 | World Knowledge | CodeCode Available | 0 | 5 |
| Investigating associative, switchable and negatable Winograd items on renewed French data sets | Jun 1, 2022 | NegationWorld Knowledge | CodeCode Available | 0 | 5 |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Jun 17, 2024 | In-Context Learningvalid | CodeCode Available | 0 | 5 |
| StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning | Nov 16, 2023 | Question AnsweringWorld Knowledge | CodeCode Available | 0 | 5 |
| Improving Neural Story Generation by Targeted Common Sense Grounding | Aug 26, 2019 | Common Sense ReasoningMulti-Task Learning | CodeCode Available | 0 | 5 |