| Language Model Behavior: A Comprehensive Survey | Mar 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| FLAME: Self-Supervised Low-Resource Taxonomy Expansion using Large Language Models | Feb 21, 2024 | Recommendation SystemsTaxonomy Expansion | CodeCode Available | 0 | 5 |
| COFAR: Commonsense and Factual Reasoning in Image Search | Oct 16, 2022 | Image RetrievalRetrieval | CodeCode Available | 0 | 5 |
| Intrinsic Knowledge Evaluation on Chinese Language Models | Nov 29, 2020 | World Knowledge | CodeCode Available | 0 | 5 |
| Finding Motifs in Knowledge Graphs using Compression | Apr 16, 2021 | Knowledge GraphsWorld Knowledge | CodeCode Available | 0 | 5 |
| Investigating associative, switchable and negatable Winograd items on renewed French data sets | Jun 1, 2022 | NegationWorld Knowledge | CodeCode Available | 0 | 5 |
| Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions | Nov 20, 2023 | Question AnsweringVisual Question Answering | CodeCode Available | 0 | 5 |
| Figurative Language in Recognizing Textual Entailment | Jun 2, 2021 | Natural Language InferenceRTE | CodeCode Available | 0 | 5 |
| Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation | May 24, 2025 | Image GenerationText to Image Generation | CodeCode Available | 0 | 5 |
| CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment | Mar 11, 2022 | Natural Language UnderstandingWorld Knowledge | CodeCode Available | 0 | 5 |