| ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics | Oct 27, 2022 | Machine TranslationTranslation | CodeCode Available | 1 | 5 |
| CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge | Nov 2, 2018 | Common Sense ReasoningMultiple-choice | CodeCode Available | 1 | 5 |
| Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection | Apr 9, 2024 | Human-Object Interaction DetectionWorld Knowledge | CodeCode Available | 1 | 5 |
| Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Jan 21, 2025 | Synthetic Data GenerationWorld Knowledge | CodeCode Available | 1 | 5 |
| LLaRA: Large Language-Recommendation Assistant | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Differentially Private Federated Knowledge Graphs Embedding | May 17, 2021 | Graph EmbeddingKnowledge Graph Embedding | CodeCode Available | 1 | 5 |
| Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers | May 24, 2020 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 1 | 5 |
| Common Sense Enhanced Knowledge-based Recommendation with Large Language Model | Mar 27, 2024 | Common Sense ReasoningKnowledge Graphs | CodeCode Available | 1 | 5 |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Apr 22, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 1 | 5 |
| A Unified Encoder-Decoder Framework with Entity Memory | Oct 7, 2022 | DecoderQuestion Answering | CodeCode Available | 1 | 5 |
| Learning or Self-aligning? Rethinking Instruction Fine-tuning | Feb 28, 2024 | World Knowledge | CodeCode Available | 1 | 5 |
| Combo of Thinking and Observing for Outside-Knowledge VQA | May 10, 2023 | DecoderQuestion Answering | CodeCode Available | 1 | 5 |
| Lbl2Vec: An Embedding-Based Approach for Unsupervised Document Retrieval on Predefined Topics | Oct 12, 2022 | Document ClassificationRetrieval | CodeCode Available | 1 | 5 |
| Counterfactual reasoning: Do language models need world knowledge for causal understanding? | Dec 6, 2022 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 | 5 |
| Lenna: Language Enhanced Reasoning Detection Assistant | Dec 5, 2023 | World Knowledge | CodeCode Available | 1 | 5 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 | 5 |
| Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models | May 15, 2024 | AI AgentWorld Knowledge | CodeCode Available | 1 | 5 |
| Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language | Mar 1, 2021 | SentenceWorld Knowledge | CodeCode Available | 1 | 5 |
| Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Jun 6, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 1 | 5 |
| Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information | Nov 2, 2023 | ImputationKnowledge Graph Completion | CodeCode Available | 1 | 5 |
| CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet | Aug 1, 2021 | Entity LinkingEntity Typing | CodeCode Available | 1 | 5 |
| Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models | Apr 9, 2024 | Few-Shot LearningLanguage Modelling | CodeCode Available | 1 | 5 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information RetrievalInformativeness | CodeCode Available | 1 | 5 |
| Large-Scale Relation Learning for Question Answering over Knowledge Bases with Pre-trained Language Models | Nov 1, 2021 | Question AnsweringRelation | CodeCode Available | 1 | 5 |
| Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries | Aug 20, 2020 | World Knowledge | CodeCode Available | 1 | 5 |