| Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge | Apr 6, 2021 | Common Sense Reasoning, World Knowledge | Code Available | 1 |
| LLaRA: Large Language-Recommendation Assistant | Dec 5, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | Benchmarking, Decision Making | Code Available | 1 |
| Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA | Nov 14, 2023 | In-Context Learning, Program Synthesis | Code Available | 1 |
| Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts | Oct 31, 2023 | Image Captioning, Language Modeling | Code Available | 1 |
| Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets | Jan 29, 2024 | Benchmarking, Machine Translation | Code Available | 1 |
| Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries | Aug 20, 2020 | World Knowledge | Code Available | 1 |
| Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU | Oct 7, 2023 | Multi-task Language Understanding, World Knowledge | Code Available | 1 |
| Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration | Sep 30, 2023 | World Knowledge | Code Available | 1 |
| An Automatic Graph Construction Framework based on Large Language Models for Recommendation | Dec 24, 2024 | graph construction, Quantization | Code Available | 1 |
| KoLA: Carefully Benchmarking World Knowledge of Large Language Models | Jun 15, 2023 | Benchmarking, Hallucination | Code Available | 1 |
| Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers | Dec 7, 2023 | Math, Multiple-choice | Code Available | 1 |
| O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering | May 22, 2025 | Answer Generation, Open-Ended Question Answering | Code Available | 1 |
| Can LLMs' Tuning Methods Work in Medical Multimodal Domain? | Mar 11, 2024 | Transfer Learning, World Knowledge | Code Available | 1 |
| Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model | Aug 2, 2023 | Hallucination, Image Captioning | Code Available | 1 |
| ASER: A Large-scale Eventuality Knowledge Graph | May 1, 2019 | Knowledge Graphs, World Knowledge | Code Available | 1 |
| Large Scale Knowledge Washing | May 26, 2024 | Decoder, Memorization | Code Available | 1 |
| Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models | Nov 14, 2023 | Continual Learning, Question Answering | Code Available | 1 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information Retrieval, Informativeness | Code Available | 1 |
| OpenMix: Exploring Outlier Samples for Misclassification Detection | Mar 30, 2023 | World Knowledge | Code Available | 1 |
| Chain-of-Skills: A Configurable Model for Open-domain Question Answering | May 4, 2023 | Open-Domain Question Answering, Question Answering | Code Available | 1 |
| PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning | May 10, 2024 | Decoder, Generalization Bounds | Code Available | 1 |
| Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Mar 27, 2024 | Representation Learning, Visual Question Answering | Code Available | 1 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative Filtering, Language Modeling | Code Available | 1 |
| Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information | Nov 2, 2023 | Imputation, Knowledge Graph Completion | Code Available | 1 |
| Analyzing Knowledge Graph Embedding Methods from a Multi-Embedding Interaction Perspective | Mar 27, 2019 | Graph Embedding, Knowledge Graph Embedding | Code Available | 1 |
| Knowledge Editing through Chain-of-Thought | Dec 23, 2024 | knowledge editing, World Knowledge | Code Available | 1 |
| Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation | Jul 20, 2023 | Open-Domain Question Answering, Question Answering | Code Available | 1 |
| Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds | May 27, 2023 | Task Planning, World Knowledge | Code Available | 1 |
| Is ChatGPT a Good Recommender? A Preliminary Study | Apr 20, 2023 | Recommendation Systems, World Knowledge | Code Available | 1 |
| BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Apr 5, 2024 | Factual probe, General Knowledge | Code Available | 1 |
| Counterfactual reasoning: Do language models need world knowledge for causal understanding? | Dec 6, 2022 | counterfactual, Counterfactual Reasoning | Code Available | 1 |
| KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs | Sep 9, 2021 | Common Sense Reasoning, Language Modelling | Code Available | 1 |
| Knowledge Graph Contrastive Learning for Recommendation | May 2, 2022 | Contrastive Learning, General Knowledge | Code Available | 1 |
| Large-Scale Relation Learning for Question Answering over Knowledge Bases with Pre-trained Language Models | Nov 1, 2021 | Question Answering, Relation | Code Available | 1 |
| I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Dec 9, 2024 | World Knowledge | Code Available | 1 |
| Imagine This! Scripts to Compositions to Videos | Apr 10, 2018 | Retrieval, World Knowledge | Code Available | 1 |
| How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances | Oct 11, 2023 | World Knowledge | Code Available | 1 |
| Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition | Oct 8, 2020 | Question Answering, World Knowledge | Code Available | 1 |
| Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? | Aug 20, 2023 | Knowledge Graphs, World Knowledge | Code Available | 1 |
| GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains | May 24, 2025 | geo-localization, Visual Reasoning | Code Available | 1 |
| HeadlineCause: A Dataset of News Headlines for Detecting Causalities | Aug 28, 2021 | Commonsense Causal Reasoning, Common Sense Reasoning | Code Available | 1 |
| ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics | Oct 27, 2022 | Machine Translation, Translation | Code Available | 1 |
| CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge | Nov 2, 2018 | Common Sense Reasoning, Multiple-choice | Code Available | 1 |
| Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers | May 24, 2020 | Common Sense Reasoning, World Knowledge | Code Available | 1 |
| Common Sense Enhanced Knowledge-based Recommendation with Large Language Model | Mar 27, 2024 | Common Sense Reasoning, Knowledge Graphs | Code Available | 1 |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Apr 22, 2024 | Benchmarking, World Knowledge | Code Available | 1 |
| GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants | Feb 12, 2024 | Code Generation, Management | Code Available | 1 |
| InGram: Inductive Knowledge Graph Embedding via Relation Graphs | May 31, 2023 | Entity Embeddings, Graph Embedding | Code Available | 1 |
| A Unified Encoder-Decoder Framework with Entity Memory | Oct 7, 2022 | Decoder, Question Answering | Code Available | 1 |