| Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation | Jul 20, 2023 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds | May 27, 2023 | Task PlanningWorld Knowledge | CodeCode Available | 1 |
| Knowledge Graph Contrastive Learning for Recommendation | May 2, 2022 | Contrastive LearningGeneral Knowledge | CodeCode Available | 1 |
| Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Jan 21, 2025 | Synthetic Data GenerationWorld Knowledge | CodeCode Available | 1 |
| Is ChatGPT a Good Recommender? A Preliminary Study | Apr 20, 2023 | Recommendation SystemsWorld Knowledge | CodeCode Available | 1 |
| Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries | Aug 20, 2020 | World Knowledge | CodeCode Available | 1 |
| InGram: Inductive Knowledge Graph Embedding via Relation Graphs | May 31, 2023 | Entity EmbeddingsGraph Embedding | CodeCode Available | 1 |
| Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition | Oct 8, 2020 | Question AnsweringWorld Knowledge | CodeCode Available | 1 |
| Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization | Aug 30, 2021 | Abstractive Text SummarizationReinforcement Learning (RL) | CodeCode Available | 1 |
| KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs | Sep 9, 2021 | Common Sense ReasoningLanguage Modelling | CodeCode Available | 1 |
| ACES: Translation Accuracy Challenge Sets for Evaluating Machine Translation Metrics | Oct 27, 2022 | Machine TranslationTranslation | CodeCode Available | 1 |
| Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration | Sep 30, 2023 | World Knowledge | CodeCode Available | 1 |
| CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge | Nov 2, 2018 | Common Sense ReasoningMultiple-choice | CodeCode Available | 1 |
| Counterfactual reasoning: Do language models need world knowledge for causal understanding? | Dec 6, 2022 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 |
| Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios | May 26, 2023 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | May 9, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 1 |
| I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Dec 9, 2024 | World Knowledge | CodeCode Available | 1 |
| Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language | Mar 1, 2021 | SentenceWorld Knowledge | CodeCode Available | 1 |
| CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models | Sep 27, 2024 | Reinforcement Learning (RL)World Knowledge | CodeCode Available | 1 |
| Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information | Nov 2, 2023 | ImputationKnowledge Graph Completion | CodeCode Available | 1 |
| Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers | May 24, 2020 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 1 |
| Common Sense Enhanced Knowledge-based Recommendation with Large Language Model | Mar 27, 2024 | Common Sense ReasoningKnowledge Graphs | CodeCode Available | 1 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information RetrievalInformativeness | CodeCode Available | 1 |
| MEIM: Multi-partition Embedding Interaction Beyond Block Term Format for Efficient and Expressive Link Prediction | Sep 30, 2022 | Graph EmbeddingKnowledge Graph Embedding | CodeCode Available | 1 |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Apr 22, 2024 | BenchmarkingWorld Knowledge | CodeCode Available | 1 |
| Mixture of Low-rank Experts for Transferable AI-Generated Image Detection | Apr 7, 2024 | Descriptiveparameter-efficient fine-tuning | CodeCode Available | 1 |
| Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers | Dec 7, 2023 | MathMultiple-choice | CodeCode Available | 1 |
| Imagine This! Scripts to Compositions to Videos | Apr 10, 2018 | RetrievalWorld Knowledge | CodeCode Available | 1 |
| Lbl2Vec: An Embedding-Based Approach for Unsupervised Document Retrieval on Predefined Topics | Oct 12, 2022 | Document ClassificationRetrieval | CodeCode Available | 1 |
| Dense X Retrieval: What Retrieval Granularity Should We Use? | Dec 11, 2023 | RetrievalSentence | CodeCode Available | 1 |
| A Unified Encoder-Decoder Framework with Entity Memory | Oct 7, 2022 | DecoderQuestion Answering | CodeCode Available | 1 |
| Combo of Thinking and Observing for Outside-Knowledge VQA | May 10, 2023 | DecoderQuestion Answering | CodeCode Available | 1 |
| Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? | Aug 20, 2023 | Knowledge GraphsWorld Knowledge | CodeCode Available | 1 |
| PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning | May 10, 2024 | DecoderGeneralization Bounds | CodeCode Available | 1 |
| Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement | Sep 17, 2024 | Active LearningDiversity | CodeCode Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge | Apr 6, 2021 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 1 |
| Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Jun 6, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances | Oct 11, 2023 | World Knowledge | CodeCode Available | 1 |
| GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains | May 24, 2025 | geo-localizationVisual Reasoning | CodeCode Available | 1 |
| GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants | Feb 12, 2024 | Code GenerationManagement | CodeCode Available | 1 |
| CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet | Aug 1, 2021 | Entity LinkingEntity Typing | CodeCode Available | 1 |
| HeadlineCause: A Dataset of News Headlines for Detecting Causalities | Aug 28, 2021 | Commonsense Causal ReasoningCommon Sense Reasoning | CodeCode Available | 1 |
| REALM: Retrieval-Augmented Language Model Pre-Training | Feb 10, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| F-ViTA: Foundation Model Guided Visible to Thermal Translation | Apr 3, 2025 | Scene UnderstandingStyle Transfer | CodeCode Available | 1 |
| FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data Classification | Mar 18, 2025 | Combinatorial OptimizationContrastive Learning | CodeCode Available | 1 |
| Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Jul 10, 2024 | counterfactualFact Checking | CodeCode Available | 1 |
| Differentially Private Federated Knowledge Graphs Embedding | May 17, 2021 | Graph EmbeddingKnowledge Graph Embedding | CodeCode Available | 1 |
| Do PLMs Know and Understand Ontological Knowledge? | Sep 12, 2023 | Logical ReasoningMemorization | CodeCode Available | 1 |
| FELM: Benchmarking Factuality Evaluation of Large Language Models | Oct 1, 2023 | BenchmarkingMath | CodeCode Available | 1 |