| ASER: A Large-scale Eventuality Knowledge Graph | May 1, 2019 | Knowledge GraphsWorld Knowledge | CodeCode Available | 1 |
| Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization | Aug 30, 2021 | Abstractive Text SummarizationReinforcement Learning (RL) | CodeCode Available | 1 |
| Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition | Oct 8, 2020 | Question AnsweringWorld Knowledge | CodeCode Available | 1 |
| Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors | Nov 20, 2022 | Model EditingWorld Knowledge | CodeCode Available | 1 |
| InGram: Inductive Knowledge Graph Embedding via Relation Graphs | May 31, 2023 | Entity EmbeddingsGraph Embedding | CodeCode Available | 1 |
| Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds | May 27, 2023 | Task PlanningWorld Knowledge | CodeCode Available | 1 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative FilteringLanguage Modeling | CodeCode Available | 1 |
| How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances | Oct 11, 2023 | World Knowledge | CodeCode Available | 1 |
| AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Dec 9, 2024 | World Knowledge | CodeCode Available | 1 |
| Breaking NLI Systems with Sentences that Require Simple Lexical Inferences | May 6, 2018 | World Knowledge | CodeCode Available | 1 |
| A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge | Jun 3, 2022 | Question AnsweringVisual Question Answering | CodeCode Available | 1 |
| HeadlineCause: A Dataset of News Headlines for Detecting Causalities | Aug 28, 2021 | Commonsense Causal ReasoningCommon Sense Reasoning | CodeCode Available | 1 |
| Can LLMs' Tuning Methods Work in Medical Multimodal Domain? | Mar 11, 2024 | Transfer LearningWorld Knowledge | CodeCode Available | 1 |
| A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering | Nov 13, 2023 | Decision MakingExplanation Generation | CodeCode Available | 1 |
| Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? | Aug 20, 2023 | Knowledge GraphsWorld Knowledge | CodeCode Available | 1 |
| Imagine This! Scripts to Compositions to Videos | Apr 10, 2018 | RetrievalWorld Knowledge | CodeCode Available | 1 |
| Knowledge Editing through Chain-of-Thought | Dec 23, 2024 | knowledge editingWorld Knowledge | CodeCode Available | 1 |
| FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data Classification | Mar 18, 2025 | Combinatorial OptimizationContrastive Learning | CodeCode Available | 1 |
| Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge | Apr 6, 2021 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 1 |
| F-ViTA: Foundation Model Guided Visible to Thermal Translation | Apr 3, 2025 | Scene UnderstandingStyle Transfer | CodeCode Available | 1 |
| GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains | May 24, 2025 | geo-localizationVisual Reasoning | CodeCode Available | 1 |
| GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants | Feb 12, 2024 | Code GenerationManagement | CodeCode Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| FELM: Benchmarking Factuality Evaluation of Large Language Models | Oct 1, 2023 | BenchmarkingMath | CodeCode Available | 1 |