| Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds | May 27, 2023 | Task PlanningWorld Knowledge | CodeCode Available | 1 |
| Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation | Jul 20, 2023 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU | Oct 7, 2023 | Multi-task Language UnderstandingWorld Knowledge | CodeCode Available | 1 |
| Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA | Nov 14, 2023 | In-Context LearningProgram Synthesis | CodeCode Available | 1 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative FilteringLanguage Modeling | CodeCode Available | 1 |
| Knowledge Editing through Chain-of-Thought | Dec 23, 2024 | knowledge editingWorld Knowledge | CodeCode Available | 1 |
| MEIM: Multi-partition Embedding Interaction Beyond Block Term Format for Efficient and Expressive Link Prediction | Sep 30, 2022 | Graph EmbeddingKnowledge Graph Embedding | CodeCode Available | 1 |
| Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge | Apr 6, 2021 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 1 |
| How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances | Oct 11, 2023 | World Knowledge | CodeCode Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | BenchmarkingDecision Making | CodeCode Available | 1 |
| GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants | Feb 12, 2024 | Code GenerationManagement | CodeCode Available | 1 |
| Large Scale Knowledge Washing | May 26, 2024 | DecoderMemorization | CodeCode Available | 1 |
| An Automatic Graph Construction Framework based on Large Language Models for Recommendation | Dec 24, 2024 | graph constructionQuantization | CodeCode Available | 1 |
| Can LLMs' Tuning Methods Work in Medical Multimodal Domain? | Mar 11, 2024 | Transfer LearningWorld Knowledge | CodeCode Available | 1 |
| HeadlineCause: A Dataset of News Headlines for Detecting Causalities | Aug 28, 2021 | Commonsense Causal ReasoningCommon Sense Reasoning | CodeCode Available | 1 |
| ASER: A Large-scale Eventuality Knowledge Graph | May 1, 2019 | Knowledge GraphsWorld Knowledge | CodeCode Available | 1 |
| Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers | Dec 7, 2023 | MathMultiple-choice | CodeCode Available | 1 |
| Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models | Nov 14, 2023 | Continual LearningQuestion Answering | CodeCode Available | 1 |
| Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model | Aug 2, 2023 | HallucinationImage Captioning | CodeCode Available | 1 |
| CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge | Nov 2, 2018 | Common Sense ReasoningMultiple-choice | CodeCode Available | 1 |
| CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet | Aug 1, 2021 | Entity LinkingEntity Typing | CodeCode Available | 1 |
| Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? | Aug 20, 2023 | Knowledge GraphsWorld Knowledge | CodeCode Available | 1 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information RetrievalInformativeness | CodeCode Available | 1 |
| Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Mar 27, 2024 | Representation LearningVisual Question Answering | CodeCode Available | 1 |
| Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information | Nov 2, 2023 | ImputationKnowledge Graph Completion | CodeCode Available | 1 |