| Galactica: A Large Language Model for Science | Nov 16, 2022 | AnachronismsBias Detection | CodeCode Available | 4 |
| PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain | Oct 22, 2023 | Dialogue GenerationDialogue Understanding | CodeCode Available | 2 |
| LambdaKG: A Library for Pre-trained Language Model-Based Knowledge Graph Embeddings | Oct 1, 2022 | Graph Representation LearningKnowledge Graph Completion | CodeCode Available | 2 |
| mGPT: Few-Shot Learners Go Multilingual | Apr 15, 2022 | Cross-Lingual Natural Language InferenceCross-Lingual Paraphrase Identification | CodeCode Available | 2 |
| GPT Understands, Too | Mar 18, 2021 | Knowledge ProbingLanguage Modeling | CodeCode Available | 2 |
| BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Apr 5, 2024 | Factual probeGeneral Knowledge | CodeCode Available | 1 |
| Using Large Language Models for Knowledge Engineering (LLMKE): A Case Study on Wikidata | Sep 15, 2023 | Knowledge Probing | CodeCode Available | 1 |
| Can Language Models Solve Graph Problems in Natural Language? | May 17, 2023 | In-Context LearningKnowledge Probing | CodeCode Available | 1 |
| LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development | May 12, 2023 | Knowledge ProbingLanguage Modeling | CodeCode Available | 1 |
| Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding | Mar 21, 2023 | Knowledge ProbingLanguage Modelling | CodeCode Available | 1 |
| When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories | Dec 20, 2022 | Knowledge ProbingMemorization | CodeCode Available | 1 |
| Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems | Dec 15, 2022 | Knowledge ProbingResponse Generation | CodeCode Available | 1 |
| COPEN: Probing Conceptual Knowledge in Pre-trained Language Models | Nov 8, 2022 | Knowledge Probing | CodeCode Available | 1 |
| Calibrating Factual Knowledge in Pretrained Language Models | Oct 7, 2022 | Knowledge ProbingQuestion Answering | CodeCode Available | 1 |
| Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models | Oct 15, 2021 | Knowledge ProbingTransfer Learning | CodeCode Available | 1 |
| CoLAKE: Contextualized Language and Knowledge Embedding | Oct 1, 2020 | Entity EmbeddingsKnowledge Graph Completion | CodeCode Available | 1 |
| Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing | May 27, 2025 | Knowledge Probing | —Unverified | 0 |
| "Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities | Oct 16, 2024 | Knowledge ProbingLogical Reasoning | —Unverified | 0 |
| Correlation and Navigation in the Vocabulary Key Representation Space of Language Models | Oct 3, 2024 | DiversityKnowledge Probing | CodeCode Available | 0 |
| LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models | Aug 28, 2024 | Continual LearningKnowledge Probing | —Unverified | 0 |
| Knowledge Probing for Graph Representation Learning | Aug 7, 2024 | Graph ClassificationGraph Learning | —Unverified | 0 |
| What Matters in Memorizing and Recalling Facts? Multifaceted Benchmarks for Knowledge Probing in Language Models | Jun 18, 2024 | DecoderHallucination | —Unverified | 0 |
| Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies | Jun 6, 2024 | HallucinationKnowledge Probing | —Unverified | 0 |
| Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph | Apr 4, 2024 | Claim VerificationCommon Sense Reasoning | CodeCode Available | 0 |
| TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models | Mar 17, 2024 | Knowledge GraphsKnowledge Probing | —Unverified | 0 |