| Do We Know What LLMs Don't Know? A Study of Consistency in Knowledge Probing | May 27, 2025 | Knowledge Probing | —Unverified | 0 |
| "Let's Argue Both Sides": Argument Generation Can Force Small Models to Utilize Previously Inaccessible Reasoning Capabilities | Oct 16, 2024 | Knowledge ProbingLogical Reasoning | —Unverified | 0 |
| Correlation and Navigation in the Vocabulary Key Representation Space of Language Models | Oct 3, 2024 | DiversityKnowledge Probing | CodeCode Available | 0 |
| LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models | Aug 28, 2024 | Continual LearningKnowledge Probing | —Unverified | 0 |
| Knowledge Probing for Graph Representation Learning | Aug 7, 2024 | Graph ClassificationGraph Learning | —Unverified | 0 |
| What Matters in Memorizing and Recalling Facts? Multifaceted Benchmarks for Knowledge Probing in Language Models | Jun 18, 2024 | DecoderHallucination | —Unverified | 0 |
| Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies | Jun 6, 2024 | HallucinationKnowledge Probing | —Unverified | 0 |
| BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models | Apr 5, 2024 | Factual probeGeneral Knowledge | CodeCode Available | 1 |
| Unveiling LLMs: The Evolution of Latent Representations in a Dynamic Knowledge Graph | Apr 4, 2024 | Claim VerificationCommon Sense Reasoning | CodeCode Available | 0 |
| TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models | Mar 17, 2024 | Knowledge GraphsKnowledge Probing | —Unverified | 0 |
| Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge | Mar 8, 2024 | Cross-Lingual TransferKnowledge Probing | CodeCode Available | 0 |
| Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation | Jan 27, 2024 | HallucinationKnowledge Probing | —Unverified | 0 |
| Language Representation Projection: Can We Transfer Factual Knowledge across Languages in Multilingual Language Models? | Nov 7, 2023 | Knowledge ProbingRetrieval | —Unverified | 0 |
| Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models | Oct 25, 2023 | Knowledge ProbingWorld Knowledge | —Unverified | 0 |
| Kiki or Bouba? Sound Symbolism in Vision-and-Language Models | Oct 25, 2023 | Knowledge Probing | —Unverified | 0 |
| PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain | Oct 22, 2023 | Dialogue GenerationDialogue Understanding | CodeCode Available | 2 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | HallucinationKnowledge Probing | CodeCode Available | 0 |
| Using Large Language Models for Knowledge Engineering (LLMKE): A Case Study on Wikidata | Sep 15, 2023 | Knowledge Probing | CodeCode Available | 1 |
| Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names? | Sep 14, 2023 | Code GenerationKnowledge Probing | —Unverified | 0 |
| Can Language Models Solve Graph Problems in Natural Language? | May 17, 2023 | In-Context LearningKnowledge Probing | CodeCode Available | 1 |
| LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development | May 12, 2023 | Knowledge ProbingLanguage Modeling | CodeCode Available | 1 |
| Knowledge-augmented Frame Semantic Parsing with Hybrid Prompt-tuning | Mar 25, 2023 | Knowledge ProbingSemantic Parsing | —Unverified | 0 |
| Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding | Mar 21, 2023 | Knowledge ProbingLanguage Modelling | CodeCode Available | 1 |
| When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories | Dec 20, 2022 | Knowledge ProbingMemorization | CodeCode Available | 1 |
| Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems | Dec 15, 2022 | Knowledge ProbingResponse Generation | CodeCode Available | 1 |