| Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent | Mar 28, 2024 | World Knowledge | CodeCode Available | 0 |
| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition | Apr 9, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning | Nov 16, 2023 | Question AnsweringWorld Knowledge | CodeCode Available | 0 |
| Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Jul 22, 2024 | DisentanglementQuestion Answering | CodeCode Available | 0 |
| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choiceTriplet | CodeCode Available | 0 |
| CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment | Mar 11, 2022 | Natural Language UnderstandingWorld Knowledge | CodeCode Available | 0 |
| Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries | Feb 9, 2025 | DiversityFairness | CodeCode Available | 0 |
| PCR4ALL: A Comprehensive Evaluation Benchmark for Pronoun Coreference Resolution in English | Jun 1, 2022 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 |
| Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models | May 7, 2021 | Coherence EvaluationLanguage Modelling | CodeCode Available | 0 |