| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 | 5 |
| Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models | Mar 21, 2024 | SentenceWorld Knowledge | CodeCode Available | 0 | 5 |
| Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation | Mar 27, 2024 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 0 | 5 |
| Language models show human-like content effects on reasoning tasks | Jul 14, 2022 | Language ModellingLogical Reasoning | CodeCode Available | 0 | 5 |
| Geographical Erasure in Language Generation | Oct 23, 2023 | Text GenerationWorld Knowledge | CodeCode Available | 0 | 5 |
| Language Model Behavior: A Comprehensive Survey | Mar 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Contextual Knowledge Pursuit for Faithful Visual Synthesis | Nov 29, 2023 | Language ModellingRetrieval | CodeCode Available | 0 | 5 |
| Knowledge Graph Completion with Mixed Geometry Tensor Factorization | Apr 3, 2025 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 0 | 5 |
| LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals | Apr 28, 2023 | Knowledge GraphsWorld Knowledge | CodeCode Available | 0 | 5 |
| Memory-Modular Classification: Learning to Generalize with Memory Replacement | Apr 8, 2025 | Classificationimage-classification | CodeCode Available | 0 | 5 |
| Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Jul 22, 2024 | DisentanglementQuestion Answering | CodeCode Available | 0 | 5 |
| GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge? | Jun 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| LoFTI: Localization and Factuality Transfer to Indian Locales | Jul 16, 2024 | World Knowledge | CodeCode Available | 0 | 5 |
| Logic Attention Based Neighborhood Aggregation for Inductive Knowledge Graph Embedding | Nov 4, 2018 | Graph EmbeddingKnowledge Graph Completion | CodeCode Available | 0 | 5 |
| ComDensE : Combined Dense Embedding of Relation-aware and Common Features for Knowledge Graph Completion | Jun 29, 2022 | Inductive BiasKnowledge Graph Completion | CodeCode Available | 0 | 5 |
| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choiceTriplet | CodeCode Available | 0 | 5 |
| Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition | Apr 9, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Combining Analogy with Language Models for Knowledge Extraction | Jun 22, 2021 | ArticlesLanguage Modeling | CodeCode Available | 0 | 5 |
| Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models | May 7, 2021 | Coherence EvaluationLanguage Modelling | CodeCode Available | 0 | 5 |
| Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent | Mar 28, 2024 | World Knowledge | CodeCode Available | 0 | 5 |
| DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models | Jul 24, 2024 | Retrieval-augmented GenerationWorld Knowledge | CodeCode Available | 0 | 5 |
| Investigating associative, switchable and negatable Winograd items on renewed French data sets | Jun 1, 2022 | NegationWorld Knowledge | CodeCode Available | 0 | 5 |
| Augmenting Neural Networks with First-order Logic | Jun 14, 2019 | ChunkingNatural Language Inference | CodeCode Available | 0 | 5 |
| Intrinsic Knowledge Evaluation on Chinese Language Models | Nov 29, 2020 | World Knowledge | CodeCode Available | 0 | 5 |