| Modeling Semantic Plausibility by Injecting World Knowledge | Apr 2, 2018 | World Knowledge | CodeCode Available | 0 | 5 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | AttributeDiagnostic | CodeCode Available | 0 | 5 |
| MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty | Aug 13, 2024 | Mathematical ReasoningQuestion Answering | CodeCode Available | 0 | 5 |
| LoRec: Large Language Model for Robust Sequential Recommendation against Poisoning Attacks | Jan 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Massively Multilingual Language Models for Cross Lingual Fact Extraction from Low Resource Indian Languages | Feb 9, 2023 | FormKnowledge Graphs | CodeCode Available | 0 | 5 |
| More Room for Language: Investigating the Effect of Retrieval on Language Models | Apr 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused Evaluation | Mar 25, 2025 | counterfactualDecision Making | CodeCode Available | 0 | 5 |
| Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge | Oct 23, 2023 | Phrase GroundingWorld Knowledge | CodeCode Available | 0 | 5 |
| LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive Diagnosis | May 14, 2025 | cognitive diagnosisWorld Knowledge | CodeCode Available | 0 | 5 |
| LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model | May 3, 2024 | Image CaptioningInstruction Following | CodeCode Available | 0 | 5 |
| Locating and Extracting Relational Concepts in Large Language Models | Jun 19, 2024 | World Knowledge | CodeCode Available | 0 | 5 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 | 5 |
| Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models | Mar 21, 2024 | SentenceWorld Knowledge | CodeCode Available | 0 | 5 |
| MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations | Jun 25, 2025 | World Knowledge | CodeCode Available | 0 | 5 |
| LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Aug 9, 2024 | DiversityInstruction Following | CodeCode Available | 0 | 5 |
| LoFTI: Localization and Factuality Transfer to Indian Locales | Jul 16, 2024 | World Knowledge | CodeCode Available | 0 | 5 |
| Geographical Erasure in Language Generation | Oct 23, 2023 | Text GenerationWorld Knowledge | CodeCode Available | 0 | 5 |
| LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals | Apr 28, 2023 | Knowledge GraphsWorld Knowledge | CodeCode Available | 0 | 5 |
| Logic Attention Based Neighborhood Aggregation for Inductive Knowledge Graph Embedding | Nov 4, 2018 | Graph EmbeddingKnowledge Graph Completion | CodeCode Available | 0 | 5 |
| Morph Call: Probing Morphosyntactic Content of Multilingual Transformers | Apr 26, 2021 | Common Sense ReasoningMORPH | CodeCode Available | 0 | 5 |
| ComDensE : Combined Dense Embedding of Relation-aware and Common Features for Knowledge Graph Completion | Jun 29, 2022 | Inductive BiasKnowledge Graph Completion | CodeCode Available | 0 | 5 |
| Language Model Behavior: A Comprehensive Survey | Mar 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Combining Analogy with Language Models for Knowledge Extraction | Jun 22, 2021 | ArticlesLanguage Modeling | CodeCode Available | 0 | 5 |
| Language models show human-like content effects on reasoning tasks | Jul 14, 2022 | Language ModellingLogical Reasoning | CodeCode Available | 0 | 5 |
| DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models | Jul 24, 2024 | Retrieval-augmented GenerationWorld Knowledge | CodeCode Available | 0 | 5 |
| Knowledge Graph Completion with Mixed Geometry Tensor Factorization | Apr 3, 2025 | Knowledge Graph CompletionKnowledge Graphs | CodeCode Available | 0 | 5 |
| Augmenting Neural Networks with First-order Logic | Jun 14, 2019 | ChunkingNatural Language Inference | CodeCode Available | 0 | 5 |
| Knowledge Generation -- Variational Bayes on Knowledge Graphs | Jan 21, 2021 | DecoderGraph Matching | CodeCode Available | 0 | 5 |
| Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent | Mar 28, 2024 | World Knowledge | CodeCode Available | 0 | 5 |
| Frame- and Entity-Based Knowledge for Common-Sense Argumentative Reasoning | Nov 1, 2018 | Argument MiningCommon Sense Reasoning | CodeCode Available | 0 | 5 |
| Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition | Apr 9, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Contextual Knowledge Pursuit for Faithful Visual Synthesis | Nov 29, 2023 | Language ModellingRetrieval | CodeCode Available | 0 | 5 |
| Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation | Mar 27, 2024 | Common Sense ReasoningWorld Knowledge | CodeCode Available | 0 | 5 |
| FLAME: Self-Supervised Low-Resource Taxonomy Expansion using Large Language Models | Feb 21, 2024 | Recommendation SystemsTaxonomy Expansion | CodeCode Available | 0 | 5 |
| COFAR: Commonsense and Factual Reasoning in Image Search | Oct 16, 2022 | Image RetrievalRetrieval | CodeCode Available | 0 | 5 |
| Finding Motifs in Knowledge Graphs using Compression | Apr 16, 2021 | Knowledge GraphsWorld Knowledge | CodeCode Available | 0 | 5 |
| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choiceTriplet | CodeCode Available | 0 | 5 |
| Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions | Nov 20, 2023 | Question AnsweringVisual Question Answering | CodeCode Available | 0 | 5 |
| Figurative Language in Recognizing Textual Entailment | Jun 2, 2021 | Natural Language InferenceRTE | CodeCode Available | 0 | 5 |
| Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation | May 24, 2025 | Image GenerationText to Image Generation | CodeCode Available | 0 | 5 |
| CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment | Mar 11, 2022 | Natural Language UnderstandingWorld Knowledge | CodeCode Available | 0 | 5 |
| Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models | May 7, 2021 | Coherence EvaluationLanguage Modelling | CodeCode Available | 0 | 5 |
| Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Jul 22, 2024 | DisentanglementQuestion Answering | CodeCode Available | 0 | 5 |
| Interweaving Memories of a Siamese Large Language Model | Dec 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Intrinsic Knowledge Evaluation on Chinese Language Models | Nov 29, 2020 | World Knowledge | CodeCode Available | 0 | 5 |
| Investigating associative, switchable and negatable Winograd items on renewed French data sets | Jun 1, 2022 | NegationWorld Knowledge | CodeCode Available | 0 | 5 |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Jun 17, 2024 | In-Context Learningvalid | CodeCode Available | 0 | 5 |
| StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning | Nov 16, 2023 | Question AnsweringWorld Knowledge | CodeCode Available | 0 | 5 |
| Improving Neural Story Generation by Targeted Common Sense Grounding | Aug 26, 2019 | Common Sense ReasoningMulti-Task Learning | CodeCode Available | 0 | 5 |