| A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge | Jun 3, 2022 | Question Answering, Visual Question Answering | Code Available | 1 | 5 |
| Physics of Language Models: Part 3.1, Knowledge Storage and Extraction | Sep 25, 2023 | Question Answering, Sentence | Code Available | 1 | 5 |
| Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning | May 29, 2023 | Autonomous Driving, Decoder | Code Available | 1 | 5 |
| Differentially Private Federated Knowledge Graphs Embedding | May 17, 2021 | Graph Embedding, Knowledge Graph Embedding | Code Available | 1 | 5 |
| Do PLMs Know and Understand Ontological Knowledge? | Sep 12, 2023 | Logical Reasoning, Memorization | Code Available | 1 | 5 |
| FELM: Benchmarking Factuality Evaluation of Large Language Models | Oct 1, 2023 | Benchmarking, Math | Code Available | 1 | 5 |
| F-ViTA: Foundation Model Guided Visible to Thermal Translation | Apr 3, 2025 | Scene Understanding, Style Transfer | Code Available | 1 | 5 |
| PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives | May 3, 2023 | Knowledge Graphs, World Knowledge | Code Available | 1 | 5 |
| Probabilistic Case-based Reasoning for Open-World Knowledge Graph Completion | Oct 7, 2020 | Knowledge Graph Completion, Link Prediction | Code Available | 1 | 5 |
| ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval | Oct 24, 2024 | Image Retrieval, Retrieval | Code Available | 0 | 5 |
| ExPUNations: Augmenting Puns with Keywords and Explanations | Oct 24, 2022 | Explanation Generation, Natural Language Understanding | Code Available | 0 | 5 |
| Open-World Knowledge Graph Completion | Nov 9, 2017 | Entity Linking, Knowledge Graph Completion | Code Available | 0 | 5 |
| Temporal Fact Reasoning over Hyper-Relational Knowledge Graphs | Jul 14, 2023 | Knowledge Graphs, Link Prediction | Code Available | 0 | 5 |
| On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme Classification | Aug 18, 2024 | Imputation, Missing Labels | Code Available | 0 | 5 |
| Causal interventions expose implicit situation models for commonsense language understanding | Jun 6, 2023 | World Knowledge | Code Available | 0 | 5 |
| ObjCAViT: Improving Monocular Depth Estimation Using Natural Language Models And Image-Object Cross-Attention | Nov 30, 2022 | Depth Estimation, Image-to-Image Translation | Code Available | 0 | 5 |
| Explain Yourself! Leveraging Language Models for Commonsense Reasoning | Jun 6, 2019 | Common Sense Reasoning, Form | Code Available | 0 | 5 |
| My Teacher Thinks The World Is Flat! Interpreting Automatic Essay Scoring Mechanism | Dec 27, 2020 | Common Sense Reasoning, Natural Language Understanding | Code Available | 0 | 5 |
| Event knowledge in large language models: the gap between the impossible and the unlikely | Dec 2, 2022 | Sentence, World Knowledge | Code Available | 0 | 5 |
| Morph Call: Probing Morphosyntactic Content of Multilingual Transformers | Apr 26, 2021 | Common Sense Reasoning, MORPH | Code Available | 0 | 5 |
| Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment | Jun 24, 2025 | Informativeness, reinforcement-learning | Code Available | 0 | 5 |
| NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge | May 8, 2023 | Knowledge Distillation, valid | Code Available | 0 | 5 |
| Evaluating Contrastive Feedback for Effective User Simulations | May 5, 2025 | Information Retrieval, Prompt Engineering | Code Available | 0 | 5 |
| ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation | Jul 5, 2021 | Few-Shot Learning, Natural Language Understanding | Code Available | 0 | 5 |
| Arrows are the Verbs of Diagrams | Aug 1, 2018 | BIG-bench Machine Learning, World Knowledge | Code Available | 0 | 5 |
| Modeling Semantic Plausibility by Injecting World Knowledge | Apr 2, 2018 | World Knowledge | Code Available | 0 | 5 |
| Mitigating Hallucination in Fictional Character Role-Play | Jun 25, 2024 | Hallucination, World Knowledge | Code Available | 0 | 5 |
| Mitigating Temporal Misalignment by Discarding Outdated Facts | May 24, 2023 | Question Answering, Retrieval | Code Available | 0 | 5 |
| MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization | Sep 22, 2021 | Articles, Document Summarization | Code Available | 0 | 5 |
| Evaluating Methods for Extraction of Aspect Terms in Opinion Texts in Portuguese - the Challenges of Implicit Aspects | Jun 1, 2022 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 0 | 5 |
| More Room for Language: Investigating the Effect of Retrieval on Language Models | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| EventGround: Narrative Reasoning by Grounding to Eventuality-centric Knowledge Graphs | Mar 30, 2024 | Graph Neural Network, Knowledge Graphs | Code Available | 0 | 5 |
| NLITrans at SemEval-2018 Task 12: Transfer of Semantic Knowledge for Argument Comprehension | Apr 23, 2018 | Position, Sentence | Code Available | 0 | 5 |
| EventNarrative: A large-scale Event-centric Dataset for Knowledge Graph-to-Text Generation | Oct 30, 2021 | KG-to-Text Generation, Knowledge Graphs | Code Available | 0 | 5 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | Attribute, Diagnostic | Code Available | 0 | 5 |
| Massively Multilingual Language Models for Cross Lingual Fact Extraction from Low Resource Indian Languages | Feb 9, 2023 | Form, Knowledge Graphs | Code Available | 0 | 5 |
| Memory-Modular Classification: Learning to Generalize with Memory Replacement | Apr 8, 2025 | Classification, image-classification | Code Available | 0 | 5 |
| Enhancing Content-based Recommendation via Large Language Model | Mar 30, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty | Aug 13, 2024 | Mathematical Reasoning, Question Answering | Code Available | 0 | 5 |
| Locating and Extracting Relational Concepts in Large Language Models | Jun 19, 2024 | World Knowledge | Code Available | 0 | 5 |
| Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge | Oct 23, 2023 | Phrase Grounding, World Knowledge | Code Available | 0 | 5 |
| LoFTI: Localization and Factuality Transfer to Indian Locales | Jul 16, 2024 | World Knowledge | Code Available | 0 | 5 |
| Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams | Jun 17, 2024 | All, Benchmarking | Code Available | 0 | 5 |
| Logic Attention Based Neighborhood Aggregation for Inductive Knowledge Graph Embedding | Nov 4, 2018 | Graph Embedding, Knowledge Graph Completion | Code Available | 0 | 5 |
| Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts | May 25, 2022 | Mixture-of-Experts, Multi-Task Learning | Code Available | 0 | 5 |
| A surprisal oracle for when every layer counts | Dec 4, 2024 | Common Sense Reasoning, Language Modeling | Code Available | 0 | 5 |
| LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused Evaluation | Mar 25, 2025 | counterfactual, Decision Making | Code Available | 0 | 5 |
| LoRec: Large Language Model for Robust Sequential Recommendation against Poisoning Attacks | Jan 31, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Aug 9, 2024 | Diversity, Instruction Following | Code Available | 0 | 5 |
| LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals | Apr 28, 2023 | Knowledge Graphs, World Knowledge | Code Available | 0 | 5 |