| A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge | Jun 3, 2022 | Question AnsweringVisual Question Answering | CodeCode Available | 1 |
| Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors | Mar 26, 2025 | Depth EstimationWorld Knowledge | CodeCode Available | 1 |
| GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants | Feb 12, 2024 | Code GenerationManagement | CodeCode Available | 1 |
| HeadlineCause: A Dataset of News Headlines for Detecting Causalities | Aug 28, 2021 | Commonsense Causal ReasoningCommon Sense Reasoning | CodeCode Available | 1 |
| I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Dec 9, 2024 | World Knowledge | CodeCode Available | 1 |
| There is a Time and Place for Reasoning Beyond the Image | Mar 1, 2022 | 16kArticles | CodeCode Available | 1 |
| Knowledge Editing through Chain-of-Thought | Dec 23, 2024 | knowledge editingWorld Knowledge | CodeCode Available | 1 |
| Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models | May 15, 2024 | AI AgentWorld Knowledge | CodeCode Available | 1 |
| Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models | Apr 9, 2024 | Few-Shot LearningLanguage Modelling | CodeCode Available | 1 |
| ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Knowledge-Augmented Neural Network Model for Implicit Discourse Relation Classification | Aug 1, 2018 | General ClassificationImplicit Discourse Relation Classification | —Unverified | 0 |
| A Survey of Reinforcement Learning Informed by Natural Language | Jun 10, 2019 | Decision MakingInstruction Following | —Unverified | 0 |
| Exploring the Potential of Large Language Models for Heterophilic Graphs | Aug 26, 2024 | Node ClassificationWorld Knowledge | —Unverified | 0 |
| GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping | Nov 19, 2024 | Common Sense ReasoningHuman-Object Interaction Detection | —Unverified | 0 |
| GOT4Rec: Graph of Thoughts for Sequential Recommendation | Nov 22, 2024 | General KnowledgeSequential Recommendation | —Unverified | 0 |
| A Study into Investigating Temporal Robustness of LLMs | Mar 21, 2025 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models | Oct 25, 2023 | Knowledge ProbingWorld Knowledge | —Unverified | 0 |
| Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach | Dec 9, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Categorization in the Wild: Generalizing Cognitive Models to Naturalistic Data across Languages | Feb 23, 2019 | World Knowledge | —Unverified | 0 |
| A Joint Training Framework for Open-World Knowledge Graph Embeddings | Jun 22, 2021 | Dialogue GenerationEntity Embeddings | —Unverified | 0 |
| GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding | Feb 9, 2024 | HallucinationKnowledge Graphs | —Unverified | 0 |
| EXnet: Efficient In-context Learning for Data-less Text classification | May 24, 2023 | In-Context LearningQuestion Answering | —Unverified | 0 |
| EventVAD: Training-Free Event-Aware Video Anomaly Detection | Apr 17, 2025 | Anomaly DetectionBoundary Detection | —Unverified | 0 |
| Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers | Jun 7, 2024 | General KnowledgeQuestion Generation | —Unverified | 0 |
| The Next Chapter: A Study of Large Language Models in Storytelling | Jan 24, 2023 | Story GenerationWorld Knowledge | —Unverified | 0 |
| Generative Retrieval and Alignment Model: A New Paradigm for E-commerce Retrieval | Apr 2, 2025 | General KnowledgeRetrieval | —Unverified | 0 |
| Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning | Jan 27, 2025 | World Knowledge | —Unverified | 0 |
| ADAM: An Embodied Causal Agent in Open-World Environments | Oct 29, 2024 | Lifelong learningMinecraft | —Unverified | 0 |
| A Bayesian Model for Joint Learning of Categories and their Features | May 1, 2015 | World Knowledge | —Unverified | 0 |
| Can LLMs Maintain Fundamental Abilities under KV Cache Compression? | Feb 4, 2025 | Arithmetic ReasoningCode Generation | —Unverified | 0 |
| Evaluating the Ability of Large Language Models to Reason about Cardinal Directions | Jun 24, 2024 | World Knowledge | —Unverified | 0 |
| EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents | Mar 18, 2024 | Reinforcement Learning (RL)World Knowledge | —Unverified | 0 |
| A Google-Proof Collection of French Winograd Schemas | Apr 1, 2017 | Coreference ResolutionWorld Knowledge | —Unverified | 0 |
| A Semi-supervised Scalable Unified Framework for E-commerce Query Classification | Jun 26, 2025 | ClassificationWorld Knowledge | —Unverified | 0 |
| Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge | May 30, 2023 | Answer SelectionQuestion Answering | —Unverified | 0 |
| Entity Type Recognition using an Ensemble of Distributional Semantic Models to Enhance Query Understanding | Apr 4, 2016 | Information RetrievalRetrieval | —Unverified | 0 |
| Dynamic Retrieval-Augmented Generation | Dec 14, 2023 | abstractive question answeringCode Generation | —Unverified | 0 |
| Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions | Apr 6, 2023 | World Knowledge | —Unverified | 0 |
| Enthymemetic Conditionals | Jun 1, 2019 | World Knowledge | —Unverified | 0 |
| Enriching Basque Coreference Resolution System using Semantic Knowledge sources | Apr 1, 2017 | coreference-resolutionCoreference Resolution | —Unverified | 0 |
| Exploring Factual Entailment with NLI: A News Media Study | Jun 24, 2024 | ArticlesFew-Shot Learning | —Unverified | 0 |
| Can Language Models Act as Knowledge Bases at Scale? | Feb 22, 2024 | Natural Language QueriesWorld Knowledge | —Unverified | 0 |
| Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection | Oct 12, 2023 | DescriptiveOut-of-Distribution Detection | —Unverified | 0 |
| Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks | Nov 9, 2023 | Multiple-choiceWorld Knowledge | —Unverified | 0 |
| Exploring the Limits of Few-Shot Link Prediction in Knowledge Graphs | Feb 5, 2021 | Knowledge GraphsLink Prediction | —Unverified | 0 |
| Generating image captions with external encyclopedic knowledge | Oct 10, 2022 | Caption GenerationImage Captioning | —Unverified | 0 |
| Geo-Aware Image Caption Generation | Dec 1, 2020 | Caption GenerationImage Captioning | —Unverified | 0 |
| GRADE: Quantifying Sample Diversity in Text-to-Image Models | Oct 29, 2024 | AttributeDiversity | —Unverified | 0 |
| Enhancing Traffic Prediction with Textual Data Using Large Language Models | May 10, 2024 | PredictionScheduling | —Unverified | 0 |