| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future prediction, Large Language Model | —Unverified | 0 |
| Vision Language Models are In-Context Value Learners | Nov 7, 2024 | In-Context Learning, World Knowledge | —Unverified | 0 |
| Vision-Language Models Provide Promptable Representations for Reinforcement Learning | Feb 5, 2024 | Common Sense Reasoning, Instruction Following | —Unverified | 0 |
| Visual Commonsense in Pretrained Unimodal and Multimodal Models | Jan 16, 2022 | Attribute, World Knowledge | —Unverified | 0 |
| Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Sep 13, 2024 | Sequential Decision Making, World Knowledge | —Unverified | 0 |
| Visual Programming for Text-to-Image Generation and Evaluation | May 24, 2023 | Image Generation, Layout Generation | —Unverified | 0 |
| Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models | Jul 28, 2024 | World Knowledge | —Unverified | 0 |
| VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | Dec 24, 2024 | Common Sense Reasoning, Transfer Learning | —Unverified | 0 |
| VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Jul 9, 2024 | Autonomous Driving, Image to 3D | —Unverified | 0 |
| We Usually Don't Like Going to the Dentist: Using Common Sense to Detect Irony on Twitter | Dec 1, 2018 | Common Sense Reasoning, General Classification | —Unverified | 0 |
| What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models? | May 31, 2023 | Common Sense Reasoning, Few-Shot Learning | —Unverified | 0 |
| What goes into a word: generating image descriptions with top-down spatial knowledge | Oct 1, 2019 | Decoder, Language Modeling | —Unverified | 0 |
| What Knowledge is Needed to Solve the RTE5 Textual Entailment Challenge? | Jun 10, 2018 | Natural Language Inference, RTE | —Unverified | 0 |
| What's the best place for an AI conference, Vancouver or ______: Why completing comparative questions is difficult | Apr 5, 2021 | World Knowledge | —Unverified | 0 |
| "Why" Has the Least Side Effect on Model Editing | Sep 27, 2024 | Experimental Design, knowledge editing | —Unverified | 0 |
| WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia | Jun 19, 2024 | Language Modelling, RAG | —Unverified | 0 |
| Wikipedia2Vec: An Efficient Toolkit for Learning and Visualizing the Embeddings of Words and Entities from Wikipedia | Dec 15, 2018 | World Knowledge | —Unverified | 0 |
| Wikipedia-based Semantic Interpretation for Natural Language Processing | Jan 15, 2014 | Common Sense Reasoning, Text Categorization | —Unverified | 0 |
| WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation | Aug 1, 2021 | Articles, World Knowledge | —Unverified | 0 |
| Winograd Schemas and Machine Translation | Aug 5, 2016 | Machine Translation, Sentence | —Unverified | 0 |
| WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge | Jan 12, 2024 | Multimodal Sentiment Analysis, Sentiment Analysis | —Unverified | 0 |
| WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation | May 2, 2025 | Image Generation, Text to Image Generation | —Unverified | 0 |
| World Knowledge as Indirect Supervision for Document Clustering | Jul 30, 2016 | Clustering, World Knowledge | —Unverified | 0 |
| World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving | Dec 9, 2024 | Autonomous Driving, World Knowledge | —Unverified | 0 |
| World Knowledge for Abstract Meaning Representation Parsing | May 1, 2018 | Abstract Meaning Representation, AMR Parsing | —Unverified | 0 |
| World Knowledge for Reading Comprehension: Rare Entity Prediction with Hierarchical LSTMs Using External Descriptions | Sep 1, 2017 | Diversity, Language Modeling | —Unverified | 0 |
| World Knowledge from AI Image Generation for Robot Control | Mar 20, 2025 | Image Generation, World Knowledge | —Unverified | 0 |
| Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering | Jun 1, 2021 | Knowledge Graphs, Question Answering | —Unverified | 0 |
| WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning | May 6, 2024 | Multiple-choice, Video Understanding | —Unverified | 0 |
| WorldTree: A Corpus of Explanation Graphs for Elementary Science Questions supporting Multi-Hop Inference | Feb 8, 2018 | Question Answering, World Knowledge | —Unverified | 0 |
| WorldTree V2: A Corpus of Science-Domain Structured Explanations and Inference Patterns supporting Multi-Hop Inference | May 1, 2020 | Question Answering, World Knowledge | —Unverified | 0 |
| Would you describe a leopard as yellow? Evaluating crowd-annotations with justified and informative disagreement | Dec 1, 2020 | Diagnostic, valid | —Unverified | 0 |
| XTE: Explainable Text Entailment | Sep 25, 2020 | Machine Translation, Question Answering | —Unverified | 0 |
| Zero-shot Robotic Manipulation with Language-guided Instruction and Formal Task Planning | Jan 25, 2025 | Motion Planning, Task and Motion Planning | —Unverified | 0 |
| Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis | Aug 27, 2024 | Benchmarking, Large Language Model | —Unverified | 0 |
| MeKB-Rec: Personal Knowledge Graph Learning for Cross-Domain Recommendation | Oct 17, 2023 | Graph Learning, Recommendation Systems | —Unverified | 0 |
| Mental Modeling of Reinforcement Learning Agents by Language Models | Jun 26, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 |
| MetaMorph: Multimodal Understanding and Generation via Instruction Tuning | Dec 18, 2024 | Instruction Following, MORPH | —Unverified | 0 |
| Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization | Jun 27, 2020 | Abstractive Text Summarization, Decoder | —Unverified | 0 |
| MMHQA-ICL: Multimodal In-context Learning for Hybrid Question Answering over Text, Tables and Images | Sep 9, 2023 | In-Context Learning, Question Answering | —Unverified | 0 |
| MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning | Jun 12, 2025 | Image Generation, Multimodal Reasoning | —Unverified | 0 |
| Modeling Dynamic Relationships Between Characters in Literary Novels | Nov 30, 2015 | Structured Prediction, World Knowledge | —Unverified | 0 |
| Modelling linguistic vagueness and uncertainty in historical texts | Sep 1, 2019 | World Knowledge | —Unverified | 0 |
| MoLoRec: A Generalizable and Efficient Framework for LLM-Based Recommendation | Feb 12, 2025 | parameter-efficient fine-tuning, Recommendation Systems | —Unverified | 0 |
| MOVi: Training-free Text-conditioned Multi-Object Video Generation | May 29, 2025 | Object, Video Generation | —Unverified | 0 |
| MQuinE: a cure for "Z-paradox" in knowledge graph embedding models | Feb 5, 2024 | Graph Embedding, Information Retrieval | —Unverified | 0 |
| Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag Set | Sep 14, 2019 | Articles, Classification | —Unverified | 0 |
| MultiLoRA: Democratizing LoRA for Better Multi-Task Learning | Nov 20, 2023 | Multi-Task Learning, Natural Language Understanding | —Unverified | 0 |
| Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles | Sep 10, 2024 | Autonomous Vehicles, Language Modeling | —Unverified | 0 |
| Multimodal Pretrained Models for Verifiable Sequential Decision-Making: Planning, Grounding, and Perception | Aug 10, 2023 | Decision Making, Robot Manipulation | —Unverified | 0 |