| Contextual Knowledge Pursuit for Faithful Visual Synthesis | Nov 29, 2023 | Language Modelling, Retrieval | Code Available | 0 |
| Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering | Nov 29, 2023 | Common Sense Reasoning, Question Answering | Unverified | 0 |
| Large Language Models as Automated Aligners for benchmarking Vision-Language Models | Nov 24, 2023 | Benchmarking, World Knowledge | Unverified | 0 |
| ShareGPT4V: Improving Large Multi-Modal Models with Better Captions | Nov 21, 2023 | Descriptive, MME | Code Available | 0 |
| Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions | Nov 20, 2023 | Question Answering, Visual Question Answering | Code Available | 0 |
| MultiLoRA: Democratizing LoRA for Better Multi-Task Learning | Nov 20, 2023 | Multi-Task Learning, Natural Language Understanding | Unverified | 0 |
| RecExplainer: Aligning Large Language Models for Explaining Recommendation Models | Nov 18, 2023 | Explanation Generation, Instruction Following | Unverified | 0 |
| StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning | Nov 16, 2023 | Question Answering, World Knowledge | Code Available | 0 |
| Online Continual Knowledge Learning for Language Models | Nov 16, 2023 | Continual Learning, Fact Checking | Unverified | 0 |
| LOKE: Linked Open Knowledge Extraction for Automated Knowledge Graph Construction | Nov 15, 2023 | Entity Linking, graph construction | Unverified | 0 |
| Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models | Nov 15, 2023 | World Knowledge | Code Available | 1 |
| Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models | Nov 14, 2023 | Continual Learning, Question Answering | Code Available | 1 |
| Towards Open-Ended Visual Recognition with Large Language Model | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA | Nov 14, 2023 | In-Context Learning, Program Synthesis | Code Available | 1 |
| A Study of Implicit Ranking Unfairness in Large Language Models | Nov 13, 2023 | Data Augmentation, Fairness | Code Available | 0 |
| A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering | Nov 13, 2023 | Decision Making, Explanation Generation | Code Available | 1 |
| Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks | Nov 9, 2023 | Multiple-choice, World Knowledge | Unverified | 0 |
| Active Reasoning in an Open-World Environment | Nov 3, 2023 | Instruction Following, Minecraft | Unverified | 0 |
| ACES: Translation Accuracy Challenge Sets at WMT 2023 | Nov 2, 2023 | Translation, World Knowledge | Unverified | 0 |
| Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information | Nov 2, 2023 | Imputation, Knowledge Graph Completion | Code Available | 1 |
| Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts | Oct 31, 2023 | Image Captioning, Language Modeling | Code Available | 1 |
| CapsFusion: Rethinking Image-Text Data at Scale | Oct 31, 2023 | World Knowledge | Code Available | 2 |
| Test-time Augmentation for Factual Probing | Oct 26, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models | Oct 25, 2023 | Knowledge Probing, World Knowledge | Unverified | 0 |
| Geographical Erasure in Language Generation | Oct 23, 2023 | Text Generation, World Knowledge | Code Available | 0 |
| Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge | Oct 23, 2023 | Phrase Grounding, World Knowledge | Code Available | 0 |
| One Model for All: Large Language Models are Domain-Agnostic Recommendation Systems | Oct 22, 2023 | All, Language Modeling | Unverified | 0 |
| MeKB-Rec: Personal Knowledge Graph Learning for Cross-Domain Recommendation | Oct 17, 2023 | Graph Learning, Recommendation Systems | Unverified | 0 |
| EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge | Oct 16, 2023 | Image Retrieval, Language Modeling | Unverified | 0 |
| VidCoM: Fast Video Comprehension through Large Language Models with Multimodal Tools | Oct 16, 2023 | Caption Generation, Descriptive | Unverified | 0 |
| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choice, Triplet | Code Available | 0 |
| Penetrative AI: Making LLMs Comprehend the Physical World | Oct 14, 2023 | Common Sense Reasoning, World Knowledge | Unverified | 0 |
| Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection | Oct 12, 2023 | Descriptive, Out-of-Distribution Detection | Unverified | 0 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information Retrieval, Informativeness | Code Available | 1 |
| How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances | Oct 11, 2023 | World Knowledge | Code Available | 1 |
| Mistral 7B | Oct 10, 2023 | answerability prediction, Arithmetic Reasoning | Code Available | 6 |
| Self-Knowledge Guided Retrieval Augmentation for Large Language Models | Oct 8, 2023 | Question Answering, Retrieval | Unverified | 0 |
| Compositional Semantics for Open Vocabulary Spatio-semantic Representations | Oct 8, 2023 | World Knowledge | Unverified | 0 |
| Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU | Oct 7, 2023 | Multi-task Language Understanding, World Knowledge | Code Available | 1 |
| FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation | Oct 5, 2023 | Hallucination, World Knowledge | Code Available | 2 |
| Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond | Oct 3, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| FELM: Benchmarking Factuality Evaluation of Large Language Models | Oct 1, 2023 | Benchmarking, Math | Code Available | 1 |
| Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration | Sep 30, 2023 | World Knowledge | Code Available | 1 |
| Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment | Sep 30, 2023 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Augmenting LLMs with Knowledge: A survey on hallucination prevention | Sep 28, 2023 | Hallucination, Language Modeling | Unverified | 0 |
| Analyzing the Efficacy of an LLM-Only Approach for Image-based Document Question Answering | Sep 25, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Physics of Language Models: Part 3.1, Knowledge Storage and Extraction | Sep 25, 2023 | Question Answering, Sentence | Code Available | 1 |
| Bravo MaRDI: A Wikibase Powered Knowledge Graph on Mathematics | Sep 20, 2023 | World Knowledge | Code Available | 0 |
| Retrieve-Rewrite-Answer: A KG-to-Text Enhanced LLMs Framework for Knowledge Graph Question Answering | Sep 20, 2023 | Graph Question Answering, Language Modeling | Code Available | 1 |
| Reformulating Sequential Recommendation: Learning Dynamic User Interest with Content-enriched Language Modeling | Sep 19, 2023 | Language Modeling, Language Modelling | Code Available | 0 |