| WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge | Jan 12, 2024 | Multimodal Sentiment AnalysisSentiment Analysis | —Unverified | 0 |
| Enhancing Multilingual Information Retrieval in Mixed Human Resources Environments: A RAG Model Implementation for Multicultural Enterprise | Jan 3, 2024 | Information RetrievalRAG | —Unverified | 0 |
| Open-Vocabulary 3D Semantic Segmentation with Foundation Models | Jan 1, 2024 | 3D Semantic SegmentationOpen Vocabulary Semantic Segmentation | —Unverified | 0 |
| V?: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Jan 1, 2024 | Visual GroundingWorld Knowledge | CodeCode Available | 4 |
| PokeMQA: Programmable knowledge editing for Multi-hop Question Answering | Dec 23, 2023 | Answer Generationknowledge editing | CodeCode Available | 1 |
| Anchoring Path for Inductive Relation Prediction in Knowledge Graphs | Dec 21, 2023 | Inductive Relation PredictionKnowledge Graphs | CodeCode Available | 0 |
| Typhoon: Thai Large Language Models | Dec 21, 2023 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Dec 21, 2023 | Visual Question AnsweringWorld Knowledge | CodeCode Available | 2 |
| Implicit Affordance Acquisition via Causal Action-Effect Modeling in the Video Domain | Dec 18, 2023 | World Knowledge | CodeCode Available | 0 |
| LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin | Dec 15, 2023 | Language ModellingMixture-of-Experts | CodeCode Available | 2 |
| SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Dec 14, 2023 | Knowledge DistillationObject | CodeCode Available | 1 |
| Dynamic Retrieval-Augmented Generation | Dec 14, 2023 | abstractive question answeringCode Generation | —Unverified | 0 |
| Learning adaptive planning representations with natural language guidance | Dec 13, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| CoRTEx: Contrastive Learning for Representing Terms via Explanations with Applications on Constructing Biomedical Knowledge Graphs | Dec 13, 2023 | ClusteringContrastive Learning | CodeCode Available | 0 |
| High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models | Dec 13, 2023 | ArticlesBinary Classification | —Unverified | 0 |
| VILA: On Pre-training for Visual Language Models | Dec 12, 2023 | In-Context LearningLanguage Modelling | CodeCode Available | 4 |
| Dense X Retrieval: What Retrieval Granularity Should We Use? | Dec 11, 2023 | RetrievalSentence | CodeCode Available | 1 |
| Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers | Dec 7, 2023 | MathMultiple-choice | CodeCode Available | 1 |
| Scalable Knowledge Graph Construction and Inference on Human Genome Variants | Dec 7, 2023 | graph constructionKnowledge Graphs | —Unverified | 0 |
| LLaRA: Large Language-Recommendation Assistant | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Knowledge Model: Perspectives and Challenges | Dec 5, 2023 | knowledge editingKnowledge Graphs | —Unverified | 0 |
| Lenna: Language Enhanced Reasoning Detection Assistant | Dec 5, 2023 | World Knowledge | CodeCode Available | 1 |
| Video Summarization: Towards Entity-Aware Captions | Dec 1, 2023 | Image CaptioningVideo Captioning | CodeCode Available | 0 |
| Instruction-tuning Aligns LLMs to the Human Brain | Dec 1, 2023 | Natural Language QueriesWorld Knowledge | —Unverified | 0 |
| ChatPose: Chatting about 3D Human Pose | Nov 30, 2023 | Pose EstimationPose Prediction | —Unverified | 0 |