| Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models | Feb 26, 2024 | AttributeFine-Grained Visual Categorization | —Unverified | 0 |
| PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering | Feb 26, 2024 | Question AnsweringRetrieval | —Unverified | 0 |
| Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics | Feb 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Language Models Act as Knowledge Bases at Scale? | Feb 22, 2024 | Natural Language QueriesWorld Knowledge | —Unverified | 0 |
| WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Feb 22, 2024 | Image-level Supervised Instance Segmentationobject-detection | CodeCode Available | 2 |
| LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition | Feb 22, 2024 | Data Augmentationfew-shot-ner | —Unverified | 0 |
| FLAME: Self-Supervised Low-Resource Taxonomy Expansion using Large Language Models | Feb 21, 2024 | Recommendation SystemsTaxonomy Expansion | CodeCode Available | 0 |
| Lying Blindly: Bypassing ChatGPT's Safeguards to Generate Hard-to-Detect Disinformation Claims | Feb 13, 2024 | World Knowledge | —Unverified | 0 |
| GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants | Feb 12, 2024 | Code GenerationManagement | CodeCode Available | 1 |
| GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding | Feb 9, 2024 | HallucinationKnowledge Graphs | —Unverified | 0 |
| Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases | Feb 5, 2024 | Layout GenerationObject | —Unverified | 0 |
| MQuinE: a cure for "Z-paradox" in knowledge graph embedding models | Feb 5, 2024 | Graph EmbeddingInformation Retrieval | —Unverified | 0 |
| Vision-Language Models Provide Promptable Representations for Reinforcement Learning | Feb 5, 2024 | Common Sense ReasoningInstruction Following | —Unverified | 0 |
| GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Note On Lookahead In Real Life And Computing | Feb 2, 2024 | Future predictionWorld Knowledge | —Unverified | 0 |
| LoRec: Large Language Model for Robust Sequential Recommendation against Poisoning Attacks | Jan 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 |
| Efficient Tool Use with Chain-of-Abstraction Reasoning | Jan 30, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear | Jan 30, 2024 | World Knowledge | —Unverified | 0 |
| Towards Generating Informative Textual Description for Neurons in Language Models | Jan 30, 2024 | World Knowledge | —Unverified | 0 |
| Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets | Jan 29, 2024 | BenchmarkingMachine Translation | CodeCode Available | 1 |
| Democratizing Fine-grained Visual Recognition with Large Language Models | Jan 24, 2024 | Fine-Grained Visual RecognitionWorld Knowledge | —Unverified | 0 |
| Can AI Assistants Know What They Don't Know? | Jan 24, 2024 | MathOpen-Domain Question Answering | CodeCode Available | 2 |
| Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge | Jan 19, 2024 | Question AnsweringQuestion Generation | CodeCode Available | 1 |
| Knowledge Verification to Nip Hallucination in the Bud | Jan 19, 2024 | HallucinationWorld Knowledge | CodeCode Available | 1 |
| WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge | Jan 12, 2024 | Multimodal Sentiment AnalysisSentiment Analysis | —Unverified | 0 |
| Enhancing Multilingual Information Retrieval in Mixed Human Resources Environments: A RAG Model Implementation for Multicultural Enterprise | Jan 3, 2024 | Information RetrievalRAG | —Unverified | 0 |
| Open-Vocabulary 3D Semantic Segmentation with Foundation Models | Jan 1, 2024 | 3D Semantic SegmentationOpen Vocabulary Semantic Segmentation | —Unverified | 0 |
| V?: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Jan 1, 2024 | Visual GroundingWorld Knowledge | CodeCode Available | 4 |
| PokeMQA: Programmable knowledge editing for Multi-hop Question Answering | Dec 23, 2023 | Answer Generationknowledge editing | CodeCode Available | 1 |
| Anchoring Path for Inductive Relation Prediction in Knowledge Graphs | Dec 21, 2023 | Inductive Relation PredictionKnowledge Graphs | CodeCode Available | 0 |
| Typhoon: Thai Large Language Models | Dec 21, 2023 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Dec 21, 2023 | Visual Question AnsweringWorld Knowledge | CodeCode Available | 2 |
| Implicit Affordance Acquisition via Causal Action-Effect Modeling in the Video Domain | Dec 18, 2023 | World Knowledge | CodeCode Available | 0 |
| LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin | Dec 15, 2023 | Language ModellingMixture-of-Experts | CodeCode Available | 2 |
| SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Dec 14, 2023 | Knowledge DistillationObject | CodeCode Available | 1 |
| Dynamic Retrieval-Augmented Generation | Dec 14, 2023 | abstractive question answeringCode Generation | —Unverified | 0 |
| Learning adaptive planning representations with natural language guidance | Dec 13, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| CoRTEx: Contrastive Learning for Representing Terms via Explanations with Applications on Constructing Biomedical Knowledge Graphs | Dec 13, 2023 | ClusteringContrastive Learning | CodeCode Available | 0 |
| High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models | Dec 13, 2023 | ArticlesBinary Classification | —Unverified | 0 |
| VILA: On Pre-training for Visual Language Models | Dec 12, 2023 | In-Context LearningLanguage Modelling | CodeCode Available | 4 |
| Dense X Retrieval: What Retrieval Granularity Should We Use? | Dec 11, 2023 | RetrievalSentence | CodeCode Available | 1 |
| Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers | Dec 7, 2023 | MathMultiple-choice | CodeCode Available | 1 |
| Scalable Knowledge Graph Construction and Inference on Human Genome Variants | Dec 7, 2023 | graph constructionKnowledge Graphs | —Unverified | 0 |
| LLaRA: Large Language-Recommendation Assistant | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Knowledge Model: Perspectives and Challenges | Dec 5, 2023 | knowledge editingKnowledge Graphs | —Unverified | 0 |
| Lenna: Language Enhanced Reasoning Detection Assistant | Dec 5, 2023 | World Knowledge | CodeCode Available | 1 |
| Video Summarization: Towards Entity-Aware Captions | Dec 1, 2023 | Image CaptioningVideo Captioning | CodeCode Available | 0 |
| Instruction-tuning Aligns LLMs to the Human Brain | Dec 1, 2023 | Natural Language QueriesWorld Knowledge | —Unverified | 0 |
| ChatPose: Chatting about 3D Human Pose | Nov 30, 2023 | Pose EstimationPose Prediction | —Unverified | 0 |