| Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models | Mar 27, 2024 | Image Classification; Image Comprehension | Code Available | 7 |
| MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | Oct 14, 2023 | Image Classification; Image Description | Code Available | 7 |
| Visual Instruction Tuning | Apr 17, 2023 | 1 Image, 2*2 Stitching; 3D Question Answering (3D-QA) | Code Available | 6 |
| Improved Baselines with Visual Instruction Tuning | Oct 5, 2023 | Factual Inconsistency Detection in Chart Captioning; Image Classification | Code Available | 6 |
| Efficient Multimodal Learning from Data-centric Perspective | Feb 18, 2024 | Image Classification; Referring Expression Comprehension | Code Available | 5 |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | Jun 1, 2023 | Image Classification; Instruction Following | Code Available | 4 |
| MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Dec 28, 2023 | AutoML; CPU | Code Available | 3 |
| Elysium: Exploring Object-level Perception in Videos via MLLM | Mar 25, 2024 | Object; Object Tracking | Code Available | 2 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question Answering; Image Captioning | Code Available | 2 |
| Frontiers in Intelligent Colonoscopy | Oct 22, 2024 | Image Captioning | Code Available | 2 |
| Kosmos-2: Grounding Multimodal Large Language Models to the World | Jun 26, 2023 | Image Captioning; In-Context Learning | Code Available | 1 |
| Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Sep 26, 2024 | image-classification; Image Classification | Code Available | 1 |
| Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception | Mar 5, 2024 | Language Modeling; Language Modelling | Code Available | 1 |
| Modeling Context in Referring Expressions | Jul 31, 2016 | Referring Expression; Referring expression generation | Code Available | 1 |
| DisCLIP: Open-Vocabulary Referring Expression Generation | May 30, 2023 | Referring Expression; Referring expression generation | Unverified | 0 |
| Easy Things First: Installments Improve Referring Expression Generation for Objects in Photographs | Aug 1, 2016 | Referring Expression; Referring expression generation | Unverified | 0 |
| Adapting Descriptions of People to the Point of View of a Moving Observer | Nov 1, 2018 | Position; Referring Expression | Unverified | 0 |
| An Empirical Approach for Modeling Fuzzy Geographical Descriptors | Mar 30, 2017 | Referring Expression; Referring expression generation | Unverified | 0 |
| Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images | Sep 1, 2017 | Image Description; Referring Expression | Unverified | 0 |
| Fuzzy Logic for Vagueness Management in Referring Expression Generation | Sep 1, 2020 | Management; Referring Expression | Unverified | 0 |
| Generating Quantified Referring Expressions through Attention-Driven Incremental Perception | Dec 1, 2020 | Referring Expression; Referring expression generation | Unverified | 0 |
| Generating Texts with Integer Linear Programming | Oct 31, 2018 | Concept-To-Text Generation; Referring Expression | Unverified | 0 |
| Geração de Expressões de Referência usando Relações Espaciais (Referring Expression Generation Using Spatial Relations) [in Portuguese] | Jan 1, 2013 | Referring Expression; Referring expression generation | Unverified | 0 |
| An Incremental Iterated Response Model of Pragmatics | Sep 30, 2018 | model; Referring Expression | Unverified | 0 |
| G-TUNA: a corpus of referring expressions in German, including duration information | Sep 1, 2017 | Referring Expression; Referring expression generation | Unverified | 0 |
| Improving the generation of personalised descriptions | Sep 1, 2017 | Referring Expression; Referring expression generation | Unverified | 0 |
| Improving the Naturalness and Diversity of Referring Expression Generation models using Minimum Risk Training | Dec 1, 2020 | Diversity; Referring Expression | Unverified | 0 |
| Informativity in Image Captions vs. Referring Expressions | Jun 1, 2020 | Image Captioning; Object | Unverified | 0 |
| Intrinsic Task-based Evaluation for Referring Expression Generation | Feb 12, 2024 | Referring Expression; Referring expression generation | Unverified | 0 |
| Justifying Corpus-Based Choices in Referring Expression Generation | Sep 1, 2013 | Referring Expression; Referring expression generation | Unverified | 0 |
| Learning Distributions over Logical Forms for Referring Expression Generation | Oct 1, 2013 | Density Estimation; Referring Expression | Unverified | 0 |
| Learning Preferences for Referring Expression Generation: Effects of Domain, Language and Algorithm | May 1, 2012 | Referring Expression; Referring expression generation | Unverified | 0 |
| Lessons from Computational Modelling of Reference Production in Mandarin and English | Nov 14, 2020 | Referring Expression; Referring expression generation | Unverified | 0 |
| Meteorologists and Students: A resource for language grounding of geographical descriptors | Sep 7, 2018 | Referring Expression; Referring expression generation | Unverified | 0 |
| MuDoCo: Corpus for Multidomain Coreference Resolution and Referring Expression Generation | May 1, 2020 | coreference-resolution; Coreference Resolution | Unverified | 0 |
| A Predictive Model for Notional Anaphora in English | Apr 19, 2018 | coreference-resolution; Coreference Resolution | Unverified | 0 |
| Non-neural Models Matter: A Re-evaluation of Neural Referring Expression Generation Systems | Mar 15, 2022 | BIG-bench Machine Learning; Referring Expression | Unverified | 0 |
| Obtaining referential word meanings from visual and distributional information: Experiments on object naming | Jul 1, 2017 | Object; Object Recognition | Unverified | 0 |
| OMEGA : A probabilistic approach to referring expression generation in a virtual environment | Dec 1, 2020 | Referring Expression; Referring expression generation | Unverified | 0 |
| On The Feasibility of Open Domain Referring Expression Generation Using Large Scale Folksonomies | Jun 1, 2012 | Document Summarization; Multi-Document Summarization | Unverified | 0 |
| On the Robustness of Standalone Referring Expression Generation Algorithms Using RDF Data | Sep 1, 2016 | Referring Expression; Referring expression generation | Unverified | 0 |
| Assessing Neural Referential Form Selectors on a Realistic Multilingual Dataset | Oct 10, 2022 | Form; Referring Expression | Unverified | 0 |
| Perspective-corrected Spatial Referring Expression Generation for Human-Robot Interaction | Apr 4, 2021 | Diversity; Referring Expression | Unverified | 0 |
| Reference production in human-computer interaction: Issues for Corpus-based Referring Expression Generation | May 1, 2018 | Referring Expression; Referring expression generation | Unverified | 0 |
| Refer-iTTS: A System for Referring in Spoken Installments to Objects in Real-World Images | Sep 1, 2017 | Referring Expression; Referring expression generation | Unverified | 0 |
| Referring Expression Generation and Comprehension via Attributes | Oct 1, 2017 | Attribute; Referring Expression | Unverified | 0 |
| Referring Expression Generation in time-constrained communication | May 1, 2018 | Referring Expression; Referring expression generation | Unverified | 0 |
| Augmenting Robot Knowledge Consultants with Distributed Short Term Memory | Nov 26, 2018 | Referring Expression; Referring expression generation | Unverified | 0 |
| Referring Expression Generation under Uncertainty: Algorithm and Evaluation Framework | Sep 1, 2017 | Referring Expression; Referring expression generation | Unverified | 0 |
| Building Multimodal Simulations for Natural Language | Apr 1, 2017 | Formal Logic; Referring Expression | Unverified | 0 |