SOTAVerified

Image Retrieval with Multi-Modal Query

This task is to retrieve images from a database given a multi-modal (image-text) query. Specifically, the query text describes a modification to the query image, and the goal is to retrieve images that show the desired modification.
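As a rough illustration of the task setup, the sketch below ranks database images by cosine similarity to a composed query. It assumes images and text are already embedded as vectors in a shared space, and the `compose` function (a plain element-wise sum) is a hypothetical stand-in for the learned composition modules used in the papers listed on this page. The toy vectors and names are made up for the example.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def compose(image_vec, text_vec):
    # Hypothetical composition: element-wise sum of the two embeddings.
    # Real methods learn this function; this is only a placeholder.
    return [a + b for a, b in zip(image_vec, text_vec)]

def retrieve(image_vec, text_vec, database, k=10):
    """Return the names of the k database images closest to the composed query."""
    query = compose(image_vec, text_vec)
    ranked = sorted(database, key=lambda name: cosine(query, database[name]), reverse=True)
    return ranked[:k]

# Toy database of pre-computed image embeddings (assumed, not real data).
database = {
    "red_dress": [1.0, 0.0, 0.0],
    "blue_dress": [0.0, 1.0, 0.0],
    "blue_shirt": [0.0, 0.7, 0.7],
}
query_image = [1.0, 0.0, 0.0]      # embedding of a red dress
modification = [-1.0, 1.0, 0.0]    # embedding of the text "make it blue"
top = retrieve(query_image, modification, database, k=2)
```

In this toy setup the composed query points toward the "blue" direction, so the blue dress ranks first and the red query image itself ranks last.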

Papers

Showing 1-10 of 10 papers

Title | Status | Hype
Collaborative Group: Composed Image Retrieval via Consensus Learning from Noisy Annotations | - | 0
Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization | Code | 1
Compositional Learning of Image-Text Query for Image Retrieval | Code | 1
Composing Text and Image for Image Retrieval - An Empirical Odyssey | Code | 1
Attributes as Operators: Factorizing Unseen Attribute-Object Compositions | Code | 0
FiLM: Visual Reasoning with a General Conditioning Layer | Code | 1
Automatic Spatially-aware Fashion Concept Discovery | Code | 1
A simple neural network module for relational reasoning | Code | 0
Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction | Code | 0
Show and Tell: A Neural Image Caption Generator | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ComposeAE | Recall@10 | 11.8 | - | Unverified
2 | TIRG | Recall@10 | 3.34 | - | Unverified
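
For readers unfamiliar with the Recall@10 metric listed above, here is a minimal sketch of how Recall@K is commonly computed for retrieval. It assumes one correct target image per query and reports the result as a percentage; the exact evaluation protocol behind the numbers in this table is not stated on this page, so treat the function and the toy data as illustrative only.

```python
def recall_at_k(ranked_lists, targets, k=10):
    """Percentage of queries whose target appears in the top-k retrieved items.

    ranked_lists: one ranked list of item ids per query (best match first).
    targets: the single correct item id for each query (assumed setup).
    """
    hits = sum(1 for ranked, target in zip(ranked_lists, targets) if target in ranked[:k])
    return 100.0 * hits / len(targets)

# Toy example: two queries, both looking for item "b".
ranked_lists = [["a", "b", "c"], ["c", "a", "b"]]
targets = ["b", "b"]
score = recall_at_k(ranked_lists, targets, k=2)
```

Here the first query finds "b" within its top 2 results and the second does not, so the score is 50.0.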