SOTAVerified

MultiGrain: a unified image embedding for classes and instances

2019-02-14Code Available0· sign in to hype

Maxim Berman, Hervé Jégou, Andrea Vedaldi, Iasonas Kokkinos, Matthijs Douze

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

MultiGrain is a network architecture producing compact vector representations that are suited both for image classification and particular object retrieval. It builds on a standard classification trunk. The top of the network produces an embedding containing coarse and fine-grained information, so that images can be recognized based on the object class, particular object, or if they are distorted copies. Our joint training is simple: we minimize a cross-entropy loss for classification and a ranking loss that determines if two images are identical up to data augmentation, with no need for additional labels. A key component of MultiGrain is a pooling layer that takes advantage of high-resolution images with a network trained at a lower resolution. When fed to a linear classifier, the learned embeddings provide state-of-the-art classification accuracy. For instance, we obtain 79.4% top-1 accuracy with a ResNet-50 learned on Imagenet, which is a +1.8% absolute improvement over the AutoAugment method. When compared with the cosine similarity, the same embeddings perform on par with the state-of-the-art for image retrieval at moderate resolutions.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
ImageNetMultiGrain NASNet-A-Mobile (350px)Top 1 Accuracy75.1Unverified
ImageNetMultiGrain R50-AA-500Top 1 Accuracy79.4Unverified
ImageNetMultiGrain R50-AA-224Top 1 Accuracy78.2Unverified
ImageNetMultiGrain PNASNet (500px)Top 1 Accuracy83.6Unverified
ImageNetMultiGrain PNASNet (450px)Top 1 Accuracy83.2Unverified
ImageNetMultiGrain SENet154 (450px)Top 1 Accuracy83.1Unverified
ImageNetMultiGrain SENet154 (400px)Top 1 Accuracy83Unverified
ImageNetMultiGrain SENet154 (500px)Top 1 Accuracy82.7Unverified
ImageNetMultiGrain PNASNet (400px)Top 1 Accuracy82.6Unverified
ImageNetMultiGrain PNASNet (300px)Top 1 Accuracy81.3Unverified

Reproductions