SOTAVerified

Learning multiple visual domains with residual adapters

2017-05-22NeurIPS 2017Code Available0· sign in to hype

Sylvestre-Alvise Rebuffi, Hakan Bilen, Andrea Vedaldi

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

There is a growing interest in learning data representations that work well for many different types of problems and data. In this paper, we look in particular at the task of learning a single visual representation that can be successfully utilized in the analysis of very different types of images, from dog breeds to stop signs and digits. Inspired by recent work on learning networks that predict the parameters of another, we develop a tunable deep network architecture that, by means of adapter residual modules, can be steered on the fly to diverse visual domains. Our method achieves a high degree of parameter sharing while maintaining or even improving the accuracy of domain-specific representations. We also introduce the Visual Decathlon Challenge, a benchmark that evaluates the ability of representations to capture simultaneously ten very different visual domains and measures their ability to recognize well uniformly.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
visual domain decathlon (10 tasks)Res. adapt. (large)decathlon discipline (Score)3,131Unverified
visual domain decathlon (10 tasks)Res. adapt. finetune alldecathlon discipline (Score)2,643Unverified
visual domain decathlon (10 tasks)Res. adapt. decaydecathlon discipline (Score)2,621Unverified
visual domain decathlon (10 tasks)Res. adapt. dom-preddecathlon discipline (Score)2,503Unverified
visual domain decathlon (10 tasks)Res. adapt.decathlon discipline (Score)2,118Unverified

Reproductions