SOTAVerified

Detecting Visual Relationships with Deep Relational Networks

2017-04-11 · CVPR 2017 · Code Available

Bo Dai, Yuqi Zhang, Dahua Lin


Abstract

Relationships among objects play a crucial role in image understanding. Despite the great success of deep learning techniques in recognizing individual objects, reasoning about the relationships among objects remains a challenging task. Previous methods often treat this as a classification problem, considering each type of relationship (e.g. "ride") or each distinct visual phrase (e.g. "person-ride-horse") as a category. Such approaches face significant difficulties caused by the high diversity of visual appearance within each kind of relationship and by the large number of distinct visual phrases. We propose an integrated framework to tackle this problem. At the heart of this framework is the Deep Relational Network, a novel formulation designed specifically to exploit the statistical dependencies between objects and their relationships. On two large datasets, the proposed method achieves substantial improvements over the state of the art.
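The classification baseline that the abstract contrasts with can be sketched concretely: score each (subject, predicate, object) triplet as a product of three independent classifier probabilities. The sketch below is illustrative only, with made-up class lists and scores; it shows the independence assumption the DR-Net is designed to replace, not the paper's actual model.

```python
import itertools

import numpy as np

# Hypothetical class lists and independent classifier outputs (softmax
# probabilities) for one detected object pair -- illustrative values only.
subjects = ["person", "horse"]
predicates = ["ride", "wear"]
objects_ = ["horse", "hat"]

p_subj = np.array([0.9, 0.1])  # P(subject class)
p_pred = np.array([0.7, 0.3])  # P(predicate class)
p_obj = np.array([0.8, 0.2])   # P(object class)


def best_triplet():
    """Independent-classification baseline: score a triplet as the product
    of the three marginals, ignoring dependencies among the components."""
    scored = (
        ((subjects[s], predicates[r], objects_[o]),
         p_subj[s] * p_pred[r] * p_obj[o])
        for s, r, o in itertools.product(range(2), range(2), range(2))
    )
    return max(scored, key=lambda pair: pair[1])


triplet, score = best_triplet()
# The DR-Net instead models the statistical dependencies among subject,
# predicate, and object rather than treating them independently.
```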

Benchmark Results

Dataset                      Model                                   Metric   Claimed   Verified   Status
VRD Phrase Detection         Dai et al. [Dai, Zhang, and Lin 2017]   R@100    23.45                Unverified
VRD Predicate Detection      Dai et al. [Dai, Zhang, and Lin 2017]   R@100    81.9                 Unverified
VRD Relationship Detection   Dai et al. [Dai, Zhang, and Lin 2017]   R@100    20.88                Unverified
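The R@100 metric in the table is Recall@100: the fraction of ground-truth relationship triplets recovered among each image's 100 highest-scoring predictions. A minimal sketch of the top-k bookkeeping, assuming predictions arrive as (score, triplet) pairs per image (the function name and data layout are assumptions, not an API from the paper's code):

```python
def recall_at_k(predictions_per_image, ground_truth_per_image, k=100):
    """Fraction of ground-truth triplets found among the top-k predictions.

    predictions_per_image: one list of (score, triplet) pairs per image.
    ground_truth_per_image: parallel list of ground-truth triplet lists.
    Triplets are (subject, predicate, object) tuples.
    """
    hits, total = 0, 0
    for preds, gts in zip(predictions_per_image, ground_truth_per_image):
        # Keep only the k highest-scoring predicted triplets for this image.
        top_k = {t for _, t in sorted(preds, key=lambda p: p[0], reverse=True)[:k]}
        hits += sum(1 for t in gts if t in top_k)
        total += len(gts)
    return hits / total if total else 0.0
```

Note that the three VRD settings (phrase, predicate, relationship detection) differ in what counts as a hit, e.g. whether predicted boxes must also overlap the ground truth; this sketch shows only the recall computation, not the box-matching rules.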
