SOTAVerified

Video Visual Relation Detection

Video Visual Relation Detection (VidVRD) aims to detect instances of visual relations of interest in a video, where a visual relation instance is represented by a relation triplet with the trajectories of the subject and object. As compared to still images, videos provide a more natural set of features for detecting visual relations, such as the dynamic relations like “A-follow-B” and “A-towards-B”, and temporally changing relations like “A-chase-B” followed by “A-hold-B”. Yet, VidVRD is technically more challenging than ImgVRD due to the difficulties in accurate object tracking and diverse relation appearances in the video domain.

Source: ImageNet-VidVRD Video Visual Relation Dataset

Title	Date	Tasks	Status
VRDFormer: End-to-End Video Visual Relation Detection With Transformers	Jan 1, 2022	ObjectRelation	—Unverified
Social Fabric: Tubelet Compositions for Video Relation Detection	Aug 18, 2021	ObjectRelation	CodeCode Available
Video Relation Detection with Trajectory-aware Multi-modal Features	Jan 20, 2021	Objectobject-detection	—Unverified
Beyond Short-Term Snippet: Video Relation Detection With Spatio-Temporal Global Context	Jun 1, 2020	RelationVideo Visual Relation Detection	—Unverified
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph	Mar 25, 2019	Model OptimizationVideo Relationship	CodeCode Available

Title

Status

Hype

VRDFormer: End-to-End Video Visual Relation Detection With Transformers

—Unverified

Social Fabric: Tubelet Compositions for Video Relation Detection

CodeCode Available

Video Relation Detection with Trajectory-aware Multi-modal Features

—Unverified

Beyond Short-Term Snippet: Video Relation Detection With Spatio-Temporal Global Context

—Unverified

Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph

CodeCode Available

No leaderboard results yet.

Video Visual Relation Detection

Papers