
Video Visual Relation Detection

Video Visual Relation Detection (VidVRD) aims to detect instances of visual relations of interest in a video, where a visual relation instance is represented by a relation triplet together with the trajectories of the subject and object. Compared to still images, videos provide a richer set of cues for detecting visual relations, including dynamic relations such as "A-follow-B" and "A-towards-B", and temporally changing relations such as "A-chase-B" followed by "A-hold-B". Yet VidVRD is technically more challenging than its still-image counterpart (ImgVRD) because of the difficulty of accurate object tracking and the diversity of relation appearances in the video domain.

Source: ImageNet-VidVRD Video Visual Relation Dataset
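The relation-triplet-plus-trajectories representation described above can be sketched in code. The following is a minimal, illustrative model only; the class and field names are assumptions for exposition and do not reflect the actual annotation schema of the ImageNet-VidVRD dataset.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

# A bounding box as (x1, y1, x2, y2) in pixel coordinates (illustrative).
BBox = Tuple[float, float, float, float]

@dataclass
class Tracklet:
    """An object trajectory: a category label plus per-frame boxes."""
    category: str
    boxes: Dict[int, BBox] = field(default_factory=dict)  # frame index -> box

@dataclass
class RelationInstance:
    """A visual relation instance: a <subject, predicate, object> triplet
    grounded by two tracklets over the span [begin_frame, end_frame)."""
    subject: Tracklet
    predicate: str
    obj: Tracklet
    begin_frame: int
    end_frame: int

    def triplet(self) -> Tuple[str, str, str]:
        return (self.subject.category, self.predicate, self.obj.category)

# A temporally changing relation is modelled as successive instances over
# the same subject/object pair, e.g. "dog-chase-cat" then "dog-hold-cat".
dog = Tracklet("dog", {t: (10.0 + t, 20.0, 50.0 + t, 60.0) for t in range(60)})
cat = Tracklet("cat", {t: (80.0 + t, 20.0, 110.0 + t, 55.0) for t in range(60)})

relations = [
    RelationInstance(dog, "chase", cat, begin_frame=0, end_frame=30),
    RelationInstance(dog, "hold", cat, begin_frame=30, end_frame=60),
]

print([r.triplet() for r in relations])
```

Keeping the trajectories separate from the triplet makes the temporal grounding explicit: the same subject/object pair can participate in different predicates over disjoint frame spans.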

Papers

Showing 1-10 of 15 papers

Title | Status | Hype
What and When to Look?: Temporal Span Proposal Network for Video Relation Detection | Code | 1
Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection | Code | 1
LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos | Code | 1
Spatial-Temporal Transformer for Dynamic Scene Graph Generation | Code | 1
SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos | Code | 1
Video Relation Detection via Tracklet based Visual Transformer | Code | 1
VrdONE: One-stage Video Visual Relation Detection | Code | 1
OpenVidVRD: Open-Vocabulary Video Visual Relation Detection via Prompt-Driven Semantic Space Alignment | | 0
VRDFormer: End-to-End Video Visual Relation Detection With Transformers | | 0
Beyond Short-Term Snippet: Video Relation Detection With Spatio-Temporal Global Context | | 0

No leaderboard results yet.