SOTAVerified

Video Object Tracking

Video Object Detection aims to detect targets in videos using both spatial and temporal information. It is usually closely integrated with related tasks such as Object Detection and Object Tracking.

Papers

Showing 1–50 of 98 papers

| Title | Status | Hype |
|---|---|---|
| YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors | Code | 7 |
| Track Anything: Segment Anything Meets Videos | Code | 5 |
| Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Code | 4 |
| UniVS: Unified and Universal Video Segmentation with Prompts as Queries | Code | 3 |
| Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking | Code | 2 |
| Exploring Enhanced Contextual Information for Video-Level Object Tracking | Code | 2 |
| Fast Online Object Tracking and Segmentation: A Unifying Approach | Code | 2 |
| Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Code | 2 |
| Autoregressive Visual Tracking | Code | 2 |
| Video Polyp Segmentation: A Deep Learning Perspective | Code | 2 |
| ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe | Code | 2 |
| Towards Grand Unification of Object Tracking | Code | 2 |
| ODTrack: Online Dense Temporal Token Learning for Visual Tracking | Code | 2 |
| XTrack: Multimodal Training Boosts RGB-X Video Object Trackers | Code | 2 |
| MixFormer: End-to-End Tracking with Iterative Mixed Attention | Code | 2 |
| Deeper and Wider Siamese Networks for Real-Time Visual Tracking | Code | 1 |
| Towards Sequence-Level Training for Visual Tracking | Code | 1 |
| Depth Attention for Robust RGB Tracking | Code | 1 |
| Do Different Tracking Tasks Require Different Appearance Models? | Code | 1 |
| TSM: Temporal Shift Module for Efficient Video Understanding | Code | 1 |
| Fast Template Matching and Update for Video Object Tracking and Segmentation | Code | 1 |
| Teaching VLMs to Localize Specific Objects from In-context Examples | Code | 1 |
| Tracking-by-Trackers with a Distilled and Reinforced Model | Code | 1 |
| ADTrack: Target-Aware Dual Filter Learning for Real-Time Anti-Dark UAV Tracking | Code | 1 |
| Target-Aware Tracking with Long-term Context Attention | Code | 1 |
| AiATrack: Attention in Attention for Transformer Visual Tracking | Code | 1 |
| High-Speed Tracking with Kernelized Correlation Filters | Code | 1 |
| STMTrack: Template-free Visual Tracking with Space-time Memory Networks | Code | 1 |
| HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking | Code | 1 |
| ApproxDet: Content and Contention-Aware Approximate Object Detection for Mobiles | Code | 1 |
| Single-Model and Any-Modality for Video Object Tracking | Code | 1 |
| Robust Visual Tracking by Segmentation | Code | 1 |
| Revealing the Dark Secrets of Masked Image Modeling | Code | 1 |
| Referring Video Object Segmentation via Language-aligned Track Selection | Code | 1 |
| A Real-Time Wrong-Way Vehicle Detection Based on YOLO and Centroid Tracking | Code | 1 |
| Transformer Tracking | Code | 1 |
| Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset | Code | 1 |
| Learning Object Permanence from Video | Code | 1 |
| Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown | Code | 1 |
| Learning Spatio-Temporal Transformer for Visual Tracking | Code | 1 |
| Learning to Fuse Asymmetric Feature Maps in Siamese Trackers | Code | 1 |
| Learning What and Where: Disentangling Location and Identity Tracking Without Supervision | Code | 1 |
| LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search | Code | 1 |
| ProContEXT: Exploring Progressive Context Transformer for Tracking | Code | 1 |
| NeighborTrack: Improving Single Object Tracking by Bipartite Matching with Neighbor Tracklets | Code | 1 |
| NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tracking | Code | 1 |
| Ocean: Object-aware Anchor-free Tracking | Code | 1 |
| Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object Tracking | Code | 1 |
| BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D Models | Code | 1 |
| CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning | Code | 1 |

No leaderboard results yet.