SOTAVerified

Video Object Tracking

Video Object Detection aims to detect targets in videos using both spatial and temporal information. It's usually deeply integrated with tasks such as Object Detection and Object Tracking.

Papers

Showing 150 of 98 papers

TitleStatusHype
HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term TrackingCode1
Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction0
Exploiting Multimodal Spatial-temporal Patterns for Video Object TrackingCode2
Exploring Enhanced Contextual Information for Video-Level Object TrackingCode2
Referring Video Object Segmentation via Language-aligned Track SelectionCode1
Teaching VLMs to Localize Specific Objects from In-context ExamplesCode1
Depth Attention for Robust RGB TrackingCode1
NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object TrackingCode1
VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking0
Associate Everything Detected: Facilitating Tracking-by-Detection to the UnknownCode1
Medical SAM 2: Segment medical images as video via Segment Anything Model 2Code4
XTrack: Multimodal Training Boosts RGB-X Video Object TrackersCode2
Gaga: Group Any Gaussians via 3D-aware Memory Bank0
UniVS: Unified and Universal Video Segmentation with Prompts as QueriesCode3
ODTrack: Online Dense Temporal Token Learning for Visual TrackingCode2
ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to DescribeCode2
Single-Model and Any-Modality for Video Object TrackingCode1
Robust Visual Tracking by Motion Analyzing0
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking0
Bridging the Gap Between End-to-end and Non-End-to-end Multi-Object TrackingCode1
Track Anything: Segment Anything Meets VideosCode5
Target-Aware Tracking with Long-term Context AttentionCode1
Autoregressive Visual TrackingCode2
NeighborTrack: Improving Single Object Tracking by Bipartite Matching with Neighbor TrackletsCode1
ProContEXT: Exploring Progressive Context Transformer for TrackingCode1
A Real-Time Wrong-Way Vehicle Detection Based on YOLO and Centroid TrackingCode1
Towards Sequence-Level Training for Visual TrackingCode1
Video object tracking based on YOLOv7 and DeepSORT0
AiATrack: Attention in Attention for Transformer Visual TrackingCode1
Towards Grand Unification of Object TrackingCode2
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectorsCode7
Revealing the Dark Secrets of Masked Image ModelingCode1
Learning What and Where: Disentangling Location and Identity Tracking Without SupervisionCode1
Video Polyp Segmentation: A Deep Learning PerspectiveCode2
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream FrameworkCode2
Robust Visual Tracking by SegmentationCode1
Transforming Model Prediction for TrackingCode0
MixFormer: End-to-End Tracking with Iterative Mixed AttentionCode2
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning0
Efficient Visual Tracking with Exemplar TransformersCode0
Evaluating and Improving Interactions with Hazy Oracles0
INFERNO: Inferring Object-Centric 3D Scene Representations without Supervision0
BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D ModelsCode1
Do Different Tracking Tasks Require Different Appearance Models?Code1
ADTrack: Target-Aware Dual Filter Learning for Real-Time Anti-Dark UAV TrackingCode1
LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture SearchCode1
STMTrack: Template-free Visual Tracking with Space-time Memory NetworksCode1
Learning Spatio-Temporal Transformer for Visual TrackingCode1
Learning Target Candidate Association to Keep Track of What Not to TrackCode0
Transformer TrackingCode1
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.