SOTAVerified

Visual Tracking

Visual Tracking is an essential and actively researched problem in the field of computer vision with various real-world applications such as robotic services, smart surveillance systems, autonomous driving, and human-computer interaction. It refers to the automatic estimation of the trajectory of an arbitrary target object, usually specified by a bounding box in the first frame, as it moves around in subsequent video frames.

Source: Learning Reinforced Attentional Representation for End-to-End Visual Tracking

Papers

Showing 150 of 525 papers

TitleStatusHype
What You Have is What You Track: Adaptive and Robust Multimodal TrackingCode0
R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement LearningCode2
Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual TrackingCode1
Comparison of Two Methods for Stationary Incident Detection Based on Background Image0
Towards Effective and Efficient Adversarial Defense with Diffusion Models for Robust Visual TrackingCode0
CLDTracker: A Comprehensive Language Description for Visual TrackingCode0
TrackVLA: Embodied Visual Tracking in the Wild0
VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models0
Hierarchical Instruction-aware Embodied Visual Tracking0
Efficient Motion Prompt Learning for Robust Visual TrackingCode1
Towards Adaptive Meta-Gradient Adversarial Examples for Visual TrackingCode0
DeepSORT-Driven Visual Tracking Approach for Gesture Recognition in Interactive Systems0
Predicting Road Surface Anomalies by Visual Tracking of a Preceding Vehicle0
Adversarial Attack for RGB-Event based Visual Object TrackingCode0
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual TrackingCode1
Towards General Multimodal Visual Tracking0
Similarity-Guided Layer-Adaptive Vision Transformer for UAV TrackingCode2
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual TrackingCode1
Technical Report for ReID-SAM on SkiTB Visual Tracking Challenge 20250
CFTrack: Enhancing Lightweight Visual Tracking through Contrastive Learning and Feature Matching0
Enhanced Transformer-Based Tracking for Skiing Events: Overcoming Multi-Camera Challenges, Scale Variations and Rapid Motion -- SkiTB Visual Tracking Challenge 20250
DreamTrack: Dreaming the Future for Multimodal Visual Object Tracking0
Autoregressive Sequential Pretraining for Visual Tracking0
Exploring Historical Information for RGBE Visual Tracking with Mamba0
Less is More: Token Context-aware Learning for Object TrackingCode1
FusionSORT: Fusion Methods for Online Multi-object Visual TrackingCode0
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV TrackingCode2
Exploring Enhanced Contextual Information for Video-Level Object TrackingCode2
Improving Accuracy and Generalization for Efficient Visual Tracking0
A Distractor-Aware Memory for Visual Object Tracking with SAM2Code3
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware MemoryCode9
Vision Eagle Attention: a new lens for advancing image classificationCode1
MFTIQ: Multi-Flow Tracker with Independent Matching Quality EstimationCode1
ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model0
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking0
The Solution for Single Object Tracking Task of Perception Test Challenge 20240
Improving Visual Object Tracking through Visual PromptingCode1
Distilling Channels for Efficient Deep Tracking0
Camouflaged Object Tracking: A BenchmarkCode0
Low-Light Object Tracking: A BenchmarkCode1
MambaEVT: Event Stream based Visual Object Tracking using State Space ModelCode1
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion ApproachCode2
Local All-Pair Correspondence for Point TrackingCode3
Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers0
Tracking Reflected Objects: A BenchmarkCode0
Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV TrackingCode0
Adaptively Bypassing Vision Transformer Blocks for Efficient Visual TrackingCode0
Multi-Granularity Language-Guided Multi-Object TrackingCode1
Robust Visual Tracking via Iterative Gradient Descent and Threshold Selection0
XTrack: Multimodal Training Boosts RGB-X Video Object TrackersCode2
Show:102550
← PrevPage 1 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ARTrack-LAUC60.3Unverified
2UNINEXT-HAUC59.3Unverified
3JointNLTAUC56.9Unverified
4OSTrackAUC55.9Unverified
5TransTAUC50.7Unverified
6AdaSwitcherAUC42Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard61.3Unverified
2TAPIR (MOVi-E)Average Jaccard59.8Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard57.2Unverified
2TAPIR (MOVi-E)Average Jaccard57.1Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard84.7Unverified
2TAPIR (MOVi-E)Average Jaccard84.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (MOVi-E)Average Jaccard66.2Unverified
2TAPIR (Panning MOVi-E)Average Jaccard62.7Unverified
#ModelMetricClaimedVerifiedStatus
1TATrack-LAUC71.1Unverified
#ModelMetricClaimedVerifiedStatus
1SiamFC-lu (Ours)AUC0.32Unverified
#ModelMetricClaimedVerifiedStatus
1SiamFC-lu (Ours)AUC0.66Unverified
#ModelMetricClaimedVerifiedStatus
1MDNetScore0.64Unverified
#ModelMetricClaimedVerifiedStatus
1TATrack-LACCURACY0.85Unverified