SOTAVerified

Visual Tracking

Visual Tracking is an essential and actively researched problem in the field of computer vision with various real-world applications such as robotic services, smart surveillance systems, autonomous driving, and human-computer interaction. It refers to the automatic estimation of the trajectory of an arbitrary target object, usually specified by a bounding box in the first frame, as it moves around in subsequent video frames.

Source: Learning Reinforced Attentional Representation for End-to-End Visual Tracking

Papers

Showing 150 of 525 papers

TitleStatusHype
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware MemoryCode9
TAPIR: Tracking Any Point with per-frame Initialization and temporal RefinementCode3
A Distractor-Aware Memory for Visual Object Tracking with SAM2Code3
Local All-Pair Correspondence for Point TrackingCode3
Universal Instance Perception as Object Discovery and RetrievalCode3
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV TrackingCode2
Autoregressive Visual TrackingCode2
Similarity-Guided Layer-Adaptive Vision Transformer for UAV TrackingCode2
Fast Online Object Tracking and Segmentation: A Unifying ApproachCode2
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream FrameworkCode2
XTrack: Multimodal Training Boosts RGB-X Video Object TrackersCode2
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-TrainingCode2
Tracking Meets LoRA: Faster Training, Larger Model, Stronger PerformanceCode2
ODTrack: Online Dense Temporal Token Learning for Visual TrackingCode2
R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement LearningCode2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion ApproachCode2
Exploring Enhanced Contextual Information for Video-Level Object TrackingCode2
Towards Real-World Visual Tracking with Temporal ContextsCode2
VastTrack: Vast Category Visual Object TrackingCode2
Cross-Modal Object Tracking via Modality-Aware Fusion Network and A Large-Scale DatasetCode2
Long-term Frame-Event Visual Tracking: Benchmark Dataset and BaselineCode2
You Only Demonstrate Once: Category-Level Manipulation from Single Visual DemonstrationCode2
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel BaselineCode2
Do Different Tracking Tasks Require Different Appearance Models?Code1
Learning Spatio-Appearance Memory Network for High-Performance Visual TrackingCode1
Learning Spatio-Temporal Transformer for Visual TrackingCode1
Learning to Adversarially Blur Visual Object TrackingCode1
AiATrack: Attention in Attention for Transformer Visual TrackingCode1
Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box EstimationCode1
Ad2Attack: Adaptive Adversarial Attack on Real-Time UAV TrackingCode1
LaSOT: A High-quality Large-scale Single Object Tracking BenchmarkCode1
Joint Visual Grounding and Tracking with Natural Language SpecificationCode1
Tracking-by-Trackers with a Distilled and Reinforced ModelCode1
Differentiable Particle Filters through Conditional Normalizing FlowCode1
HIPTrack: Visual Tracking with Historical PromptsCode1
Learning to Fuse Asymmetric Feature Maps in Siamese TrackersCode1
Improving Underwater Visual Tracking With a Large Scale Dataset and Image EnhancementCode1
How to Train Your Energy-Based Model for RegressionCode1
Improving Visual Object Tracking through Visual PromptingCode1
Fully Convolutional Online TrackingCode1
Global Instance Tracking: Locating Target More Like HumansCode1
Correlation Filters for Unmanned Aerial Vehicle-Based Aerial Tracking: A Review and Experimental EvaluationCode1
Energy-Based Models for Deep Probabilistic RegressionCode1
Automatic Failure Recovery and Re-Initialization for Online UAV Tracking with Joint Scale and Aspect Ratio OptimizationCode1
Cross-Drone Transformer Network for Robust Single Object TrackingCode1
Conditional Measurement Density Estimation in Sequential Monte Carlo via Normalizing FlowCode1
Deep Convolutional Neural Networks for Thermal Infrared Object TrackingCode1
Transformer TrackingCode1
High-Performance Long-Term Tracking with Meta-UpdaterCode1
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and SegmentationCode1
Show:102550
← PrevPage 1 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ARTrack-LAUC60.3Unverified
2UNINEXT-HAUC59.3Unverified
3JointNLTAUC56.9Unverified
4OSTrackAUC55.9Unverified
5TransTAUC50.7Unverified
6AdaSwitcherAUC42Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard61.3Unverified
2TAPIR (MOVi-E)Average Jaccard59.8Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard57.2Unverified
2TAPIR (MOVi-E)Average Jaccard57.1Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard84.7Unverified
2TAPIR (MOVi-E)Average Jaccard84.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (MOVi-E)Average Jaccard66.2Unverified
2TAPIR (Panning MOVi-E)Average Jaccard62.7Unverified
#ModelMetricClaimedVerifiedStatus
1TATrack-LAUC71.1Unverified
#ModelMetricClaimedVerifiedStatus
1SiamFC-lu (Ours)AUC0.32Unverified
#ModelMetricClaimedVerifiedStatus
1SiamFC-lu (Ours)AUC0.66Unverified
#ModelMetricClaimedVerifiedStatus
1MDNetScore0.64Unverified
#ModelMetricClaimedVerifiedStatus
1TATrack-LACCURACY0.85Unverified