SOTAVerified

Visual Tracking

Visual Tracking is an essential and actively researched problem in the field of computer vision with various real-world applications such as robotic services, smart surveillance systems, autonomous driving, and human-computer interaction. It refers to the automatic estimation of the trajectory of an arbitrary target object, usually specified by a bounding box in the first frame, as it moves around in subsequent video frames.

Source: Learning Reinforced Attentional Representation for End-to-End Visual Tracking

Papers

Showing 51100 of 525 papers

TitleStatusHype
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM0
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-TrainingCode2
Learning Tracking Representations from Single Point Annotations0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL0
RTracker: Recoverable Tracking via PN Tree Structured MemoryCode1
Multi-attention Associate Prediction Network for Visual Tracking0
Pedestrian Tracking with Monocular Camera using Unconstrained 3D Motion Model0
A Spectrum-based Image Denoising Method with Edge Feature Enhancement0
Autoregressive Queries for Adaptive Tracking with Spatio-TemporalTransformers0
Long-term Frame-Event Visual Tracking: Benchmark Dataset and BaselineCode2
Tracking Meets LoRA: Faster Training, Larger Model, Stronger PerformanceCode2
Motion-Guided Dual-Camera Tracker for Endoscope Tracking and Motion Analysis in a Mechanical Gastric SimulatorCode0
VastTrack: Vast Category Visual Object TrackingCode2
Unifying Visual and Vision-Language Tracking via Contrastive LearningCode1
Multi-task Learning for Joint Re-identification, Team Affiliation, and Role Classification for Sports Visual Tracking0
Explicit Visual Prompts for Visual Object TrackingCode1
ODTrack: Online Dense Temporal Token Learning for Visual TrackingCode2
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers0
DiffusionTrack: Point Set Diffusion Model for Visual Object TrackingCode0
Cross-Modal Object Tracking via Modality-Aware Fusion Network and A Large-Scale DatasetCode2
Visual tracking brain computer interface0
HIPTrack: Visual Tracking with Historical PromptsCode1
AViTMP: A Tracking-Specific Transformer for Single-Branch Visual TrackingCode0
ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual TrackingCode1
Staged Depthwise Correlation and Feature Fusion for Siamese Object Tracking0
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel BaselineCode2
BASE: Probably a Better Approach to Multi-Object Tracking0
LiteTrack: Layer Pruning with Asynchronous Feature Extraction for Lightweight and Efficient Visual TrackingCode1
Robust Visual Tracking by Motion Analyzing0
Towards Efficient Training with Negative Samples in Visual Tracking0
Efficient Training for Visual Tracking with Deformable Transformer0
Improving Underwater Visual Tracking With a Large Scale Dataset and Image EnhancementCode1
Learning Visual Tracking and Reaching with Deep Reinforcement Learning on a UR10e Robotic ArmCode0
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and SegmentationCode1
CiteTracker: Correlating Image and Text for Visual TrackingCode1
Towards Real-World Visual Tracking with Temporal ContextsCode2
HHTrack: Hyperspectral Object Tracking Using Hybrid Attention0
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual TrackingCode1
Robust Object Modeling for Visual TrackingCode1
Low-complexity Multidimensional DCT Approximations0
TAPIR: Tracking Any Point with per-frame Initialization and temporal RefinementCode3
Cross-Drone Transformer Network for Robust Single Object TrackingCode1
Estimation of control area in badminton doubles with pose information from top and back view drone videosCode0
Tracking through Containers and Occluders in the WildCode1
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object TrackingCode1
ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D DataCode1
SiamTHN: Siamese Target Highlight Network for Visual Tracking0
Joint Visual Grounding and Tracking with Natural Language SpecificationCode1
Tracker Meets Night: A Transformer Enhancer for UAV TrackingCode1
Universal Instance Perception as Object Discovery and RetrievalCode3
Show:102550
← PrevPage 2 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ARTrack-LAUC60.3Unverified
2UNINEXT-HAUC59.3Unverified
3JointNLTAUC56.9Unverified
4OSTrackAUC55.9Unverified
5TransTAUC50.7Unverified
6AdaSwitcherAUC42Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard61.3Unverified
2TAPIR (MOVi-E)Average Jaccard59.8Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard57.2Unverified
2TAPIR (MOVi-E)Average Jaccard57.1Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (Panning MOVi-E)Average Jaccard84.7Unverified
2TAPIR (MOVi-E)Average Jaccard84.3Unverified
#ModelMetricClaimedVerifiedStatus
1TAPIR (MOVi-E)Average Jaccard66.2Unverified
2TAPIR (Panning MOVi-E)Average Jaccard62.7Unverified
#ModelMetricClaimedVerifiedStatus
1TATrack-LAUC71.1Unverified
#ModelMetricClaimedVerifiedStatus
1SiamFC-lu (Ours)AUC0.32Unverified
#ModelMetricClaimedVerifiedStatus
1SiamFC-lu (Ours)AUC0.66Unverified
#ModelMetricClaimedVerifiedStatus
1MDNetScore0.64Unverified
#ModelMetricClaimedVerifiedStatus
1TATrack-LACCURACY0.85Unverified