SOTAVerified

Object Tracking

Object tracking is the task of taking an initial set of object detections, assigning a unique ID to each one, and then following each object as it moves across the frames of a video while keeping its ID assignment stable. State-of-the-art methods fuse data from RGB and event-based cameras to produce more reliable tracking. CNN-based models that take only RGB images as input are also effective. The most popular benchmark is OTB. Several evaluation metrics are specific to object tracking, including HOTA, MOTA, IDF1, and Track-mAP.
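To make the ID-maintenance step concrete, here is a minimal sketch of frame-to-frame association by greedy IoU matching. It is an illustration under stated assumptions, not the method of any paper listed on this page: the class name `GreedyIoUTracker`, the `(x1, y1, x2, y2)` box format, and the `iou_threshold` default of 0.3 are all hypothetical choices, and a track that finds no match is simply dropped.

```python
from itertools import count

Box = tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


class GreedyIoUTracker:
    """Hypothetical tracker: carries IDs forward by best box overlap."""

    def __init__(self, iou_threshold: float = 0.3) -> None:
        self.iou_threshold = iou_threshold  # assumed value, tune per dataset
        self.tracks: dict[int, Box] = {}    # track ID -> box in previous frame
        self._next_id = count(1)

    def update(self, detections: list[Box]) -> list[int]:
        """Return one track ID per detection, in the order given."""
        # Score every (track, detection) pair, best overlap first.
        pairs = sorted(
            ((iou(box, det), tid, j)
             for tid, box in self.tracks.items()
             for j, det in enumerate(detections)),
            reverse=True,
        )
        assigned: dict[int, int] = {}  # detection index -> track ID
        used_tracks: set[int] = set()
        for score, tid, j in pairs:
            if score < self.iou_threshold:
                break  # remaining pairs overlap too little to match
            if tid in used_tracks or j in assigned:
                continue
            assigned[j] = tid
            used_tracks.add(tid)
        # Detections without a match start new tracks with fresh IDs.
        for j in range(len(detections)):
            if j not in assigned:
                assigned[j] = next(self._next_id)
        # Unmatched old tracks are dropped (no occlusion handling here).
        self.tracks = {assigned[j]: det for j, det in enumerate(detections)}
        return [assigned[j] for j in range(len(detections))]
```

Calling `update` once per frame returns the IDs in detection order, so two boxes that swap positions in the input list still keep their original IDs. Practical trackers typically replace the greedy loop with an optimal assignment and add motion or appearance cues; this sketch only shows the bookkeeping that the ID-based metrics measure.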

(Image credit: Towards-Realtime-MOT)

Papers

Showing 1-50 of 1966 papers

Title | Status | Hype
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory | Code | 9
Track Anything: Segment Anything Meets Videos | Code | 5
Matching Anything by Segmenting Anything | Code | 5
Underwater Camouflaged Object Tracking Meets Vision-Language SAM2 | Code | 5
Awesome Multi-modal Object Tracking | Code | 5
Medical SAM 2: Segment medical images as video via Segment Anything Model 2 | Code | 4
CoTracker: It is Better to Track Together | Code | 4
SiamMask: A Framework for Fast Online Object Tracking and Segmentation | Code | 4
Segment and Track Anything | Code | 4
Trackastra: Transformer-based cell tracking for live-cell microscopy | Code | 4
BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking | Code | 3
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving | Code | 3
Multiple Object Tracking as ID Prediction | Code | 3
BoostTrack++: using tracklet information to detect more objects in multiple object tracking | Code | 3
VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning | Code | 3
FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Code | 3
Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-Identification | Code | 3
A Distractor-Aware Memory for Visual Object Tracking with SAM2 | Code | 3
Cross Modal Transformer: Towards Fast and Robust 3D Object Detection | Code | 3
Universal Instance Perception as Object Discovery and Retrieval | Code | 3
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Code | 3
BoT-SORT: Robust Associations Multi-Pedestrian Tracking | Code | 3
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Code | 3
Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Code | 3
Lifting Multi-View Detection and Tracking to the Bird's Eye View | Code | 2
Beyond MOT: Semantic Multi-Object Tracking | Code | 2
Language as Queries for Referring Video Object Segmentation | Code | 2
Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline | Code | 2
Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects | Code | 2
Large-Scale Pre-training for Person Re-identification with Noisy Labels | Code | 2
Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking | Code | 2
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Code | 2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Code | 2
Autoregressive Visual Tracking | Code | 2
Global Tracking Transformers | Code | 2
GTA: Global Tracklet Association for Multi-Object Tracking in Sports | Code | 2
Joint Perception and Prediction for Autonomous Driving: A Survey | Code | 2
Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object Tracking | Code | 2
Exploring Enhanced Contextual Information for Video-Level Object Tracking | Code | 2
Focusing on Tracks for Online Multi-Object Tracking | Code | 2
DiffusionTrack: Diffusion Model For Multi-Object Tracking | Code | 2
Delving into the Trajectory Long-tail Distribution for Muti-object Tracking | Code | 2
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline | Code | 2
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking | Code | 2
ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe | Code | 2
Fast Online Object Tracking and Segmentation: A Unifying Approach | Code | 2
Elysium: Exploring Object-level Perception in Videos via MLLM | Code | 2
Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Code | 2
DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association | Code | 2
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | HR-CEUTrack-Large | Success Rate | 65 | | Unverified
2 | HR-CEUTrack-Base | Success Rate | 63.2 | | Unverified
3 | CEUTrack-Large | Success Rate | 62.8 | | Unverified
4 | CEUTrack-Base | Success Rate | 62 | | Unverified
5 | SiamR-CNN | Success Rate | 60.9 | | Unverified
6 | TransT | Success Rate | 60.5 | | Unverified
7 | SuperDiMP | Success Rate | 60.2 | | Unverified
8 | TrDiMP | Success Rate | 60.1 | | Unverified
9 | KeepTrack | Success Rate | 59.6 | | Unverified
10 | AiATrack | Success Rate | 59 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HR-MonTrack-Base | Success Rate | 68.5 | | Unverified
2 | HR-MonTrack-Tiny | Success Rate | 66.3 | | Unverified
3 | Multi-modal | Success Rate | 63.4 | | Unverified
4 | PrDiMP | Success Rate | 59 | | Unverified
5 | DiMP | Success Rate | 57.1 | | Unverified
6 | MonTrack | Success Rate | 54.9 | | Unverified
7 | ATOM | Success Rate | 46.5 | | Unverified
8 | KYS | Success Rate | 26.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | OmniTrack | HOTA | 23.45 | | Unverified
2 | DeepSORT | HOTA | 21.16 | | Unverified
3 | OC-SORT | HOTA | 20.83 | | Unverified
4 | ByteTrack | HOTA | 20.66 | | Unverified
5 | TrackFormer | HOTA | 19.62 | | Unverified
6 | HybridSORT | HOTA | 16.64 | | Unverified
7 | DiffMOT | HOTA | 16.4 | | Unverified
8 | Bot-SORT | HOTA | 15.77 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DiMP50 | Success Rate | 67.33 | | Unverified
2 | PrDiMP50 | Success Rate | 67 | | Unverified
3 | PrDiMP18 | Success Rate | 65.9 | | Unverified
4 | DiMP18 | Success Rate | 64.6 | | Unverified
5 | Atom | Success Rate | 63.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | final | Humans | 0.14 | | Unverified
2 | night_fury | Humans | 0.05 | | Unverified
3 | Yolo based method | Humans | 0.02 | | Unverified
4 | final | Humans | 0 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | M2-Track | mean precision | 83.4 | | Unverified
2 | BAT | mean precision | 75.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UMMT | 3DMOTA | 95 | | Unverified
2 | MMPTRACK | 3DMOTA | 94.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Siam-FC | Average IOU | 0.66 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RT-MDNet | Precision Plot | 0.63 | | Unverified