| Practical Video Object Detection via Feature Selection and Aggregation | Jul 29, 2024 | feature selection, GPU | Code Available | 3 |
| Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Nov 24, 2022 | 2D Object Detection, Image Retrieval | Code Available | 2 |
| YOLOV: Making Still Image Object Detectors Great at Video Object Detection | Aug 20, 2022 | GPU, Object | Code Available | 2 |
| TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers | Jan 13, 2022 | GPU, Object | Code Available | 2 |
| Detection and Tracking Meet Drones Challenge | Jan 16, 2020 | Multi-Object Tracking, Object | Code Available | 2 |
| FADE: A Dataset for Detecting Falling Objects around Buildings in Video | Aug 11, 2024 | Moving Object Detection, Object | Code Available | 1 |
| Multi-resolution Rescored ByteTrack for Video Object Detection on Ultra-low-power Embedded Systems | Apr 17, 2024 | Object, object-detection | Code Available | 1 |
| Camera clustering for scalable stream-based active distillation | Apr 16, 2024 | Clustering, Knowledge Distillation | Code Available | 1 |
| Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Feb 14, 2024 | object-detection, Object Detection | Code Available | 1 |
| TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Feb 14, 2024 | Instance Segmentation, object-detection | Code Available | 1 |
| Spatio-temporal Prompting Network for Robust Video Feature Extraction | Feb 4, 2024 | Instance Segmentation, object-detection | Code Available | 1 |
| MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection | Jan 18, 2024 | Mamba, object-detection | Code Available | 1 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Oct 30, 2023 | Denoising, GPU | Code Available | 1 |
| Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers | Aug 25, 2023 | Action Recognition, Object Detection | Code Available | 1 |
| Objects do not disappear: Video object detection by single-frame object location anticipation | Aug 9, 2023 | Computational Efficiency, Object | Code Available | 1 |
| FeatEnHancer: Enhancing Hierarchical Features for Object Detection and Beyond Under Low-Light Vision | Aug 7, 2023 | Face Detection, object-detection | Code Available | 1 |
| 3D Video Object Detection with Learnable Object-Centric Global Optimization | Mar 27, 2023 | 3D Scene Reconstruction, global-optimization | Code Available | 1 |
| FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors | Mar 15, 2023 | Object, object-detection | Code Available | 1 |
| Feature Aggregated Queries for Transformer-Based Video Object Detectors | Jan 1, 2023 | Object, object-detection | Code Available | 1 |
| Fewer is More: Efficient Object Detection in Large Aerial Images | Dec 26, 2022 | 4k, Object | Code Available | 1 |
| PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection | Sep 6, 2022 | object-detection, Object Detection | Code Available | 1 |
| Video Sparse Transformer With Attention-Guided Memory for Video Object Detection | Jun 17, 2022 | Object, object-detection | Code Available | 1 |
| TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video | Nov 17, 2021 | Data Augmentation, Image Augmentation | Code Available | 1 |
| AI Accelerator Survey and Trends | Sep 18, 2021 | Benchmarking, Computational Efficiency | Code Available | 1 |
| FFAVOD: Feature Fusion Architecture for Video Object Detection | Sep 15, 2021 | Object, object-detection | Code Available | 1 |
| TF-Blender: Temporal Feature Blender for Video Object Detection | Aug 12, 2021 | Object, object-detection | Code Available | 1 |
| End-to-End Video Object Detection with Spatial-Temporal Transformers | May 23, 2021 | Object, object-detection | Code Available | 1 |
| Few-Shot Video Object Detection | Apr 30, 2021 | Diversity, Few-Shot Learning | Code Available | 1 |
| Emerging Properties in Self-Supervised Vision Transformers | Apr 29, 2021 | Copy Detection, Image Classification | Code Available | 1 |
| Short-term anchor linking and long-term self-guided attention for video object detection | Apr 18, 2021 | Object, object-detection | Code Available | 1 |
| Motion Vector Extrapolation for Video Object Detection | Apr 18, 2021 | CPU, GPU | Code Available | 1 |
| HoughNet: Integrating near and long-range evidence for visual detection | Apr 14, 2021 | 3D Object Detection, Image Generation | Code Available | 1 |
| Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation | Mar 25, 2021 | GPU, Object | Code Available | 1 |
| PatchNet -- Short-range Template Matching for Efficient Video Processing | Mar 10, 2021 | Object, object-detection | Code Available | 1 |
| ApproxDet: Content and Contention-Aware Approximate Object Detection for Mobiles | Oct 21, 2020 | Object, object-detection | Code Available | 1 |
| Robust and Efficient Post-Processing for Video Object Detection (REPP) | Oct 1, 2020 | Autonomous Driving, Dense Object Detection | Code Available | 1 |
| Robust and efficient post-processing for video object detection | Sep 23, 2020 | Autonomous Driving, Object | Code Available | 1 |
| Mining Inter-Video Proposal Relations for Video Object Detection | Aug 1, 2020 | Object, object-detection | Code Available | 1 |
| Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection | Apr 9, 2020 | Object, object-detection | Code Available | 1 |
| LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention | Apr 3, 2020 | Object, object-detection | Code Available | 1 |
| Memory Enhanced Global-Local Aggregation for Video Object Detection | Mar 26, 2020 | Object, object-detection | Code Available | 1 |
| Relation Distillation Networks for Video Object Detection | Aug 26, 2019 | Object, object-detection | Code Available | 1 |
| TSM: Temporal Shift Module for Efficient Video Understanding | Nov 20, 2018 | 3D Action Recognition, Action Classification | Code Available | 1 |
| Towards High Performance Video Object Detection for Mobiles | Apr 16, 2018 | Object, object-detection | Code Available | 1 |
| Structure-measure: A New Way to Evaluate Foreground Maps | Aug 2, 2017 | Object, object-detection | Code Available | 1 |
| Lightweight Multi-Frame Integration for Robust YOLO Object Detection in Videos | Jun 25, 2025 | Autonomous Driving, Computational Efficiency | Unverified | 0 |
| Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection | Apr 16, 2025 | Graph Learning, Graph Representation Learning | Unverified | 0 |
| Context in object detection: a systematic literature review | Mar 29, 2025 | Few-Shot Object Detection, Object | Unverified | 0 |
| Region Masking to Accelerate Video Processing on Neuromorphic Hardware | Mar 21, 2025 | object-detection, Object Detection | Unverified | 0 |
| TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection | Mar 18, 2025 | GPU, object-detection | Unverified | 0 |