| BOP Challenge 2020 on 6D Object Localization | Sep 15, 2020 | 6D Pose Estimation, 6D Pose Estimation using RGB | Code Available | 2 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Dec 4, 2023 | All, Object | Code Available | 2 |
| NetTrack: Tracking Highly Dynamic Objects with a Net | Mar 17, 2024 | Multi-Object Tracking, Object | Code Available | 2 |
| NOPE: Novel Object Pose Estimation from a Single Image | Mar 23, 2023 | Object, Pose Estimation | Code Available | 2 |
| DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association | Feb 24, 2022 | 3D Multi-Object Tracking, Multi-Object Tracking | Code Available | 2 |
| Boundary-Aware Segmentation Network for Mobile and Web Applications | Jan 12, 2021 | Camouflaged Object Segmentation, Decoder | Code Available | 2 |
| Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset | Dec 9, 2024 | Computational Efficiency, Mixture-of-Experts | Code Available | 2 |
| Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations | Dec 18, 2020 | 3D Object Detection, 3D Object Tracking | Code Available | 2 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | Object, Semantic Segmentation | Code Available | 2 |
| Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection | Jul 7, 2022 | Object, Open Vocabulary Attribute Detection | Code Available | 2 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Aug 23, 2022 | 3D Object Detection, Decoder | Code Available | 2 |
| Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Jul 21, 2022 | 3D Object Detection, 3D Object Detection From Monocular Images | Code Available | 2 |
| C2AM: Contrastive Learning of Class-Agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Jan 1, 2022 | Contrastive Learning, image-classification | Code Available | 2 |
| OnePose: One-Shot Object Pose Estimation without CAD Models | May 24, 2022 | 6D Pose Estimation, Graph Attention | Code Available | 2 |
| Deep Snake for Real-Time Instance Segmentation | Jan 6, 2020 | GPU, Instance Segmentation | Code Available | 2 |
| One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Sep 29, 2024 | All, Image Segmentation | Code Available | 2 |
| Open Vocabulary Monocular 3D Object Detection | Nov 25, 2024 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 2 |
| Open World Object Detection: A Survey | Oct 15, 2024 | Incremental Learning, Object | Code Available | 2 |
| Cross-View Referring Multi-Object Tracking | Dec 23, 2024 | Cross-view Referring Multi-Object Tracking, Multi-Object Tracking | Code Available | 2 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Mar 5, 2022 | Object, Semantic Segmentation | Code Available | 2 |
| AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Jun 18, 2024 | Object, Response Generation | Code Available | 2 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Mar 1, 2024 | Object, object-detection | Code Available | 2 |
| PlanT: Explainable Planning Transformers via Object-Level Representations | Oct 25, 2022 | CARLA longest6, Decision Making | Code Available | 2 |
| CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement | Jun 27, 2024 | Human-Object Interaction Detection, Human-Object Interaction Generation | Code Available | 2 |
| Point-to-Box Network for Accurate Object Detection via Single Point Supervision | Jul 14, 2022 | Attribute, Multiple Instance Learning | Code Available | 2 |
| Poly Kernel Inception Network for Remote Sensing Detection | Mar 10, 2024 | Object, object-detection | Code Available | 2 |
| Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Mar 25, 2022 | Contrastive Learning, image-classification | Code Available | 2 |
| Progressive Representation Learning for Real-Time UAV Tracking | Sep 25, 2024 | Object, Object Tracking | Code Available | 2 |
| Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting | Apr 7, 2025 | Boundary Detection, Object | Code Available | 2 |
| CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations | Mar 17, 2024 | Object, object-detection | Code Available | 2 |
| Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models | Jan 25, 2025 | Attribute, Contrastive Learning | Code Available | 2 |
| ALBench: A Framework for Evaluating Active Learning in Object Detection | Jul 27, 2022 | Active Learning, image-classification | Code Available | 2 |
| Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception | Jun 10, 2023 | 3D Object Detection, Benchmarking | Code Available | 2 |
| Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Feb 5, 2024 | Cross-Domain Few-Shot, Cross-Domain Few-Shot Object Detection | Code Available | 2 |
| R-FCN-3000 at 30fps: Decoupling Detection and Classification | Dec 5, 2017 | Classification, General Classification | Code Available | 2 |
| RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection | Aug 18, 2022 | Object, object-detection | Code Available | 2 |
| CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion | Dec 2, 2022 | 3D Object Tracking, Autonomous Vehicles | Code Available | 2 |
| RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | Jan 8, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Apr 25, 2024 | Exemplar-Free Counting, Few-shot Object Counting and Detection | Code Available | 2 |
| RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework | Sep 18, 2024 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 |
| DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Jun 9, 2023 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Mar 26, 2024 | GPU, Object | Code Available | 2 |
| CenterFormer: Center-based Transformer for 3D Object Detection | Sep 12, 2022 | 3D Object Detection, Object | Code Available | 2 |
| CenterNet++ for Object Detection | Apr 18, 2022 | Object, object-detection | Code Available | 2 |
| In-Hand Object Rotation via Rapid Motor Adaptation | Oct 10, 2022 | Object, Reinforcement Learning (RL) | Code Available | 2 |
| CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions | Apr 25, 2024 | Mamba, Multispectral Object Detection | Code Available | 2 |
| Centralized Feature Pyramid for Object Detection | Oct 5, 2022 | Object, object-detection | Code Available | 2 |
| SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection | Mar 9, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Removal then Selection: A Coarse-to-Fine Fusion Perspective for RGB-Infrared Object Detection | Jan 19, 2024 | Multispectral Object Detection, Object | Code Available | 2 |
| Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation | Mar 2, 2021 | Data Augmentation, Object | Code Available | 1 |