| DetGPT: Detect What You Need via Reasoning | May 23, 2023 | Autonomous DrivingObject | CodeCode Available | 2 | 5 |
| Fine-Grained Prototypes Distillation for Few-Shot Object Detection | Jan 15, 2024 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 2 | 5 |
| Focal Loss for Dense Object Detection | Aug 7, 2017 | 2D Object DetectionDense Object Detection | CodeCode Available | 2 | 5 |
| Focal Sparse Convolutional Networks for 3D Object Detection | Apr 26, 2022 | 3D Object DetectionObject | CodeCode Available | 2 | 5 |
| Fully Sparse 3D Object Detection | Jul 20, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| Fusing Visual Appearance and Geometry for Multi-modality 6DoF Object Tracking | Feb 22, 2023 | 3D Object Tracking6D Pose Estimation | CodeCode Available | 2 | 5 |
| DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Jun 9, 2023 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 2 | 5 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Jun 3, 2020 | Instance SegmentationObject | CodeCode Available | 2 | 5 |
| Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images | Apr 22, 2022 | ObjectPose Estimation | CodeCode Available | 2 | 5 |
| Detect Everything with Few Examples | Sep 22, 2023 | Binary ClassificationCross-Domain Few-Shot Object Detection | CodeCode Available | 2 | 5 |
| DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation | Oct 6, 2022 | Object | CodeCode Available | 2 | 5 |
| Global Tracking Transformers | Mar 24, 2022 | Multi-Object TrackingObject | CodeCode Available | 2 | 5 |
| GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Aug 15, 2024 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior | Feb 23, 2024 | ObjectObject Rearrangement | CodeCode Available | 2 | 5 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | DecoderInstance Segmentation | CodeCode Available | 2 | 5 |
| HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Feb 5, 2024 | Objectobject-detection | CodeCode Available | 2 | 5 |
| High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Oct 14, 2024 | DenoisingDichotomous Image Segmentation | CodeCode Available | 2 | 5 |
| Boundary-Aware Segmentation Network for Mobile and Web Applications | Jan 12, 2021 | Camouflaged Object SegmentationDecoder | CodeCode Available | 2 | 5 |
| EdgeYOLO: An Edge-Real-Time Object Detector | Feb 15, 2023 | Data AugmentationEdge-computing | CodeCode Available | 2 | 5 |
| InstMove: Instance Motion for Object-centric Video Segmentation | Mar 14, 2023 | ObjectOptical Flow Estimation | CodeCode Available | 2 | 5 |
| InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition | May 21, 2025 | Earth ObservationObject | CodeCode Available | 2 | 5 |
| InteractVLM: 3D Interaction Reasoning from 2D Foundational Models | Apr 7, 2025 | 3D ReconstructionObject | CodeCode Available | 2 | 5 |
| Deep Snake for Real-Time Instance Segmentation | Jan 6, 2020 | GPUInstance Segmentation | CodeCode Available | 2 | 5 |
| Interpreting Object-level Foundation Models via Visual Precision Search | Nov 25, 2024 | Explainable Artificial Intelligence (XAI)Object | CodeCode Available | 2 | 5 |
| Articulated Object Interaction in Unknown Scenes with Whole-Body Mobile Manipulation | Mar 18, 2021 | Object | CodeCode Available | 2 | 5 |
| Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer | Apr 7, 2024 | 3D Human Reconstruction3D Object Reconstruction | CodeCode Available | 2 | 5 |
| DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association | Feb 24, 2022 | 3D Multi-Object TrackingMulti-Object Tracking | CodeCode Available | 2 | 5 |
| Large Selective Kernel Network for Remote Sensing Object Detection | Mar 16, 2023 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection | Jul 7, 2022 | ObjectOpen Vocabulary Attribute Detection | CodeCode Available | 2 | 5 |
| Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Apr 9, 2024 | Image RetrievalObject | CodeCode Available | 2 | 5 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 | 5 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Aug 23, 2022 | 3D Object DetectionDecoder | CodeCode Available | 2 | 5 |
| LiDAR Snowfall Simulation for Robust 3D Object Detection | Mar 28, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 | 5 |
| LISO: Lidar-only Self-Supervised 3D Object Detection | Mar 11, 2024 | 3D Object DetectionObject | CodeCode Available | 2 | 5 |
| Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification | Mar 15, 2024 | Object | CodeCode Available | 2 | 5 |
| Make It Count: Text-to-Image Generation with an Accurate Number of Objects | Jun 14, 2024 | DenoisingImage Generation | CodeCode Available | 2 | 5 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Dec 14, 2024 | Mixture-of-ExpertsObject | CodeCode Available | 2 | 5 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Mar 1, 2024 | Objectobject-detection | CodeCode Available | 2 | 5 |
| MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking | Jul 28, 2023 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 2 | 5 |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Sep 27, 2024 | Exemplar-Free CountingFew-shot Object Counting and Detection | CodeCode Available | 2 | 5 |
| AdaMixer: A Fast-Converging Query-Based Object Detector | Mar 30, 2022 | ObjectObject Detection | CodeCode Available | 2 | 5 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 | 5 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Mar 5, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 | 5 |
| MonoCD: Monocular 3D Object Detection with Complementary Depths | Apr 4, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 | 5 |
| MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection | Mar 24, 2022 | 3D Object Detection3D Object Detection From Monocular Images | CodeCode Available | 2 | 5 |
| Cross-View Referring Multi-Object Tracking | Dec 23, 2024 | Cross-view Referring Multi-Object TrackingMulti-Object Tracking | CodeCode Available | 2 | 5 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Apr 25, 2024 | Exemplar-Free CountingFew-shot Object Counting and Detection | CodeCode Available | 2 | 5 |
| Dense Distinct Query for End-to-End Object Detection | Mar 22, 2023 | Objectobject-detection | CodeCode Available | 2 | 5 |
| Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Mar 25, 2022 | Contrastive Learningimage-classification | CodeCode Available | 2 | 5 |
| Contextual Object Detection with Multimodal Large Language Models | May 29, 2023 | Cloze TestDecoder | CodeCode Available | 2 | 5 |