| Scalable Video Object Segmentation with Identification Mechanism | Mar 22, 2022 | Object, Segmentation | Code Available | 2 | 5 |
| RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | Jan 8, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 | 5 |
| CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion | Dec 2, 2022 | 3D Object Tracking, Autonomous Vehicles | Code Available | 2 | 5 |
| Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation | Mar 5, 2025 | Object, Referring Video Object Segmentation | Code Available | 2 | 5 |
| EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Mar 26, 2024 | 3D Reconstruction, Object | Code Available | 2 | 5 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Mar 26, 2024 | GPU, Object | Code Available | 2 | 5 |
| EGTR: Extracting Graph from Transformer for Scene Graph Generation | Apr 2, 2024 | Graph Generation, Multi-Task Learning | Code Available | 2 | 5 |
| AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Jun 18, 2024 | Object, Response Generation | Code Available | 2 | 5 |
| Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders | Mar 26, 2024 | Object, Self-Supervised Learning | Code Available | 2 | 5 |
| Efficient Teacher: Semi-Supervised Object Detection for YOLOv5 | Feb 15, 2023 | Object, object-detection | Code Available | 2 | 5 |
| Elysium: Exploring Object-level Perception in Videos via MLLM | Mar 25, 2024 | Object, Object Tracking | Code Available | 2 | 5 |
| EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Mar 31, 2023 | 3D Object Detection, Depth Estimation | Code Available | 2 | 5 |
| Duoduo CLIP: Efficient 3D Understanding with Multi-View Images | Jun 17, 2024 | GPU, Object | Code Available | 2 | 5 |
| EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Mar 3, 2024 | Object, Representation Learning | Code Available | 2 | 5 |
| DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images | Aug 15, 2022 | NeRF, Object | Code Available | 2 | 5 |
| DiffusionTrack: Diffusion Model For Multi-Object Tracking | Aug 19, 2023 | Denoising, model | Code Available | 2 | 5 |
| DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Jul 5, 2023 | Object | Code Available | 2 | 5 |
| EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild | Nov 21, 2024 | 3D Reconstruction, Object | Code Available | 2 | 5 |
| Equalized Focal Loss for Dense Long-Tailed Object Detection | Jan 7, 2022 | Long-tailed Object Detection, Object | Code Available | 2 | 5 |
| Fine-Grained Prototypes Distillation for Few-Shot Object Detection | Jan 15, 2024 | Few-Shot Object Detection, Meta-Learning | Code Available | 2 | 5 |
| DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation | Oct 6, 2022 | Object | Code Available | 2 | 5 |
| DetGPT: Detect What You Need via Reasoning | May 23, 2023 | Autonomous Driving, Object | Code Available | 2 | 5 |
| DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection | Dec 6, 2024 | Object, object-detection | Code Available | 2 | 5 |
| Detect Everything with Few Examples | Sep 22, 2023 | Binary Classification, Cross-Domain Few-Shot Object Detection | Code Available | 2 | 5 |
| Deep Snake for Real-Time Instance Segmentation | Jan 6, 2020 | GPU, Instance Segmentation | Code Available | 2 | 5 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Dec 14, 2024 | Mixture-of-Experts, Object | Code Available | 2 | 5 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Jun 3, 2020 | Instance Segmentation, Object | Code Available | 2 | 5 |
| Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation | Feb 4, 2025 | Denoising, Domain Generalization | Code Available | 2 | 5 |
| DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association | Feb 24, 2022 | 3D Multi-Object Tracking, Multi-Object Tracking | Code Available | 2 | 5 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Oct 18, 2022 | Object, Semantic Segmentation | Code Available | 2 | 5 |
| Dense Distinct Query for End-to-End Object Detection | Mar 22, 2023 | Object, object-detection | Code Available | 2 | 5 |
| DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World | Jun 30, 2025 | Caption Generation, Object | Code Available | 2 | 5 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Aug 23, 2022 | 3D Object Detection, Decoder | Code Available | 2 | 5 |
| Augmented Object Intelligence with XR-Objects | Apr 20, 2024 | Object, Semantic Segmentation | Code Available | 2 | 5 |
| 3D Object Detection for Autonomous Driving: A Comprehensive Survey | Jun 19, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 2 | 5 |
| DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Jun 9, 2023 | 3D Multi-Object Tracking, 3D Object Detection | Code Available | 2 | 5 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Apr 25, 2024 | Exemplar-Free Counting, Few-shot Object Counting and Detection | Code Available | 2 | 5 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | Decoder, Instance Segmentation | Code Available | 2 | 5 |
| Cross-View Referring Multi-Object Tracking | Dec 23, 2024 | Cross-view Referring Multi-Object Tracking, Multi-Object Tracking | Code Available | 2 | 5 |
| DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Apr 4, 2024 | Object, object-detection | Code Available | 2 | 5 |
| DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting | Nov 26, 2024 | Attribute, Diversity | Code Available | 2 | 5 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Mar 5, 2022 | Object, Semantic Segmentation | Code Available | 2 | 5 |
| DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries | Mar 29, 2024 | Object, Video Instance Segmentation | Code Available | 2 | 5 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | May 19, 2025 | Event-based vision, Object | Code Available | 2 | 5 |
| EdgeYOLO: An Edge-Real-Time Object Detector | Feb 15, 2023 | Data Augmentation, Edge-computing | Code Available | 2 | 5 |
| Autoregressive Visual Tracking | Jan 1, 2023 | Object, Object Tracking | Code Available | 2 | 5 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Mar 1, 2024 | Object, object-detection | Code Available | 2 | 5 |
| E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Mar 14, 2024 | Autonomous Driving, Object | Code Available | 2 | 5 |
| Contextual Object Detection with Multimodal Large Language Models | May 29, 2023 | Cloze Test, Decoder | Code Available | 2 | 5 |
| Context-Guided Spatio-Temporal Video Grounding | Jan 3, 2024 | Object, Spatio-Temporal Video Grounding | Code Available | 2 | 5 |