| CaSPR: Learning Canonical Spatiotemporal Point Cloud Representations | Aug 6, 2020 | Camera Pose Estimation, Object | Code Available | 1 |
| Density Crop-guided Semi-supervised Object Detection in Aerial Images | Aug 9, 2023 | Object, object-detection | Code Available | 1 |
| Dense Relational Image Captioning via Multi-task Triple-Stream Networks | Oct 8, 2020 | Graph Generation, Image Captioning | Code Available | 1 |
| GOOD: Exploring Geometric Cues for Detecting Objects in an Open World | Dec 22, 2022 | Class-agnostic Object Detection, Object | Code Available | 1 |
| GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting | Mar 15, 2022 | 6D Pose Estimation, 6D Pose Estimation using RGB | Code Available | 1 |
| GRAB: A Dataset of Whole-Body Human Grasping of Objects | Aug 25, 2020 | Grasp Contact Prediction, Grasp Generation | Code Available | 1 |
| Categorical Depth Distribution Network for Monocular 3D Object Detection | Mar 1, 2021 | 3D Object Detection, Autonomous Vehicles | Code Available | 1 |
| 3D ShapeNets: A Deep Representation for Volumetric Shapes | Jun 22, 2014 | 3D Point Cloud Classification, 3D Shape Representation | Code Available | 1 |
| CerberusDet: Unified Multi-Dataset Object Detection | Jul 17, 2024 | Object, object-detection | Code Available | 1 |
| Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection | Mar 30, 2021 | Few-Shot Object Detection, Meta-Learning | Code Available | 1 |
| Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Mar 12, 2024 | Autonomous Driving, Conformal Prediction | Code Available | 1 |
| Ground-aware Monocular 3D Object Detection for Autonomous Driving | Feb 1, 2021 | 3D Object Detection, 6D Pose Estimation using RGB | Code Available | 1 |
| Grounding 3D Object Affordance from 2D Interactions in Images | Mar 18, 2023 | Object | Code Available | 1 |
| Group Collaborative Learning for Co-Salient Object Detection | Mar 15, 2021 | Co-Salient Object Detection, Object | Code Available | 1 |
| Adaptive Class Suppression Loss for Long-Tail Object Detection | Apr 2, 2021 | Object, object-detection | Code Available | 1 |
| Category-Level 6D Object Pose and Size Estimation using Self-Supervised Deep Prior Deformation Networks | Jul 12, 2022 | 6D Pose Estimation, Object | Code Available | 1 |
| GuessWhat?! Visual object discovery through multi-modal dialogue | Nov 23, 2016 | Object, Object Discovery | Code Available | 1 |
| Guided Slot Attention for Unsupervised Video Object Segmentation | Mar 15, 2023 | Object, Semantic Segmentation | Code Available | 1 |
| Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection | Apr 19, 2023 | 3D Object Detection, Attribute | Code Available | 1 |
| H3DNet: 3D Object Detection Using Hybrid Geometric Primitives | Jun 10, 2020 | 3D Object Detection, Object | Code Available | 1 |
| HallE-Control: Controlling Object Hallucination in Large Multimodal Models | Oct 3, 2023 | Attribute, Decoder | Code Available | 1 |
| HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image | Sep 14, 2023 | Motion Planning, Object | Code Available | 1 |
| Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning | May 9, 2022 | Image Captioning, Object | Code Available | 1 |
| Densely Constrained Depth Estimator for Monocular 3D Object Detection | Jul 20, 2022 | 3D Object Detection, Depth Estimation | Code Available | 1 |
| Densely Deformable Efficient Salient Object Detection Network | Feb 12, 2021 | Object, object-detection | Code Available | 1 |
| Harmonizing Transferability and Discriminability for Adapting Object Detectors | Mar 13, 2020 | Object, object-detection | Code Available | 1 |
| CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation | May 24, 2024 | Generalized Referring Expression Segmentation, Object | Code Available | 1 |
| HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors | Jul 12, 2022 | Knowledge Distillation, Object | Code Available | 1 |
| Category Query Learning for Human-Object Interaction Classification | Mar 24, 2023 | Classification, Decoder | Code Available | 1 |
| Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Aug 15, 2023 | Decoder, Object | Code Available | 1 |
| CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images | Jan 1, 2024 | 3D Object Detection, 3D Reconstruction | Code Available | 1 |
| CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning | Oct 10, 2019 | Diagnostic, Object | Code Available | 1 |
| Hierarchical Memory Matching Network for Video Object Segmentation | Sep 23, 2021 | Object, Retrieval | Code Available | 1 |
| Hierarchical Object-to-Zone Graph for Object Navigation | Sep 5, 2021 | Deep Reinforcement Learning, Object | Code Available | 1 |
| CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation | Sep 19, 2021 | Domain Adaptation, Domain Generalization | Code Available | 1 |
| CATRE: Iterative Point Clouds Alignment for Category-level Object Pose Refinement | Jul 17, 2022 | Object, Pose Estimation | Code Available | 1 |
| BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection | Dec 4, 2023 | 3D Object Detection, Decoder | Code Available | 1 |
| High Pileup Particle Tracking with Object Condensation | Dec 6, 2023 | Edge Classification, Object | Code Available | 1 |
| Dense Learning based Semi-Supervised Object Detection | Apr 15, 2022 | Object, object-detection | Code Available | 1 |
| Densely Nested Top-Down Flows for Salient Object Detection | Feb 18, 2021 | Object, object-detection | Code Available | 1 |
| Density Map Guided Object Detection in Aerial Images | Apr 12, 2020 | Image Cropping, Object | Code Available | 1 |
| HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D Reconstruction | Jun 24, 2022 | 3D Reconstruction, Camera Pose Estimation | Code Available | 1 |
| CBNet: A Composite Backbone Network Architecture for Object Detection | Jul 1, 2021 | Instance Segmentation, Object | Code Available | 1 |
| BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios | Dec 12, 2022 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images | Dec 16, 2021 | Object, Semantic Segmentation | Code Available | 1 |
| CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | May 2, 2025 | Cross-Domain Few-Shot, Cross-Domain Few-Shot Object Detection | Code Available | 1 |
| CDNet is all you need: Cascade DCN based underwater object detection RCNN | Nov 25, 2021 | All, Object | Code Available | 1 |
| C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation | Feb 27, 2025 | Object, Video Generation | Code Available | 1 |
| Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation | Oct 23, 2020 | Object, One-shot visual object segmentation | Code Available | 1 |
| BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection | Nov 17, 2022 | 3D Object Detection, Depth Estimation | Code Available | 1 |