| Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification | Jan 17, 2021 | Fine-Grained Image Classification, General Classification | Code Available | 1 | 5 |
| DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Mar 19, 2024 | Object, object-detection | Code Available | 1 | 5 |
| Context-aware Cross-level Fusion Network for Camouflaged Object Detection | May 26, 2021 | Camouflaged Object Segmentation, Object | Code Available | 1 | 5 |
| GSPN: Generative Shape Proposal Network for 3D Instance Segmentation in Point Cloud | Dec 8, 2018 | 3D Instance Segmentation, 3D Object Detection | Code Available | 1 | 5 |
| Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation | Sep 12, 2023 | Image Captioning, Image Generation | Code Available | 1 | 5 |
| Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos | Jun 5, 2022 | Object, Object Tracking | Code Available | 1 | 5 |
| Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction | Aug 28, 2020 | 3D Reconstruction, Object | Code Available | 1 | 5 |
| Canonical Capsules: Self-Supervised Capsules in Canonical Pose | Dec 8, 2020 | 3D Point Cloud Reconstruction, General Classification | Code Available | 1 | 5 |
| ContactPose: A Dataset of Grasps with Object Contact and Hand Pose | Jul 19, 2020 | Grasp Contact Prediction, Object | Code Available | 1 | 5 |
| 3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding | Mar 30, 2021 | Affordance Detection, Benchmarking | Code Available | 1 | 5 |
| Can OOD Object Detectors Learn from Foundation Models? | Sep 8, 2024 | Object, object-detection | Code Available | 1 | 5 |
| Can SAM Segment Anything? When SAM Meets Camouflaged Object Detection | Apr 10, 2023 | Object, object-detection | Code Available | 1 | 5 |
| Diagnosing Human-object Interaction Detectors | Aug 16, 2023 | Classification, Human-Object Interaction Detection | Code Available | 1 | 5 |
| DiffuBox: Refining 3D Object Detection with Point Diffusion | May 25, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 | 5 |
| DFR-FastMOT: Detection Failure Resistant Tracker for Fast Multi-Object Tracking Based on Sensor Fusion | Feb 28, 2023 | Autonomous Vehicles, Multi-Object Tracking | Code Available | 1 | 5 |
| D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions | Dec 1, 2021 | Motion Synthesis, Object | Code Available | 1 | 5 |
| Contemplating real-world object classification | Mar 8, 2021 | Classification, Data Augmentation | Code Available | 1 | 5 |
| ContactGen: Generative Contact Modeling for Grasp Generation | Oct 5, 2023 | Grasp Generation, Object | Code Available | 1 | 5 |
| Differentiable Physics Simulation of Dynamics-Augmented Neural Objects | Oct 17, 2022 | Friction, Object | Code Available | 1 | 5 |
| CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation | Jun 1, 2024 | 2D Pose Estimation, Animal Pose Estimation | Code Available | 1 | 5 |
| 3D Object Detection with a Self-supervised Lidar Scene Flow Backbone | May 2, 2022 | 3D Object Detection, Object | Code Available | 1 | 5 |
| Differentiable Soft-Masked Attention | Jun 1, 2022 | Object, Segmentation | Code Available | 1 | 5 |
| Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos | May 7, 2024 | Denoising, Object | Code Available | 1 | 5 |
| Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging | Nov 28, 2022 | graph construction, named-entity-recognition | Code Available | 1 | 5 |
| Context-Aware Synthesis and Placement of Object Instances | Dec 6, 2018 | Object, Scene Parsing | Code Available | 1 | 5 |
| Contour Knowledge Transfer for Salient Object Detection | Sep 1, 2018 | Contour Detection, Object | Code Available | 1 | 5 |
| Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays | Mar 11, 2023 | Denoising, Object | Code Available | 1 | 5 |
| GTNet:Guided Transformer Network for Detecting Human-Object Interactions | Aug 2, 2021 | Human-Object Interaction Detection, Object | Code Available | 1 | 5 |
| ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection | Aug 14, 2020 | Human-Object Interaction Detection, Object | Code Available | 1 | 5 |
| Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter | Sep 6, 2023 | Contrastive Learning, Denoising | Code Available | 1 | 5 |
| ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion | Oct 16, 2023 | Depth Estimation, Depth Prediction | Code Available | 1 | 5 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Oct 30, 2023 | Denoising, GPU | Code Available | 1 | 5 |
| Grounding 3D Object Affordance from 2D Interactions in Images | Mar 18, 2023 | Object | Code Available | 1 | 5 |
| L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation | Apr 7, 2022 | Object, Semantic Segmentation | Code Available | 1 | 5 |
| DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection | Jun 21, 2024 | Class-agnostic Object Detection, Multi-object discovery | Code Available | 1 | 5 |
| DIOD: Self-Distillation Meets Object Discovery | Jan 1, 2024 | Instance Segmentation, Knowledge Distillation | Code Available | 1 | 5 |
| CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects | Mar 28, 2023 | Decoder, GPU | Code Available | 1 | 5 |
| AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Nov 13, 2023 | Attribute, Hallucination | Code Available | 1 | 5 |
| Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions | Nov 14, 2022 | Human-Object Interaction Detection, Object | Code Available | 1 | 5 |
| Cascade-DETR: Delving into High-Quality Universal Object Detection | Jul 20, 2023 | Decoder, Object | Code Available | 1 | 5 |
| SeSame: Simple, Easy 3D Object Detection with Point-Wise Semantics | Mar 11, 2024 | 2D Object Detection, 3D Object Detection | Code Available | 1 | 5 |
| Cascaded Human-Object Interaction Recognition | Mar 9, 2020 | Human-Object Interaction Detection, Object | Code Available | 1 | 5 |
| Discovering Objects that Can Move | Mar 18, 2022 | Motion Segmentation, Object | Code Available | 1 | 5 |
| Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth | Mar 29, 2022 | Depth Estimation, Depth Prediction | Code Available | 1 | 5 |
| Cascade Graph Neural Networks for RGB-D Salient Object Detection | Aug 7, 2020 | Object, object-detection | Code Available | 1 | 5 |
| Discriminative Appearance Modeling with Multi-track Pooling for Real-time Multi-object Tracking | Jan 28, 2021 | Multi-Object Tracking, Object | Code Available | 1 | 5 |
| Consistency-based Active Learning for Object Detection | Mar 18, 2021 | Active Learning, Classification | Code Available | 1 | 5 |
| LANDMARK: Language-guided Representation Enhancement Framework for Scene Graph Generation | Mar 2, 2023 | Graph Generation, Object | Code Available | 1 | 5 |
| Disentangle and Remerge: Interventional Knowledge Distillation for Few-Shot Object Detection from A Conditional Causal Perspective | Aug 26, 2022 | Few-Shot Learning, Few-Shot Object Detection | Code Available | 1 | 5 |
| Ground-aware Monocular 3D Object Detection for Autonomous Driving | Feb 1, 2021 | 3D Object Detection, 6D Pose Estimation using RGB | Code Available | 1 | 5 |