| AFPN: Asymptotic Feature Pyramid Network for Object Detection | Jun 28, 2023 | Object, object-detection | Code Available | 1 |
| Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation | Sep 12, 2023 | Image Captioning, Image Generation | Code Available | 1 |
| Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds | Mar 29, 2020 | 3D Object Classification, General Classification | Code Available | 1 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Jul 12, 2024 | Collaborative Inference, Language Modelling | Code Available | 1 |
| Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection | Aug 11, 2023 | Object, object-detection | Code Available | 1 |
| GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture | Jun 3, 2021 | Clustering, Object | Code Available | 1 |
| D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence | Sep 30, 2022 | 3D Object Detection, Object | Code Available | 1 |
| A Framework for Extracting and Encoding Features from Object-Centric Event Data | Sep 2, 2022 | Object | Code Available | 1 |
| ABO: Dataset and Benchmarks for Real-World 3D Object Understanding | Oct 12, 2021 | 3D Reconstruction, Metric Learning | Code Available | 1 |
| BiDet: An Efficient Binarized Object Detector | Mar 9, 2020 | Binarization, Object | Code Available | 1 |
| GOOD: Exploring Geometric Cues for Detecting Objects in an Open World | Dec 22, 2022 | Class-agnostic Object Detection, Object | Code Available | 1 |
| DASH: Detection and Assessment of Systematic Hallucinations of VLMs | Mar 30, 2025 | Object | Code Available | 1 |
| GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision | Apr 16, 2025 | Object, Semantic Segmentation | Code Available | 1 |
| Attribute Descent: Simulating Object-Centric Datasets on the Content Level and Beyond | Feb 28, 2022 | Attribute, Data Augmentation | Code Available | 1 |
| Graphical Object Detection in Document Images | Aug 25, 2020 | Domain Adaptation, Object | Code Available | 1 |
| Attribute-guided image generation from layout | Aug 27, 2020 | Attribute, Image Generation | Code Available | 1 |
| Grasp Multiple Objects with One Hand | Oct 24, 2023 | Object | Code Available | 1 |
| Decoupled Adaptation for Cross-Domain Object Detection | Oct 6, 2021 | Domain Adaptation, Object | Code Available | 1 |
| African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Jun 20, 2024 | Benchmarking, Classification | Code Available | 1 |
| CubeSLAM: Monocular 3D Object SLAM | Jun 1, 2018 | 3D Object Detection, Camera Pose Estimation | Code Available | 1 |
| Grounded Affordance from Exocentric View | Aug 28, 2022 | Diversity, Human-Object Interaction Detection | Code Available | 1 |
| Grounding 3D Object Affordance from 2D Interactions in Images | Mar 18, 2023 | Object | Code Available | 1 |
| Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment | Jul 26, 2022 | Data Augmentation, Decoder | Code Available | 1 |
| Group-Free 3D Object Detection via Transformers | Apr 1, 2021 | 3D Object Detection, Object | Code Available | 1 |
| Open-vocabulary Object Segmentation with Diffusion Models | Jan 12, 2023 | Image Segmentation, Object | Code Available | 1 |
| Augmentation for small object detection | Feb 19, 2019 | Instance Segmentation, Object | Code Available | 1 |
| Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion | Jun 9, 2024 | Autonomous Driving, Object | Code Available | 1 |
| H3DNet: 3D Object Detection Using Hybrid Geometric Primitives | Jun 10, 2020 | 3D Object Detection, Object | Code Available | 1 |
| CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers | Oct 21, 2022 | 6D Pose Estimation using RGB, Object | Code Available | 1 |
| HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image | Sep 14, 2023 | Motion Planning, Object | Code Available | 1 |
| Hand-Object Interaction Controller (HOIC): Deep Reinforcement Learning for Reconstructing Interactions with Physics | May 4, 2024 | Deep Reinforcement Learning, Object | Code Available | 1 |
| HarDNet-MSEG: A Simple Encoder-Decoder Polyp Segmentation Neural Network that Achieves over 0.9 Mean Dice and 86 FPS | Jan 18, 2021 | Decoder, GPU | Code Available | 1 |
| 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction | Apr 2, 2016 | 3D Object Reconstruction, 3D Reconstruction | Code Available | 1 |
| Harmonizing Transferability and Discriminability for Adapting Object Detectors | Mar 13, 2020 | Object, object-detection | Code Available | 1 |
| CST-YOLO: A Novel Method for Blood Cell Detection Based on Improved YOLOv7 and CNN-Swin Transformer | Jun 26, 2023 | 2D Object Detection, Blood Cell Detection | Code Available | 1 |
| CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation | May 24, 2024 | Generalized Referring Expression Segmentation, Object | Code Available | 1 |
| Cue Point Estimation using Object Detection | Jul 9, 2024 | Object, object-detection | Code Available | 1 |
| Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Aug 15, 2023 | Decoder, Object | Code Available | 1 |
| A Unified Object Motion and Affinity Model for Online Multi-Object Tracking | Mar 25, 2020 | Metric Learning, Multi-Object Tracking | Code Available | 1 |
| HIC-YOLOv5: Improved YOLOv5 For Small Object Detection | Sep 28, 2023 | Object, object-detection | Code Available | 1 |
| AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation | Jun 1, 2023 | Binary Classification, Interactive Segmentation | Code Available | 1 |
| AutoAssign: Differentiable Label Assignment for Dense Object Detection | Jul 7, 2020 | Dense Object Detection, Object | Code Available | 1 |
| 3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation | Mar 30, 2020 | 3D Instance Segmentation, 3D Object Detection | Code Available | 1 |
| Object Segmentation Without Labels with Large-Scale Generative Models | Jun 8, 2020 | image-classification, Image Classification | Code Available | 1 |
| Cross-Modality Fusion Transformer for Multispectral Object Detection | Oct 30, 2021 | Multispectral Object Detection, Object | Code Available | 1 |
| AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection | Aug 25, 2021 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Accelerated Video Annotation driven by Deep Detector and Tracker | Feb 19, 2023 | Object | Code Available | 1 |
| High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | Nov 30, 2017 | Conditional Image Generation, Fundus to Angiography Generation | Code Available | 1 |
| On Object Symmetries and 6D Pose Estimation from Images | Aug 20, 2019 | 6D Pose Estimation, Object | Code Available | 1 |
| 3D-AVS: LiDAR-based 3D Auto-Vocabulary Segmentation | Jun 13, 2024 | Autonomous Driving, Object | Code Available | 1 |