| Learning Layout and Style Reconfigurable GANs for Controllable Image Synthesis | Mar 25, 2020 | Image GenerationLayout-to-Image Generation | CodeCode Available | 1 |
| Learning Long-term Visual Dynamics with Region Proposal Interaction Networks | Aug 5, 2020 | Common Sense ReasoningObject | CodeCode Available | 1 |
| Object Detection with Transformers: A Review | Jun 7, 2023 | 2D Object DetectionObject | CodeCode Available | 1 |
| Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views | Nov 13, 2021 | ObjectScene Understanding | CodeCode Available | 1 |
| Learning Object-Language Alignments for Open-Vocabulary Object Detection | Nov 27, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Learning Object-level Point Augmentor for Semi-supervised 3D Object Detection | Dec 19, 2022 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| EAO-SLAM: Monocular Semi-Dense Object SLAM Based on Ensemble Data Association | Apr 27, 2020 | ObjectObject SLAM | CodeCode Available | 1 |
| DiffuBox: Refining 3D Object Detection with Point Diffusion | May 25, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Learning Open-World Object Proposals without Learning to Classify | Aug 15, 2021 | Objectobject-detection | CodeCode Available | 1 |
| Learning Physical Graph Representations from Visual Scenes | Jun 22, 2020 | ObjectObject Categorization | CodeCode Available | 1 |
| Learning RoI Transformer for Oriented Object Detection in Aerial Images | Jun 1, 2019 | Objectobject-detection | CodeCode Available | 1 |
| Learning Spatial-Frequency Transformer for Visual Object Tracking | Aug 18, 2022 | ObjectObject Tracking | CodeCode Available | 1 |
| Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark | Nov 25, 2021 | Matrix CompletionMoving Object Detection | CodeCode Available | 1 |
| Boosting 3D Object Detection via Object-Focused Image Fusion | Jul 21, 2022 | 3D Object DetectionObject | CodeCode Available | 1 |
| Detecting Camouflaged Object in Frequency Domain | Jan 1, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Learning to Detect Mobile Objects from LiDAR Scans Without Labels | Mar 29, 2022 | Autonomous DrivingCommon Sense Reasoning | CodeCode Available | 1 |
| Detecting Every Object from Events | Apr 8, 2024 | Autonomous DrivingClass-agnostic Object Detection | CodeCode Available | 1 |
| Learning to Discover and Detect Objects | Oct 19, 2022 | Novel Class DiscoveryNovel Object Detection | CodeCode Available | 1 |
| Detecting Human-Object Contact in Images | Mar 6, 2023 | Object | CodeCode Available | 1 |
| Learning to Regrasp by Learning to Place | Sep 18, 2021 | DiversityObject | CodeCode Available | 1 |
| Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics | Feb 1, 2022 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| Detecting Human-Object Interaction via Fabricated Compositional Learning | Mar 15, 2021 | Affordance RecognitionHuman-Object Interaction Detection | CodeCode Available | 1 |
| Learning Video Salient Object Detection Progressively from Unlabeled Videos | Apr 5, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Learning What and Where: Disentangling Location and Identity Tracking Without Supervision | May 26, 2022 | ObjectVideo Object Tracking | CodeCode Available | 1 |
| Detecting Invisible People | Dec 15, 2020 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 1 |
| Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation | Jun 6, 2023 | ObjectVideo Generation | CodeCode Available | 1 |
| Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning | Oct 4, 2021 | HallucinationImage Captioning | CodeCode Available | 1 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | AttributeCompositional Zero-Shot Learning | CodeCode Available | 1 |
| EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Mar 30, 2025 | AttributeDisentanglement | CodeCode Available | 1 |
| EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering | Dec 19, 2023 | ObjectObject Counting | CodeCode Available | 1 |
| Edge-Aware Mirror Network for Camouflaged Object Detection | Jul 8, 2023 | Camouflaged Object SegmentationEdge Detection | CodeCode Available | 1 |
| Efficient Decoder-free Object Detection with Transformers | Jun 14, 2022 | DecoderObject | CodeCode Available | 1 |
| Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater Object Detection | Jun 28, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Light Field Salient Object Detection: A Review and Benchmark | Oct 10, 2020 | BenchmarkingObject | CodeCode Available | 1 |
| Linking vision and motion for self-supervised object-centric perception | Jul 14, 2023 | Autonomous DrivingObject | CodeCode Available | 1 |
| DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios | Mar 25, 2025 | 3D Object DetectionObject | CodeCode Available | 1 |
| Boosting Segment Anything Model Towards Open-Vocabulary Learning | Dec 6, 2023 | modelObject | CodeCode Available | 1 |
| LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts | Oct 16, 2023 | Image GenerationLayout-to-Image Generation | CodeCode Available | 1 |
| Detecting the open-world objects with the help of the Brain | Mar 21, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Dec 5, 2023 | 3D Object DetectionDenoising | CodeCode Available | 1 |
| Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmark | Jun 28, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Detection and tracking of fingertips for geometric transformation of objects in virtual environment | Mar 16, 2020 | Fingertip DetectionHand Detection | CodeCode Available | 1 |
| Localization Distillation for Dense Object Detection | Feb 24, 2021 | Dense Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| D3D-HOI: Dynamic 3D Human-Object Interactions from Videos | Aug 19, 2021 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| Detection as Regression: Certified Object Detection by Median Smoothing | Jul 7, 2020 | Objectobject-detection | CodeCode Available | 1 |
| Localized Vision-Language Matching for Open-vocabulary Object Detection | May 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering | Jul 30, 2024 | Inverse RenderingNeRF | CodeCode Available | 1 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Oct 30, 2023 | DenoisingGPU | CodeCode Available | 1 |
| Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer | Jul 15, 2020 | Objectobject-detection | CodeCode Available | 1 |
| Dynamic Relevance Learning for Few-Shot Object Detection | Aug 4, 2021 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 1 |