| Featurized Query R-CNN | Jun 13, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Discovering Object Masks with Transformers for Unsupervised Semantic Segmentation | Jun 13, 2022 | ObjectSegmentation | CodeCode Available | 1 |
| Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection | Jun 13, 2022 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| Rethinking Spatial Invariance of Convolutional Networks for Object Counting | Jun 10, 2022 | Crowd CountingObject | CodeCode Available | 1 |
| VITA: Video Instance Segmentation via Object Token Association | Jun 9, 2022 | GPUInstance Segmentation | CodeCode Available | 1 |
| Patch-based Object-centric Transformers for Efficient Video Generation | Jun 8, 2022 | ObjectVideo Editing | CodeCode Available | 1 |
| GenSDF: Two-Stage Learning of Generalizable Signed Distance Functions | Jun 6, 2022 | Meta-LearningObject | CodeCode Available | 1 |
| Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in Videos | Jun 5, 2022 | ObjectObject Tracking | CodeCode Available | 1 |
| Modeling Image Composition for Complex Scene Generation | Jun 2, 2022 | Image GenerationLayout-to-Image Generation | CodeCode Available | 1 |
| Label-Efficient Online Continual Object Detection in Streaming Video | Jun 1, 2022 | Continual LearningHippocampus | CodeCode Available | 1 |
| Differentiable Soft-Masked Attention | Jun 1, 2022 | ObjectSegmentation | CodeCode Available | 1 |
| Voxel Field Fusion for 3D Object Detection | May 31, 2022 | 3D Object DetectionData Augmentation | CodeCode Available | 1 |
| Towards Efficient 3D Object Detection with Knowledge Distillation | May 30, 2022 | 3D Object DetectionKnowledge Distillation | CodeCode Available | 1 |
| Fast Object Placement Assessment | May 28, 2022 | Object | CodeCode Available | 1 |
| Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos | May 27, 2022 | DecoderObject | CodeCode Available | 1 |
| CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping | May 27, 2022 | ClusteringObject | CodeCode Available | 1 |
| Unsupervised Multi-object Segmentation Using Attention and Soft-argmax | May 26, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Phantom Sponges: Exploiting Non-Maximum Suppression to Attack Deep Object Detectors | May 26, 2022 | Autonomous DrivingObject | CodeCode Available | 1 |
| Learning What and Where: Disentangling Location and Identity Tracking Without Supervision | May 26, 2022 | ObjectVideo Object Tracking | CodeCode Available | 1 |
| AO2-DETR: Arbitrary-Oriented Object Detection Transformer | May 25, 2022 | DecoderInductive Bias | CodeCode Available | 1 |
| Deep Gradient Learning for Efficient Camouflaged Object Detection | May 25, 2022 | Defect DetectionObject | CodeCode Available | 1 |
| TraCon: A novel dataset for real-time traffic cones detection using deep learning | May 24, 2022 | Objectobject-detection | CodeCode Available | 1 |
| PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models | May 23, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Deeper Understanding of Camouflaged Object Detection | May 23, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection | May 19, 2022 | DecoderFew-Shot Object Detection | CodeCode Available | 1 |
| Disentangling Visual Embeddings for Attributes and Objects | May 17, 2022 | AttributeCompositional Zero-Shot Learning | CodeCode Available | 1 |
| Localized Vision-Language Matching for Open-vocabulary Object Detection | May 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Topologically-Aware Deformation Fields for Single-View 3D Reconstruction | May 12, 2022 | 3D ReconstructionObject | CodeCode Available | 1 |
| Identifying concept libraries from language about object structure | May 11, 2022 | 2kMachine Translation | CodeCode Available | 1 |
| Learning Non-target Knowledge for Few-shot Semantic Segmentation | May 10, 2022 | Contrastive LearningFew-Shot Semantic Segmentation | CodeCode Available | 1 |
| Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning | May 9, 2022 | Image CaptioningObject | CodeCode Available | 1 |
| Transformer Tracking with Cyclic Shifting Window Attention | May 8, 2022 | ObjectObject Tracking | CodeCode Available | 1 |
| Recurrent Dynamic Embedding for Video Object Segmentation | May 8, 2022 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| GenISP: Neural ISP for Low-Light Machine Cognition | May 7, 2022 | BenchmarkingImage Restoration | CodeCode Available | 1 |
| Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism | May 6, 2022 | Continual LearningDiversity | CodeCode Available | 1 |
| Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection | May 6, 2022 | 2D Object DetectionObject | CodeCode Available | 1 |
| MTTrans: Cross-Domain Object Detection with Mean-Teacher Transformer | May 3, 2022 | Domain AdaptationObject | CodeCode Available | 1 |
| Cross Domain Object Detection by Target-Perceived Dual Branch Distillation | May 3, 2022 | Objectobject-detection | CodeCode Available | 1 |
| VICE: Variational Interpretable Concept Embeddings | May 2, 2022 | Experimental DesignObject | CodeCode Available | 1 |
| 3D Object Detection with a Self-supervised Lidar Scene Flow Backbone | May 2, 2022 | 3D Object DetectionObject | CodeCode Available | 1 |
| Self-Supervised Learning of Object Parts for Semantic Segmentation | Apr 27, 2022 | Community DetectionImage Segmentation | CodeCode Available | 1 |
| Coupled Iterative Refinement for 6D Multi-Object Pose Estimation | Apr 26, 2022 | 6D Pose Estimation using RGBObject | CodeCode Available | 1 |
| RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning | Apr 24, 2022 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds | Apr 22, 2022 | 3D dense captioning3D Object Detection | CodeCode Available | 1 |
| Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency | Apr 21, 2022 | 3D Object Reconstruction3D Object Reconstruction From A Single Image | CodeCode Available | 1 |
| Modeling Missing Annotations for Incremental Learning in Object Detection | Apr 19, 2022 | Incremental LearningInstance Segmentation | CodeCode Available | 1 |
| Dense Learning based Semi-Supervised Object Detection | Apr 15, 2022 | Objectobject-detection | CodeCode Available | 1 |
| Towards PAC Multi-Object Detection and Tracking | Apr 15, 2022 | Autonomous NavigationConformal Prediction | CodeCode Available | 1 |
| Interactive Object Segmentation in 3D Point Clouds | Apr 14, 2022 | 3D Instance SegmentationImage Segmentation | CodeCode Available | 1 |
| BEHAVE: Dataset and Method for Tracking Human Object Interactions | Apr 14, 2022 | 3D Human Reconstruction3D Object Reconstruction | CodeCode Available | 1 |