| Multiple Planar Object Tracking | Jan 1, 2023 | ObjectObject Tracking | —Unverified | 0 |
| Unsupervised Prompt Tuning for Text-Driven Object Detection | Jan 1, 2023 | Data AugmentationObject | —Unverified | 0 |
| Novel Scenes & Classes: Towards Adaptive Open-set Object Detection | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Bidirectional Alignment for Domain Adaptive Detection with Transformers | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 0 |
| Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency | Jan 1, 2023 | Image SegmentationImage-text matching | —Unverified | 0 |
| Learning Neural Implicit Surfaces with Object-Aware Radiance Fields | Jan 1, 2023 | 3D Object ReconstructionObject | —Unverified | 0 |
| Learning Image Harmonization in the Linear Color Space | Jan 1, 2023 | Image HarmonizationObject | —Unverified | 0 |
| s-Adaptive Decoupled Prototype for Few-Shot Object Detection | Jan 1, 2023 | Few-Shot Object DetectionMeta-Learning | —Unverified | 0 |
| Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval | Jan 1, 2023 | DiversityObject | CodeCode Available | 1 |
| Segment Every Reference Object in Spatial and Temporal Spaces | Jan 1, 2023 | Image SegmentationObject | —Unverified | 0 |
| Foreground-Background Distribution Modeling Transformer for Visual Object Tracking | Jan 1, 2023 | ObjectObject Tracking | —Unverified | 0 |
| FocalFormer3D: Focusing on Hard Instance for 3D Object Detection | Jan 1, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| ObjectFusion: Multi-modal 3D Object Detection with Object-Centric Fusion | Jan 1, 2023 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding | Jan 1, 2023 | DescriptiveObject | CodeCode Available | 1 |
| Semantic Information in Contrastive Learning | Jan 1, 2023 | Contrastive LearningDepth Estimation | CodeCode Available | 0 |
| Reconciling Object-Level and Global-Level Objectives for Long-Tail Detection | Jan 1, 2023 | Multi-Task LearningObject | CodeCode Available | 0 |
| CHORUS : Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images | Jan 1, 2023 | Common Sense ReasoningDiversity | —Unverified | 0 |
| Category-aware Allocation Transformer for Weakly Supervised Object Localization | Jan 1, 2023 | ObjectObject Localization | —Unverified | 0 |
| Self-Supervised Object Detection from Egocentric Videos | Jan 1, 2023 | Class-agnostic Object DetectionObject | —Unverified | 0 |
| Deep Active Contours for Real-time 6-DoF Object Tracking | Jan 1, 2023 | Computational EfficiencyObject | —Unverified | 0 |
| ObjectStitch: Object Compositing With Diffusion Model | Jan 1, 2023 | Data Augmentationmodel | —Unverified | 0 |
| Learning To Segment Every Referring Object Point by Point | Jan 1, 2023 | ObjectReferring Expression | CodeCode Available | 0 |
| Leverage Interactive Affinity for Affordance Learning | Jan 1, 2023 | Human-Object Interaction DetectionObject | CodeCode Available | 0 |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance SegmentationMulti-Object Tracking | CodeCode Available | 1 |
| AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers | Jan 1, 2023 | 3D Object DetectionObject | CodeCode Available | 0 |
| MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding From Object Detection | Jan 1, 2023 | Infrared And Visible Image FusionMeta-Learning | CodeCode Available | 1 |
| Command-Driven Articulated Object Understanding and Manipulation | Jan 1, 2023 | motion predictionObject | —Unverified | 0 |
| MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence | Jan 1, 2023 | ObjectObject Recognition | CodeCode Available | 0 |
| Weak-Shot Object Detection Through Mutual Knowledge Transfer | Jan 1, 2023 | Multiple Instance LearningObject | —Unverified | 0 |
| Transformer-Based Unified Recognition of Two Hands Manipulating Objects | Jan 1, 2023 | Action RecognitionObject | CodeCode Available | 1 |
| SMOC-Net: Leveraging Camera Pose for Self-Supervised Monocular Object Pose Estimation | Jan 1, 2023 | 6D Pose Estimation using RGBKnowledge Distillation | —Unverified | 0 |
| Semi-Supervised Stereo-Based 3D Object Detection via Cross-View Consensus | Jan 1, 2023 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| What You Can Reconstruct From a Shadow | Jan 1, 2023 | 3D ReconstructionObject | —Unverified | 0 |
| ORCa: Glossy Objects As Radiance-Field Cameras | Jan 1, 2023 | Novel View SynthesisObject | —Unverified | 0 |
| Discriminating Known From Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 0 |
| LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object Detection | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Generalized UAV Object Detection via Frequency Domain Disentanglement | Jan 1, 2023 | DisentanglementObject | —Unverified | 0 |
| Autonomous Manipulation Learning for Similar Deformable Objects via Only One Demonstration | Jan 1, 2023 | Deformable Object ManipulationObject | —Unverified | 0 |
| MOVES: Manipulated Objects in Video Enable Segmentation | Jan 1, 2023 | ObjectOptical Flow Estimation | —Unverified | 0 |
| AttentionShift: Iteratively Estimated Part-Based Attention Map for Pointly Supervised Instance Segmentation | Jan 1, 2023 | Instance SegmentationObject | —Unverified | 0 |
| Gaussian Label Distribution Learning for Spherical Image Object Detection | Jan 1, 2023 | Objectobject-detection | —Unverified | 0 |
| L-CoIns: Language-Based Colorization With Instance Awareness | Jan 1, 2023 | ColorizationDescriptive | —Unverified | 0 |
| Few-Shot Referring Relationships in Videos | Jan 1, 2023 | ObjectRelation Network | CodeCode Available | 0 |
| Harmonious Teacher for Cross-Domain Object Detection | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Feature Aggregated Queries for Transformer-Based Video Object Detectors | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding | Jan 1, 2023 | ObjectSpatio-Temporal Video Grounding | —Unverified | 0 |
| Evolved Part Masking for Self-Supervised Learning | Jan 1, 2023 | image-classificationImage Classification | —Unverified | 0 |
| Learnable Skeleton-Aware 3D Point Cloud Sampling | Jan 1, 2023 | ObjectPoint Cloud Classification | —Unverified | 0 |
| Toward RAW Object Detection: A New Benchmark and a New Model | Jan 1, 2023 | Autonomous DrivingObject | —Unverified | 0 |
| RealFusion: 360deg Reconstruction of Any Object From a Single Image | Jan 1, 2023 | 3D ReconstructionObject | —Unverified | 0 |