| Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Guided Exploration for Zero-Shot Object Navigation | Mar 6, 2023 | Motion PlanningObject | CodeCode Available | 1 |
| Zero-shot Object Counting | Mar 3, 2023 | ObjectObject Counting | CodeCode Available | 1 |
| BSH-Det3D: Improving 3D Object Detection with BEV Shape Heatmap | Mar 3, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices | Mar 3, 2023 | 3D geometry3D Object Reconstruction | CodeCode Available | 1 |
| LANDMARK: Language-guided Representation Enhancement Framework for Scene Graph Generation | Mar 2, 2023 | Graph GenerationObject | CodeCode Available | 1 |
| Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) Equivariance | Feb 28, 2023 | DisentanglementObject | CodeCode Available | 1 |
| DFR-FastMOT: Detection Failure Resistant Tracker for Fast Multi-Object Tracking Based on Sensor Fusion | Feb 28, 2023 | Autonomous VehiclesMulti-Object Tracking | CodeCode Available | 1 |
| Aligning Bag of Regions for Open-Vocabulary Object Detection | Feb 27, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Causality Compensated Attention for Contextual Biased Visual Recognition | Feb 25, 2023 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | CodeCode Available | 1 |
| Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions | Feb 23, 2023 | ObjectPrediction | CodeCode Available | 1 |
| Cross-domain Compositing with Pretrained Diffusion Models | Feb 20, 2023 | Data AugmentationObject | CodeCode Available | 1 |
| Accelerated Video Annotation driven by Deep Detector and Tracker | Feb 19, 2023 | Object | CodeCode Available | 1 |
| DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes | Feb 15, 2023 | Multi-Object TrackingObject | CodeCode Available | 1 |
| Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking Applications | Feb 12, 2023 | 3D Object TrackingObject | CodeCode Available | 1 |
| Dual Relation Knowledge Distillation for Object Detection | Feb 11, 2023 | Knowledge DistillationModel Compression | CodeCode Available | 1 |
| Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames | Feb 9, 2023 | ObjectObject Discovery | CodeCode Available | 1 |
| Look Around and Learn: Self-Training Object Detection by Exploration | Feb 7, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction | Feb 7, 2023 | Instance SegmentationMulti-Object Tracking | CodeCode Available | 1 |
| Normalizing Flow based Feature Synthesis for Outlier-Aware Object Detection | Feb 1, 2023 | Autonomous DrivingObject | CodeCode Available | 1 |
| Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection | Feb 1, 2023 | ObjectRelation | CodeCode Available | 1 |
| Few-Shot Object Detection via Variational Feature Aggregation | Jan 31, 2023 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 1 |
| Unlocking Slot Attention by Changing Optimal Transport Costs | Jan 30, 2023 | Object | CodeCode Available | 1 |
| On the Adversarial Robustness of Camera-based 3D Object Detection | Jan 25, 2023 | 3D Object DetectionAdversarial Attack | CodeCode Available | 1 |
| Planar Object Tracking via Weighted Optical Flow | Jan 24, 2023 | ObjectObject Tracking | CodeCode Available | 1 |
| OvarNet: Towards Open-vocabulary Object Attribute Recognition | Jan 23, 2023 | AttributeKnowledge Distillation | CodeCode Available | 1 |
| Long-tail Detection with Effective Class-Margins | Jan 23, 2023 | Binary ClassificationInstance Segmentation | CodeCode Available | 1 |
| Recurrent Generic Contour-based Instance Segmentation with Progressive Learning | Jan 21, 2023 | Instance SegmentationLane Detection | CodeCode Available | 1 |
| Towards Spatial Equilibrium Object Detection | Jan 14, 2023 | Objectobject-detection | CodeCode Available | 1 |
| CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Jan 13, 2023 | Domain Generalizationimage-classification | CodeCode Available | 1 |
| Open-vocabulary Object Segmentation with Diffusion Models | Jan 12, 2023 | Image SegmentationObject | CodeCode Available | 1 |
| SHUNIT: Style Harmonization for Unpaired Image-to-Image Translation | Jan 11, 2023 | Image-to-Image TranslationObject | CodeCode Available | 1 |
| Rethinking Voxelization and Classification for 3D Object Detection | Jan 10, 2023 | 3D Object DetectionClassification | CodeCode Available | 1 |
| HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection | Jan 8, 2023 | global-optimizationObject | CodeCode Available | 1 |
| FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection | Jan 8, 2023 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| Object as Query: Lifting any 2D Object Detector to 3D Detection | Jan 6, 2023 | 3D Object DetectionObject | CodeCode Available | 1 |
| End-to-End 3D Dense Captioning with Vote2Cap-DETR | Jan 6, 2023 | 3D dense captioningDecoder | CodeCode Available | 1 |
| TempSAL -- Uncovering Temporal Information for Deep Saliency Prediction | Jan 5, 2023 | ObjectObject Recognition | CodeCode Available | 1 |
| Correlation Loss: Enforcing Correlation between Classification and Localization | Jan 3, 2023 | ClassificationInductive Bias | CodeCode Available | 1 |
| LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object Detection | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Feature Aggregated Queries for Transformer-Based Video Object Detectors | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation | Jan 1, 2023 | ObjectScene Understanding | CodeCode Available | 1 |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance SegmentationMulti-Object Tracking | CodeCode Available | 1 |
| Object Detection With Self-Supervised Scene Adaptation | Jan 1, 2023 | Data AugmentationObject | CodeCode Available | 1 |
| A Fast Unified System for 3D Object Detection and Tracking | Jan 1, 2023 | 3D Object DetectionMulti-Object Tracking | CodeCode Available | 1 |
| Novel Scenes & Classes: Towards Adaptive Open-set Object Detection | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| TempSAL - Uncovering Temporal Information for Deep Saliency Prediction | Jan 1, 2023 | ObjectObject Recognition | CodeCode Available | 1 |
| Video Object Segmentation-aware Video Frame Interpolation | Jan 1, 2023 | ObjectPose Estimation | CodeCode Available | 1 |
| Harmonious Teacher for Cross-Domain Object Detection | Jan 1, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Foreground-Background Separation through Concept Distillation from Generative Image Foundation Models | Jan 1, 2023 | Conditional Image GenerationImage Generation | CodeCode Available | 1 |
| PartDistillation: Learning Parts From Instance Segmentation | Jan 1, 2023 | Instance SegmentationObject | CodeCode Available | 1 |