| Panoramic Video Salient Object Detection with Ambisonic Audio Guidance | Nov 26, 2022 | Objectobject-detection | —Unverified | 0 |
| Language-Assisted 3D Feature Learning for Semantic Scene Understanding | Nov 25, 2022 | DescriptiveInstance Segmentation | CodeCode Available | 1 |
| PoET: Pose Estimation Transformer for Single-View, Multi-Object 6D Pose Estimation | Nov 25, 2022 | 6D Pose Estimation6D Pose Estimation using RGB | CodeCode Available | 1 |
| Interaction Region Visual Transformer for Egocentric Action Anticipation | Nov 25, 2022 | Action AnticipationHuman-Object Interaction Detection | CodeCode Available | 0 |
| Temporal Super-Resolution using Multi-Channel Illumination Source | Nov 25, 2022 | Motion EstimationObject | —Unverified | 0 |
| Combating noisy labels in object detection datasets | Nov 25, 2022 | Objectobject-detection | CodeCode Available | 0 |
| Physics-Based Object 6D-Pose Estimation during Non-Prehensile Manipulation | Nov 24, 2022 | 6D Pose EstimationObject | —Unverified | 0 |
| Multi-Task Learning of Object State Changes from Uncurated Videos | Nov 24, 2022 | Multi-Task LearningObject | CodeCode Available | 1 |
| Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Nov 24, 2022 | 2D Object DetectionImage Retrieval | CodeCode Available | 2 |
| One-Shot General Object Localization | Nov 24, 2022 | ObjectObject Localization | CodeCode Available | 0 |
| Object Detection in Foggy Scenes by Embedding Depth and Reconstruction into Domain Adaptation | Nov 24, 2022 | DecoderDomain Adaptation | CodeCode Available | 1 |
| On Designing Light-Weight Object Trackers through Network Pruning: Use CNNs or Transformers? | Nov 24, 2022 | Network PruningObject | CodeCode Available | 0 |
| Few-shot Object Detection with Refined Contrastive Learning | Nov 24, 2022 | Contrastive LearningFew-Shot Object Detection | —Unverified | 0 |
| UV-Based 3D Hand-Object Reconstruction with Grasp Optimization | Nov 24, 2022 | 3D Hand Pose EstimationObject | —Unverified | 0 |
| 1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results | Nov 24, 2022 | Objectobject-detection | —Unverified | 0 |
| Video Instance Shadow Detection Under the Sun and Sky | Nov 23, 2022 | Contrastive LearningInstance Shadow Detection | CodeCode Available | 1 |
| Learning to Imitate Object Interactions from Internet Videos | Nov 23, 2022 | Object | —Unverified | 0 |
| Open-vocabulary Attribute Detection | Nov 23, 2022 | AttributeLanguage Modeling | CodeCode Available | 1 |
| Structural Knowledge Distillation for Object Detection | Nov 23, 2022 | Feature ImportanceKnowledge Distillation | —Unverified | 0 |
| Reason from Context with Self-supervised Learning | Nov 23, 2022 | ObjectObject Recognition | —Unverified | 0 |
| Autonomous Marker-less Rapid Aerial Grasping | Nov 23, 2022 | ObjectObject Localization | —Unverified | 0 |
| UpCycling: Semi-supervised 3D Object Detection without Sharing Raw-level Unlabeled Scenes | Nov 22, 2022 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Boundary-aware Camouflaged Object Detection via Deformable Point Sampling | Nov 22, 2022 | Objectobject-detection | —Unverified | 0 |
| Transformation-Equivariant 3D Object Detection for Autonomous Driving | Nov 22, 2022 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| ONeRF: Unsupervised 3D Object Segmentation from Multiple Views | Nov 22, 2022 | 3D scene EditingObject | —Unverified | 0 |
| Dual Prototype Attention for Unsupervised Video Object Segmentation | Nov 22, 2022 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| β-Multivariational Autoencoder for Entangled Representation Learning in Video Frames | Nov 22, 2022 | Decision MakingObject | CodeCode Available | 0 |
| AeDet: Azimuth-invariant Multi-view 3D Object Detection | Nov 22, 2022 | 3D Object DetectionDepth Estimation | CodeCode Available | 1 |
| OCTET: Object-aware Counterfactual Explanations | Nov 22, 2022 | Autonomous Drivingcounterfactual | CodeCode Available | 1 |
| Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections | Nov 22, 2022 | Object | —Unverified | 0 |
| Improving Crowded Object Detection via Copy-Paste | Nov 22, 2022 | Data AugmentationObject | —Unverified | 0 |
| Open-Set Object Detection Using Classification-free Object Proposal and Instance-level Contrastive Learning | Nov 21, 2022 | Contrastive LearningObject | —Unverified | 0 |
| Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors | Nov 21, 2022 | ObjectPose Estimation | —Unverified | 0 |
| Compositional Scene Modeling with Global Object-Centric Representations | Nov 21, 2022 | ObjectPatch Matching | —Unverified | 0 |
| LISA: Localized Image Stylization with Audio via Implicit Neural Representation | Nov 21, 2022 | Image StylizationObject | —Unverified | 0 |
| Plug and Play Active Learning for Object Detection | Nov 21, 2022 | Active LearningDiversity | CodeCode Available | 1 |
| Learning Implicit Probability Distribution Functions for Symmetric Orientation Estimation from RGB Images Without Pose Labels | Nov 21, 2022 | ObjectPoint Cloud Registration | —Unverified | 0 |
| Visual Dexterity: In-Hand Reorientation of Novel and Complex Object Shapes | Nov 21, 2022 | Object | CodeCode Available | 1 |
| Mean Shift Mask Transformer for Unseen Object Instance Segmentation | Nov 21, 2022 | ClusteringImage Segmentation | CodeCode Available | 1 |
| Simultaneous Multiple Object Detection and Pose Estimation using 3D Model Infusion with Monocular Vision | Nov 21, 2022 | Autonomous DrivingObject | CodeCode Available | 1 |
| Unifying Tracking and Image-Video Object Detection | Nov 20, 2022 | Multi-Object TrackingObject | —Unverified | 0 |
| Context-Aware Data Augmentation for LIDAR 3D Object Detection | Nov 20, 2022 | 3D Object DetectionData Augmentation | —Unverified | 0 |
| Efficient Representations of Object Geometry for Reinforcement Learning of Interactive Grasping Policies | Nov 20, 2022 | Objectreinforcement-learning | —Unverified | 0 |
| Distinctive Self-Similar Object Detection | Nov 20, 2022 | Objectobject-detection | —Unverified | 0 |
| ProCC: Progressive Cross-primitive Compatibility for Open-World Compositional Zero-Shot Learning | Nov 19, 2022 | Compositional Zero-Shot LearningObject | —Unverified | 0 |
| Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning | Nov 19, 2022 | Compositional Zero-Shot LearningNovel Concepts | CodeCode Available | 1 |
| An Enhanced Object Detection Model for Scene Graph Generation | Nov 18, 2022 | Graph GenerationImage Captioning | —Unverified | 0 |
| Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization | Nov 18, 2022 | Object | CodeCode Available | 1 |
| Detect Only What You Specify : Object Detection with Linguistic Target | Nov 18, 2022 | DecoderObject | —Unverified | 0 |
| A mixed-reality dataset for category-level 6D pose and size estimation of hand-occluded containers | Nov 18, 2022 | 6D Pose Estimation using RGBMixed Reality | —Unverified | 0 |