| MonoNext: A 3D Monocular Object Detection with ConvNext | Aug 1, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Diffusion Model for Camouflaged Object Detection | Aug 1, 2023 | Camouflaged Object SegmentationDenoising | CodeCode Available | 0 |
| A Modular Ontology for MODS -- Metadata Object Description Schema | Jul 31, 2023 | Knowledge GraphsObject | —Unverified | 0 |
| Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics | Jul 31, 2023 | ObjectSegmentation | —Unverified | 0 |
| Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks | Jul 31, 2023 | Image RetrievalObject | —Unverified | 0 |
| Detecting Out-of-distribution Objects Using Neuron Activation Patterns | Jul 31, 2023 | Autonomous VehiclesObject | CodeCode Available | 0 |
| Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving | Jul 30, 2023 | Autonomous DrivingMixture-of-Experts | —Unverified | 0 |
| Implementing Edge Based Object Detection For Microplastic Debris | Jul 30, 2023 | Objectobject-detection | —Unverified | 0 |
| Enhancing Object Detection in Ancient Documents with Synthetic Data Generation and Transformer-Based Models | Jul 29, 2023 | Objectobject-detection | —Unverified | 0 |
| RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control | Jul 28, 2023 | ObjectQuestion Answering | CodeCode Available | 2 |
| Uncertainty-aware Unsupervised Multi-Object Tracking | Jul 28, 2023 | Multi-Object TrackingObject | CodeCode Available | 1 |
| Aligned Unsupervised Pretraining of Object Detectors with Self-training | Jul 28, 2023 | Few-Shot Object DetectionObject | —Unverified | 0 |
| MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking | Jul 28, 2023 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 2 |
| Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation | Jul 28, 2023 | 6D Pose Estimation6D Pose Estimation using RGB | CodeCode Available | 1 |
| TrackAgent: 6D Object Tracking via Reinforcement Learning | Jul 28, 2023 | ObjectObject Tracking | —Unverified | 0 |
| Generalized Open-World Semi-Supervised Object Detection | Jul 28, 2023 | Objectobject-detection | —Unverified | 0 |
| The detection and rectification for identity-switch based on unfalsified control | Jul 27, 2023 | Multi-Object TrackingObject | —Unverified | 0 |
| A Memory-Augmented Multi-Task Collaborative Framework for Unsupervised Traffic Accident Detection in Driving Videos | Jul 27, 2023 | Autonomous DrivingObject | —Unverified | 0 |
| Tracking Anything in High Quality | Jul 26, 2023 | ObjectObject Tracking | CodeCode Available | 2 |
| YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems | Jul 26, 2023 | BenchmarkingCPU | CodeCode Available | 0 |
| Cos R-CNN for Online Few-shot Object Detection | Jul 25, 2023 | Few-Shot Object DetectionObject | —Unverified | 0 |
| Optical Flow boosts Unsupervised Localization and Segmentation | Jul 25, 2023 | Lifelong learningObject | CodeCode Available | 1 |
| Learning Transferable Object-Centric Diffeomorphic Transformations for Data Augmentation in Medical Image Segmentation | Jul 25, 2023 | Data AugmentationImage Segmentation | —Unverified | 0 |
| 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding | Jul 25, 2023 | 3D visual groundingObject | —Unverified | 0 |
| Spectrum-guided Multi-granularity Referring Video Object Segmentation | Jul 25, 2023 | ObjectReferring Expression Segmentation | CodeCode Available | 1 |
| RecursiveDet: End-to-End Region-based Recursive Object Detection | Jul 25, 2023 | DecoderObject | CodeCode Available | 1 |
| Described Object Detection: Liberating Object Detection with Flexible Expressions | Jul 24, 2023 | Binary ClassificationDescribed Object Detection | CodeCode Available | 1 |
| COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts | Jul 24, 2023 | Autonomous DrivingObject | CodeCode Available | 2 |
| TransNet: Transparent Object Manipulation Through Category-Level Pose Estimation | Jul 23, 2023 | Depth CompletionObject | —Unverified | 0 |
| Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation | Jul 23, 2023 | Instance SegmentationObject | CodeCode Available | 1 |
| LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction | Jul 23, 2023 | 3D ReconstructionObject | CodeCode Available | 0 |
| Towards Generic and Controllable Attacks Against Object Detection | Jul 23, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Challenges for Monocular 6D Object Pose Estimation in Robotics | Jul 22, 2023 | 6D Pose Estimation using RGBObject | —Unverified | 0 |
| Spatial Self-Distillation for Object Detection with Inaccurate Bounding Boxes | Jul 22, 2023 | Multiple Instance LearningObject | —Unverified | 0 |
| Leveraging Knowledge Graphs for Zero-Shot Object-agnostic State Classification | Jul 22, 2023 | AttributeClassification | —Unverified | 0 |
| KVN: Keypoints Voting Network with Differentiable RANSAC for Stereo Pose Estimation | Jul 21, 2023 | ObjectPose Estimation | CodeCode Available | 1 |
| YOLOPose V2: Understanding and Improving Transformer-based 6D Pose Estimation | Jul 21, 2023 | 6D Pose Estimation6D Pose Estimation using RGB | —Unverified | 0 |
| R2Det: Redemption from Range-view for Accurate 3D Object Detection | Jul 21, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Deep Directly-Trained Spiking Neural Networks for Object Detection | Jul 21, 2023 | Objectobject-detection | CodeCode Available | 1 |
| A novel integrated method of detection-grasping for specific object based on the box coordinate matching | Jul 20, 2023 | Instance SegmentationObject | —Unverified | 0 |
| SCA-PVNet: Self-and-Cross Attention Based Aggregation of Point Cloud and Multi-View for 3D Object Retrieval | Jul 20, 2023 | 3D Object RetrievalObject | —Unverified | 0 |
| CNOS: A Strong Baseline for CAD-based Novel Object Segmentation | Jul 20, 2023 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| Improving Online Lane Graph Extraction by Object-Lane Clustering | Jul 20, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Cascade-DETR: Delving into High-Quality Universal Object Detection | Jul 20, 2023 | DecoderObject | CodeCode Available | 1 |
| PE-YOLO: Pyramid Enhancement Network for Dark Object Detection | Jul 20, 2023 | Objectobject-detection | CodeCode Available | 1 |
| OBJECT 3DIT: Language-guided 3D-aware Image Editing | Jul 20, 2023 | 3D geometryObject | CodeCode Available | 1 |
| Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection | Jul 19, 2023 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Online Continual Learning for Robust Indoor Object Recognition | Jul 19, 2023 | Continual LearningObject | —Unverified | 0 |
| Divert More Attention to Vision-Language Object Tracking | Jul 19, 2023 | AttributeObject | —Unverified | 0 |
| Generative Prompt Model for Weakly Supervised Object Localization | Jul 19, 2023 | DenoisingImage Denoising | CodeCode Available | 1 |