| AFPN: Asymptotic Feature Pyramid Network for Object Detection | Jun 28, 2023 | Object, object-detection | Code Available | 1 |
| High-Quality Unknown Object Instance Segmentation via Quadruple Boundary Error Refinement | Jun 28, 2023 | Instance Segmentation, Object | Code Available | 1 |
| C^2Former: Calibrated and Complementary Transformer for RGB-Infrared Object Detection | Jun 28, 2023 | 2D Object Detection, Multispectral Object Detection | Code Available | 1 |
| CST-YOLO: A Novel Method for Blood Cell Detection Based on Improved YOLOv7 and CNN-Swin Transformer | Jun 26, 2023 | 2D Object Detection, Blood Cell Detection | Code Available | 1 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Shape-Constraint Recurrent Flow for 6D Object Pose Estimation | Jun 23, 2023 | 6D Pose Estimation using RGB, Object | Code Available | 1 |
| Iterative Scale-Up ExpansionIoU and Deep Features Association for Multi-Object Tracking in Sports | Jun 22, 2023 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 1 |
| CrossKD: Cross-Head Knowledge Distillation for Object Detection | Jun 20, 2023 | Dense Object Detection, Knowledge Distillation | Code Available | 1 |
| Multi-view 3D Object Reconstruction and Uncertainty Modelling with Neural Shape Prior | Jun 17, 2023 | 3D Object Reconstruction, Object | Code Available | 1 |
| CAD-Estate: Large-scale CAD Model Annotation in RGB Videos | Jun 15, 2023 | 3D Object Reconstruction, Object | Code Available | 1 |
| OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments | Jun 14, 2023 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| Object Detection in Hyperspectral Image via Unified Spectral-Spatial Feature Aggregation | Jun 14, 2023 | Object, object-detection | Code Available | 1 |
| Multiclass Confidence and Localization Calibration for Object Detection | Jun 14, 2023 | Object, object-detection | Code Available | 1 |
| Predict to Detect: Prediction-guided 3D Object Detection using Sequential Images | Jun 14, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Referring Camouflaged Object Detection | Jun 13, 2023 | Object, object-detection | Code Available | 1 |
| Revisiting Token Pruning for Object Detection and Instance Segmentation | Jun 12, 2023 | image-classification, Image Classification | Code Available | 1 |
| Compositor: Bottom-up Clustering and Compositing for Robust Part and Object Segmentation | Jun 12, 2023 | Clustering, Object | Code Available | 1 |
| On the Efficacy of 3D Point Cloud Reinforcement Learning | Jun 11, 2023 | 3D Point Cloud Reinforcement Learning, Inductive Bias | Code Available | 1 |
| EventCLIP: Adapting CLIP for Event-based Object Recognition | Jun 10, 2023 | Few-Shot Learning, Object | Code Available | 1 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | Object, Position | Code Available | 1 |
| TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses | Jun 9, 2023 | 3D Multi-Object Tracking, 3D Object Tracking | Code Available | 1 |
| Multi-Modal Classifiers for Open-Vocabulary Object Detection | Jun 8, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities | Jun 7, 2023 | Object, Object Discovery | Code Available | 1 |
| Object Detection with Transformers: A Review | Jun 7, 2023 | 2D Object Detection, Object | Code Available | 1 |
| Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection | Jun 6, 2023 | Object, object-detection | Code Available | 1 |
| Human-Object Interaction Prediction in Videos through Gaze Following | Jun 6, 2023 | Human-Object Interaction Anticipation, Human-Object Interaction Detection | Code Available | 1 |
| Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation | Jun 6, 2023 | Object, Video Generation | Code Available | 1 |
| MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences | Jun 5, 2023 | 3D Object Detection, Motion Forecasting | Code Available | 1 |
| Towards Better Explanations for Object Detection | Jun 5, 2023 | Object, object-detection | Code Available | 1 |
| Reassembling Broken Objects using Breaking Curves | Jun 5, 2023 | Object | Code Available | 1 |
| Cross-Drone Transformer Network for Robust Single Object Tracking | Jun 5, 2023 | Object, Object Tracking | Code Available | 1 |
| Detector Guidance for Multi-Object Text-to-Image Generation | Jun 4, 2023 | Image Generation, Object | Code Available | 1 |
| Open-world Text-specified Object Counting | Jun 2, 2023 | Decoder, Object | Code Available | 1 |
| AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation | Jun 1, 2023 | Binary Classification, Interactive Segmentation | Code Available | 1 |
| Object pop-up: Can we infer 3D objects and their poses from human interactions alone? | Jun 1, 2023 | Object | Code Available | 1 |
| Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis | May 31, 2023 | Image Generation, Object | Code Available | 1 |
| Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast | May 31, 2023 | 3D Instance Segmentation, 3D Object Detection | Code Available | 1 |
| CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models | May 29, 2023 | Denoising, Object | Code Available | 1 |
| PaLI-X: On Scaling up a Multilingual Vision and Language Model | May 29, 2023 | Chart Question Answering, document understanding | Code Available | 1 |
| Lighting and Rotation Invariant Real-time Vehicle Wheel Detector based on YOLOv5 | May 28, 2023 | Object, object-detection | Code Available | 1 |
| SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | May 26, 2023 | cross-modal alignment, Object | Code Available | 1 |
| Learning Occupancy for Monocular 3D Object Detection | May 25, 2023 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 1 |
| Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3) | May 25, 2023 | 6D Pose Estimation using RGB, Computational Efficiency | Code Available | 1 |
| CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion | May 25, 2023 | Diversity, Object | Code Available | 1 |
| Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation | May 25, 2023 | Object, Referring Expression Segmentation | Code Available | 1 |
| POPE: 6-DoF Promptable Pose Estimation of Any Object, in Any Scene, with One Reference | May 25, 2023 | 3D geometry, Object | Code Available | 1 |
| NAP: Neural 3D Articulation Prior | May 25, 2023 | 3D Generation, Denoising | Code Available | 1 |
| Learning high-level visual representations from a child's perspective without strong inductive biases | May 24, 2023 | Object, Object Localization | Code Available | 1 |
| DC-Net: Divide-and-Conquer for Salient Object Detection | May 24, 2023 | Decoder, Object | Code Available | 1 |
| Text encoders bottleneck compositionality in contrastive vision-language models | May 24, 2023 | Attribute, Image Captioning | Code Available | 1 |