| GRIP: Generating Interaction Poses Using Spatial Cues and Latent Consistency | Aug 22, 2023 | Mixed RealityObject | —Unverified | 0 |
| Affordance segmentation of hand-occluded containers from exocentric images | Aug 22, 2023 | Mixed RealityObject | CodeCode Available | 0 |
| Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views | Aug 22, 2023 | NeRFNeural Rendering | —Unverified | 0 |
| LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training | Aug 22, 2023 | ObjectObject Discovery | CodeCode Available | 0 |
| Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization | Aug 22, 2023 | ClusteringNovel View Synthesis | —Unverified | 0 |
| Video OWL-ViT: Temporally-consistent open-world localization in video | Aug 22, 2023 | DecoderObject | —Unverified | 0 |
| Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection | Aug 22, 2023 | Objectobject-detection | CodeCode Available | 0 |
| Delving into Motion-Aware Matching for Monocular 3D Object Tracking | Aug 22, 2023 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 1 |
| Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding | Aug 22, 2023 | Contrastive LearningObject | —Unverified | 0 |
| Multi-Modal Dataset Acquisition for Photometrically Challenging Object | Aug 21, 2023 | Object | —Unverified | 0 |
| Spatial Transform Decoupling for Oriented Object Detection | Aug 21, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Representation Disparity-aware Distillation for 3D Object Detection | Aug 20, 2023 | 3D Object DetectionKnowledge Distillation | —Unverified | 0 |
| ThermRad: A Multi-modal Dataset for Robust 3D Object Detection under Challenging Conditions | Aug 20, 2023 | 3D Object DetectionObject | —Unverified | 0 |
| HODN: Disentangling Human-Object Feature for HOI Detection | Aug 20, 2023 | DecoderHuman Detection | —Unverified | 0 |
| Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos | Aug 19, 2023 | ObjectObject Discovery | CodeCode Available | 0 |
| Scalable Video Object Segmentation with Simplified Framework | Aug 19, 2023 | ObjectSemantic Segmentation | —Unverified | 0 |
| VI-Net: Boosting Category-level 6D Object Pose Estimation via Learning Decoupled Rotations on the Spherical Representations | Aug 19, 2023 | 6D Pose Estimation using RGBBenchmarking | CodeCode Available | 1 |
| Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos | Aug 19, 2023 | 3D ReconstructionObject | —Unverified | 0 |
| DiffusionTrack: Diffusion Model For Multi-Object Tracking | Aug 19, 2023 | Denoisingmodel | CodeCode Available | 2 |
| DESOBAv2: Towards Large-scale Real-world Dataset for Shadow Generation | Aug 19, 2023 | ObjectShadow Detection | CodeCode Available | 1 |
| LEGO: Learning and Graph-Optimized Modular Tracker for Online Multi-Object Tracking with Point Clouds | Aug 19, 2023 | Multi-Object TrackingMultiple Object Tracking | —Unverified | 0 |
| SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos | Aug 18, 2023 | 3D Object DetectionObject | CodeCode Available | 2 |
| Label-Free Event-based Object Recognition via Joint Learning with Image Reconstruction from Events | Aug 18, 2023 | Image ReconstructionObject | CodeCode Available | 1 |
| MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection | Aug 18, 2023 | 3D geometry3D Object Detection | CodeCode Available | 1 |
| Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning | Aug 18, 2023 | Contrastive LearningImitation Learning | CodeCode Available | 1 |
| Deep Equilibrium Object Detection | Aug 18, 2023 | DecoderObject | CodeCode Available | 1 |
| A Fusion of Variational Distribution Priors and Saliency Map Replay for Continual 3D Reconstruction | Aug 17, 2023 | 3D ReconstructionContinual Learning | —Unverified | 0 |
| Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation | Aug 17, 2023 | Edge-computingInstance Segmentation | —Unverified | 0 |
| BOTT: Box Only Transformer Tracker for 3D Object Tracking | Aug 17, 2023 | 3D Object TrackingAutonomous Driving | —Unverified | 0 |
| Frequency Perception Network for Camouflaged Object Detection | Aug 17, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Semantic Information for Object Detection | Aug 17, 2023 | Objectobject-detection | —Unverified | 0 |
| MV-ROPE: Multi-view Constraints for Robust Category-level Object Pose and Size Estimation | Aug 17, 2023 | Depth EstimationObject | —Unverified | 0 |
| Agglomerative Transformer for Human-Object Interaction Detection | Aug 16, 2023 | ClusteringDecoder | —Unverified | 0 |
| Classification Committee for Active Deep Object Detection | Aug 16, 2023 | Active LearningClassification | —Unverified | 0 |
| Improving Audio-Visual Segmentation with Bidirectional Generation | Aug 16, 2023 | Motion EstimationObject | CodeCode Available | 0 |
| Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos | Aug 16, 2023 | Action AnticipationActive Object Localization | —Unverified | 0 |
| Diagnosing Human-object Interaction Detectors | Aug 16, 2023 | ClassificationHuman-Object Interaction Detection | CodeCode Available | 1 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 |
| Exploiting Sparsity in Automotive Radar Object Detection Networks | Aug 15, 2023 | Autonomous DrivingObject | —Unverified | 0 |
| Identity-Consistent Aggregation for Video Object Detection | Aug 15, 2023 | Objectobject-detection | CodeCode Available | 0 |
| Learning Better Keypoints for Multi-Object 6DoF Pose Estimation | Aug 15, 2023 | AllObject | CodeCode Available | 0 |
| ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces | Aug 15, 2023 | 3D ReconstructionMulti-View 3D Reconstruction | CodeCode Available | 1 |
| Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Aug 15, 2023 | DecoderObject | CodeCode Available | 1 |
| Grasp Transfer based on Self-Aligning Implicit Representations of Local Surfaces | Aug 15, 2023 | Object | —Unverified | 0 |
| Improved Region Proposal Network for Enhanced Few-Shot Object Detection | Aug 15, 2023 | Few-Shot Object DetectionObject | CodeCode Available | 1 |
| FOLT: Fast Multiple Object Tracking from UAV-captured Videos Based on Optical Flow | Aug 14, 2023 | motion predictionMultiple Object Tracking | —Unverified | 0 |
| Space Object Identification and Classification from Hyperspectral Material Analysis | Aug 14, 2023 | ClassificationMaterial Classification | —Unverified | 0 |
| HHTrack: Hyperspectral Object Tracking Using Hybrid Attention | Aug 14, 2023 | ObjectObject Tracking | —Unverified | 0 |
| A One Stop 3D Target Reconstruction and multilevel Segmentation Method | Aug 14, 2023 | 3D Object Reconstruction3D Reconstruction | CodeCode Available | 1 |
| PatchContrast: Self-Supervised Pre-training for 3D Object Detection | Aug 14, 2023 | 3D Object DetectionAutonomous Vehicles | —Unverified | 0 |