| Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding | Sep 1, 2023 | Graph GenerationImage Captioning | CodeCode Available | 0 |
| What Makes Good Open-Vocabulary Detector: A Disassembling Perspective | Sep 1, 2023 | Objectobject-detection | —Unverified | 0 |
| SoccerNet 2023 Tracking Challenge -- 3rd place MOT4MOT Team Technical Report | Aug 31, 2023 | Multi-Object TrackingObject | —Unverified | 0 |
| MS23D: A 3D Object Detection Method Using Multi-Scale Semantic Feature Points to Construct 3D Feature Layer | Aug 31, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects | Aug 31, 2023 | 6D Pose EstimationObject | —Unverified | 0 |
| Fusing Pseudo Labels with Weak Supervision for Dynamic Traffic Scenarios | Aug 30, 2023 | Decision MakingObject | —Unverified | 0 |
| WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model | Aug 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection | Aug 30, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object Tracking | Aug 30, 2023 | Multi-Object TrackingObject | —Unverified | 0 |
| Ego-Motion Estimation and Dynamic Motion Separation from 3D Point Clouds for Accumulating Data and Improving 3D Object Detection | Aug 29, 2023 | 3D Object DetectionMotion Estimation | —Unverified | 0 |
| Group Regression for Query Based Object Detection and Tracking | Aug 28, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| The Interstate-24 3D Dataset: a new benchmark for 3D multi-camera vehicle tracking | Aug 28, 2023 | 3D Object TrackingObject | —Unverified | 0 |
| Modeling infant object perception as program induction | Aug 28, 2023 | AttributeInductive Learning | —Unverified | 0 |
| RobustCLEVR: A Benchmark and Framework for Evaluating Robustness in Object-centric Learning | Aug 28, 2023 | Image GenerationObject | —Unverified | 0 |
| Improving the performance of object detection by preserving label distribution | Aug 28, 2023 | Objectobject-detection | CodeCode Available | 0 |
| Nonrigid Object Contact Estimation With Regional Unwrapping Transformer | Aug 27, 2023 | Object | —Unverified | 0 |
| Image Coding for Machines with Object Region Learning | Aug 27, 2023 | Image CompressionObject | —Unverified | 0 |
| Joint Gaze-Location and Gaze-Object Detection | Aug 26, 2023 | Objectobject-detection | —Unverified | 0 |
| SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data | Aug 24, 2023 | ObjectRelation | CodeCode Available | 0 |
| On Offline Evaluation of 3D Object Detection for Autonomous Driving | Aug 24, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Perspective-aware Convolution for Monocular 3D Object Detection | Aug 24, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| I3DOD: Towards Incremental 3D Object Detection via Prompting | Aug 24, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Data-Side Efficiencies for Lightweight Convolutional Neural Networks | Aug 24, 2023 | image-classificationImage Classification | —Unverified | 0 |
| ROAM: Robust and Object-Aware Motion Generation Using Neural Pose Descriptors | Aug 24, 2023 | Motion GenerationMotion Synthesis | —Unverified | 0 |
| Computational models of object motion detectors accelerated using FPGA technology | Aug 23, 2023 | Motion DetectionObject | —Unverified | 0 |
| CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images | Aug 23, 2023 | Common Sense ReasoningDiversity | —Unverified | 0 |
| Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields | Aug 23, 2023 | NeRFObject | —Unverified | 0 |
| GRIP: Generating Interaction Poses Using Spatial Cues and Latent Consistency | Aug 22, 2023 | Mixed RealityObject | —Unverified | 0 |
| TrackFlow: Multi-Object Tracking with Normalizing Flows | Aug 22, 2023 | Multi-Object TrackingObject | —Unverified | 0 |
| Opening the Vocabulary of Egocentric Actions | Aug 22, 2023 | Action RecognitionObject | CodeCode Available | 0 |
| Affordance segmentation of hand-occluded containers from exocentric images | Aug 22, 2023 | Mixed RealityObject | CodeCode Available | 0 |
| LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training | Aug 22, 2023 | ObjectObject Discovery | CodeCode Available | 0 |
| Video OWL-ViT: Temporally-consistent open-world localization in video | Aug 22, 2023 | DecoderObject | —Unverified | 0 |
| Small Object Detection for Birds with Swin Transformer | Aug 22, 2023 | Objectobject-detection | —Unverified | 0 |
| Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection | Aug 22, 2023 | Objectobject-detection | CodeCode Available | 0 |
| Ensemble Fusion for Small Object Detection | Aug 22, 2023 | Objectobject-detection | —Unverified | 0 |
| Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization | Aug 22, 2023 | ClusteringNovel View Synthesis | —Unverified | 0 |
| Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views | Aug 22, 2023 | NeRFNeural Rendering | —Unverified | 0 |
| Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding | Aug 22, 2023 | Contrastive LearningObject | —Unverified | 0 |
| Multi-Modal Dataset Acquisition for Photometrically Challenging Object | Aug 21, 2023 | Object | —Unverified | 0 |
| ThermRad: A Multi-modal Dataset for Robust 3D Object Detection under Challenging Conditions | Aug 20, 2023 | 3D Object DetectionObject | —Unverified | 0 |
| Representation Disparity-aware Distillation for 3D Object Detection | Aug 20, 2023 | 3D Object DetectionKnowledge Distillation | —Unverified | 0 |
| HODN: Disentangling Human-Object Feature for HOI Detection | Aug 20, 2023 | DecoderHuman Detection | —Unverified | 0 |
| Scalable Video Object Segmentation with Simplified Framework | Aug 19, 2023 | ObjectSemantic Segmentation | —Unverified | 0 |
| Root Pose Decomposition Towards Generic Non-rigid 3D Reconstruction with Monocular Videos | Aug 19, 2023 | 3D ReconstructionObject | —Unverified | 0 |
| Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos | Aug 19, 2023 | ObjectObject Discovery | CodeCode Available | 0 |
| LEGO: Learning and Graph-Optimized Modular Tracker for Online Multi-Object Tracking with Point Clouds | Aug 19, 2023 | Multi-Object TrackingMultiple Object Tracking | —Unverified | 0 |
| A Fusion of Variational Distribution Priors and Saliency Map Replay for Continual 3D Reconstruction | Aug 17, 2023 | 3D ReconstructionContinual Learning | —Unverified | 0 |
| Semantic Information for Object Detection | Aug 17, 2023 | Objectobject-detection | —Unverified | 0 |
| MV-ROPE: Multi-view Constraints for Robust Category-level Object Pose and Size Estimation | Aug 17, 2023 | Depth EstimationObject | —Unverified | 0 |