| Object recognition in primates: What can early visual areas contribute? | Jul 5, 2024 | FoveationObject | —Unverified | 0 |
| TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking | Jul 5, 2024 | Multi-Object TrackingObject | CodeCode Available | 0 |
| Towards Stable 3D Object Detection | Jul 5, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection | Jul 4, 2024 | Autonomous DrivingObject | CodeCode Available | 1 |
| Attention Normalization Impacts Cardinality Generalization in Slot Attention | Jul 4, 2024 | Image SegmentationObject | CodeCode Available | 0 |
| The Solution for the GAIIC2024 RGB-TIR object detection Challenge | Jul 4, 2024 | Objectobject-detection | —Unverified | 0 |
| FIPGNet:Pyramid grafting network with feature interaction strategies | Jul 4, 2024 | Objectobject-detection | —Unverified | 0 |
| TrackPGD: Efficient Adversarial Attack using Object Binary Masks against Robust Transformer Trackers | Jul 4, 2024 | Adversarial AttackAdversarial Robustness | CodeCode Available | 0 |
| Beyond Viewpoint: Robust 3D Object Recognition under Arbitrary Views through Joint Multi-Part Representation | Jul 4, 2024 | 3D Object RecognitionObject | —Unverified | 0 |
| Comics Datasets Framework: Mix of Comics datasets for detection benchmarking | Jul 3, 2024 | BenchmarkingObject | CodeCode Available | 1 |
| Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation | Jul 3, 2024 | ObjectObject Discovery | CodeCode Available | 1 |
| Visual Grounding with Attention-Driven Constraint Balancing | Jul 3, 2024 | Objectobject-detection | —Unverified | 0 |
| Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers | Jul 3, 2024 | AttributeObject | —Unverified | 0 |
| EgoFlowNet: Non-Rigid Scene Flow from Point Clouds with Ego-Motion Support | Jul 3, 2024 | ClusteringObject | —Unverified | 0 |
| Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking | Jul 3, 2024 | ObjectRepresentation Learning | —Unverified | 0 |
| HOIMotion: Forecasting Human Motion During Human-Object Interactions Using Egocentric 3D Object Bounding Boxes | Jul 2, 2024 | DecoderHuman-Object Interaction Detection | —Unverified | 0 |
| Magic Insert: Style-Aware Drag-and-Drop | Jul 2, 2024 | Domain AdaptationObject | —Unverified | 0 |
| Similarity Distance-Based Label Assignment for Tiny Object Detection | Jul 2, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Scarecrow monitoring system:employing mobilenet ssd for enhanced animal supervision | Jul 1, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Formal Verification of Deep Neural Networks for Object Detection | Jul 1, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Grouped Discrete Representation Guides Object-Centric Learning | Jul 1, 2024 | AttributeObject | —Unverified | 0 |
| SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection | Jul 1, 2024 | Objectobject-detection | CodeCode Available | 2 |
| SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | Jul 1, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics | Jun 30, 2024 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| DroBoost: An Intelligent Score and Model Boosting Method for Drone Detection | Jun 30, 2024 | Objectobject-detection | —Unverified | 0 |
| Object Space is Embodied | Jun 28, 2024 | Object | —Unverified | 0 |
| PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators | Jun 28, 2024 | DecoderObject | —Unverified | 0 |
| EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Jun 28, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking | Jun 28, 2024 | Multi-Object TrackingObject | —Unverified | 0 |
| ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data | Jun 27, 2024 | Contact-rich ManipulationObject | —Unverified | 0 |
| Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results | Jun 27, 2024 | Objectobject-detection | CodeCode Available | 0 |
| CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement | Jun 27, 2024 | Human-Object Interaction DetectionHuman-Object Interaction Generation | CodeCode Available | 2 |
| HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection | Jun 27, 2024 | Objectobject-detection | CodeCode Available | 0 |
| 3D Feature Distillation with Object-Centric Priors | Jun 26, 2024 | 3D Instance SegmentationInstance Segmentation | —Unverified | 0 |
| CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection | Jun 26, 2024 | 3D Object DetectionDomain Adaptation | —Unverified | 0 |
| SpY: A Context-Based Approach to Spacecraft Component Detection | Jun 26, 2024 | Objectobject-detection | —Unverified | 0 |
| Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models | Jun 26, 2024 | Collision AvoidanceHuman-Object Interaction Detection | —Unverified | 0 |
| Geometric Features Enhanced Human-Object Interaction Detection | Jun 26, 2024 | Human-Object Interaction DetectionObject | CodeCode Available | 0 |
| BiTrack: Bidirectional Offline 3D Multi-Object Tracking Using Camera-LiDAR Data | Jun 26, 2024 | 3D Multi-Object TrackingMulti-Object Tracking | CodeCode Available | 1 |
| Vision Controlled Sensorized Prosthetic Hand | Jun 25, 2024 | Object | CodeCode Available | 0 |
| Pixel-weighted Multi-pose Fusion for Metal Artifact Reduction in X-ray Computed Tomography | Jun 25, 2024 | Computed Tomography (CT)Metal Artifact Reduction | —Unverified | 0 |
| Uncertainty for SVBRDF Acquisition using Frequency Analysis | Jun 25, 2024 | Inverse RenderingObject | CodeCode Available | 1 |
| Human-Object Interaction from Human-Level Instructions | Jun 25, 2024 | Common Sense ReasoningHuman-Object Interaction Detection | —Unverified | 0 |
| ET tu, CLIP? Addressing Common Object Errors for Unseen Environments | Jun 25, 2024 | Objectobject-detection | —Unverified | 0 |
| Towards Open-set Camera 3D Object Detection | Jun 25, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning | Jun 25, 2024 | ObjectObject Recognition | CodeCode Available | 2 |
| OCALM: Object-Centric Assessment with Language Models | Jun 24, 2024 | ObjectReinforcement Learning (RL) | —Unverified | 0 |
| Exploring Test-Time Adaptation for Object Detection in Continually Changing Environments | Jun 24, 2024 | Contrastive LearningObject | —Unverified | 0 |
| Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Jun 24, 2024 | Common Sense ReasoningHallucination | CodeCode Available | 1 |
| High-resolution open-vocabulary object 6D pose estimation | Jun 24, 2024 | 6D Pose EstimationObject | —Unverified | 0 |