| DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection | Jul 18, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Jul 18, 2024 | DenoisingObject | —Unverified | 0 |
| OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction | Jul 18, 2024 | DecoderObject | CodeCode Available | 0 |
| Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Jul 17, 2024 | Image-text RetrievalObject | CodeCode Available | 0 |
| HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | Jul 17, 2024 | BenchmarkingHuman-Object Interaction Detection | —Unverified | 0 |
| NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Jul 17, 2024 | DescriptiveGrasp Generation | —Unverified | 0 |
| Strawberry detection and counting based on YOLOv7 pruning and information based tracking algorithm | Jul 17, 2024 | Multiple Object TrackingObject | —Unverified | 0 |
| Data-driven Verification of DNNs for Object Recognition | Jul 17, 2024 | Image SegmentationObject | —Unverified | 0 |
| MaskVD: Region Masking for Efficient Video Object Detection | Jul 16, 2024 | Objectobject-detection | —Unverified | 0 |
| Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Jul 16, 2024 | Objectobject-detection | —Unverified | 0 |
| InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models | Jul 15, 2024 | ObjectVideo Editing | —Unverified | 0 |
| Anticipating Future Object Compositions without Forgetting | Jul 15, 2024 | AttributeCompositional Zero-Shot Learning | —Unverified | 0 |
| Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Jul 15, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| Introducing VaDA: Novel Image Segmentation Model for Maritime Object Segmentation Using New Dataset | Jul 12, 2024 | Autonomous NavigationImage Segmentation | —Unverified | 0 |
| 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences | Jul 12, 2024 | ObjectSegmentation | —Unverified | 0 |
| CLOVER: Context-aware Long-term Object Viewpoint- and Environment- Invariant Representation Learning | Jul 12, 2024 | Foreground SegmentationObject | —Unverified | 0 |
| KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting | Jul 12, 2024 | 6D Pose EstimationGraph Embedding | —Unverified | 0 |
| StyleSplat: 3D Object Style Transfer with Gaussian Splatting | Jul 12, 2024 | ObjectStyle Transfer | —Unverified | 0 |
| Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets | Jul 11, 2024 | Image GenerationInformativeness | CodeCode Available | 0 |
| Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer | Jul 11, 2024 | Data AugmentationObject | —Unverified | 0 |
| OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects | Jul 11, 2024 | ObjectPrediction | —Unverified | 0 |
| HACMan++: Spatially-Grounded Motion Primitives for Manipulation | Jul 11, 2024 | ObjectRobot Manipulation | —Unverified | 0 |
| Learning Spatial-Semantic Features for Robust Video Object Segmentation | Jul 10, 2024 | ObjectSemantic Segmentation | —Unverified | 0 |
| Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation | Jul 10, 2024 | Object | —Unverified | 0 |
| Bayesian Detector Combination for Object Detection with Crowdsourced Annotations | Jul 10, 2024 | Objectobject-detection | CodeCode Available | 0 |
| LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving | Jul 10, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Self-supervised visual learning from interactions with objects | Jul 9, 2024 | ObjectRepresentation Learning | CodeCode Available | 0 |
| Sketch-Guided Scene Image Generation | Jul 9, 2024 | Image GenerationObject | —Unverified | 0 |
| Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Jul 9, 2024 | DecoderObject | —Unverified | 0 |
| Rethinking Image-to-Video Adaptation: An Object-centric Perspective | Jul 9, 2024 | Action RecognitionObject | —Unverified | 0 |
| Universal Multi-view Black-box Attack against Object Detectors via Layout Optimization | Jul 9, 2024 | Object | —Unverified | 0 |
| Context Propagation from Proposals for Semantic Video Object Segmentation | Jul 8, 2024 | ObjectSegmentation | —Unverified | 0 |
| Short-term Object Interaction Anticipation with Disentangled Object Detection @ Ego4D Short Term Object Interaction Anticipation Challenge | Jul 8, 2024 | Objectobject-detection | CodeCode Available | 0 |
| TARGO: Benchmarking Target-driven Object Grasping under Occlusions | Jul 8, 2024 | BenchmarkingObject | —Unverified | 0 |
| Towards Reflected Object Detection: A Benchmark | Jul 8, 2024 | Objectobject-detection | —Unverified | 0 |
| Submodular video object proposal selection for semantic object segmentation | Jul 8, 2024 | ObjectSegmentation | —Unverified | 0 |
| Weakly Supervised Test-Time Domain Adaptation for Object Detection | Jul 8, 2024 | Domain AdaptationObject | CodeCode Available | 0 |
| Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Jul 8, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images | Jul 8, 2024 | Activity Recognitionimage-classification | —Unverified | 0 |
| Addressing single object tracking in satellite imagery through prompt-engineered solutions | Jul 7, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image | Jul 7, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| TF-SASM: Training-free Spatial-aware Sparse Memory for Multi-object Tracking | Jul 5, 2024 | Multi-Object TrackingObject | CodeCode Available | 0 |
| Towards Stable 3D Object Detection | Jul 5, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Object recognition in primates: What can early visual areas contribute? | Jul 5, 2024 | FoveationObject | —Unverified | 0 |
| Attention Normalization Impacts Cardinality Generalization in Slot Attention | Jul 4, 2024 | Image SegmentationObject | CodeCode Available | 0 |
| FIPGNet:Pyramid grafting network with feature interaction strategies | Jul 4, 2024 | Objectobject-detection | —Unverified | 0 |
| TrackPGD: Efficient Adversarial Attack using Object Binary Masks against Robust Transformer Trackers | Jul 4, 2024 | Adversarial AttackAdversarial Robustness | CodeCode Available | 0 |
| The Solution for the GAIIC2024 RGB-TIR object detection Challenge | Jul 4, 2024 | Objectobject-detection | —Unverified | 0 |
| Beyond Viewpoint: Robust 3D Object Recognition under Arbitrary Views through Joint Multi-Part Representation | Jul 4, 2024 | 3D Object RecognitionObject | —Unverified | 0 |
| Visual Grounding with Attention-Driven Constraint Balancing | Jul 3, 2024 | Objectobject-detection | —Unverified | 0 |