| NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models | Jul 16, 2024 | 6D Pose Estimation using RGB, Novel View Synthesis | Code Available | 1 |
| MaskVD: Region Masking for Efficient Video Object Detection | Jul 16, 2024 | Object, object-detection | —Unverified | 0 |
| Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Jul 16, 2024 | Object, object-detection | —Unverified | 0 |
| VISA: Reasoning Video Object Segmentation via Large Language Models | Jul 16, 2024 | Decoder, Object | Code Available | 3 |
| AccDiffusion: An Accurate Method for Higher-Resolution Image Generation | Jul 15, 2024 | Image Generation, Object | Code Available | 2 |
| Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Jul 15, 2024 | Language Modelling, Large Language Model | Code Available | 0 |
| Anticipating Future Object Compositions without Forgetting | Jul 15, 2024 | Attribute, Compositional Zero-Shot Learning | —Unverified | 0 |
| InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models | Jul 15, 2024 | Object, Video Editing | —Unverified | 0 |
| OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jul 15, 2024 | 3D Object Detection, Depth Estimation | Code Available | 2 |
| Plain-Det: A Plain Multi-Dataset Object Detector | Jul 14, 2024 | Object, object-detection | Code Available | 1 |
| Part2Object: Hierarchical Unsupervised 3D Instance Segmentation | Jul 14, 2024 | 3D Instance Segmentation, Clustering | Code Available | 1 |
| CLOVER: Context-aware Long-term Object Viewpoint- and Environment- Invariant Representation Learning | Jul 12, 2024 | Foreground Segmentation, Object | —Unverified | 0 |
| 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences | Jul 12, 2024 | Object, Segmentation | —Unverified | 0 |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Jul 12, 2024 | Collaborative Inference, Language Modelling | Code Available | 1 |
| KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting | Jul 12, 2024 | 6D Pose Estimation, Graph Embedding | —Unverified | 0 |
| DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects | Jul 12, 2024 | Multi-Object Tracking, Object | Code Available | 1 |
| Introducing VaDA: Novel Image Segmentation Model for Maritime Object Segmentation Using New Dataset | Jul 12, 2024 | Autonomous Navigation, Image Segmentation | —Unverified | 0 |
| StyleSplat: 3D Object Style Transfer with Gaussian Splatting | Jul 12, 2024 | Object, Style Transfer | —Unverified | 0 |
| Textual Query-Driven Mask Transformer for Domain Generalized Segmentation | Jul 12, 2024 | Domain Generalization, Object | Code Available | 1 |
| DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training | Jul 12, 2024 | Image Generation, Object | Code Available | 1 |
| Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets | Jul 11, 2024 | Multi-Object Tracking, Object | Code Available | 1 |
| OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects | Jul 11, 2024 | Object, Prediction | —Unverified | 0 |
| Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer | Jul 11, 2024 | Data Augmentation, Object | —Unverified | 0 |
| SRPose: Two-view Relative Pose Estimation with Sparse Keypoints | Jul 11, 2024 | Object, Pose Estimation | Code Available | 1 |
| HACMan++: Spatially-Grounded Motion Primitives for Manipulation | Jul 11, 2024 | Object, Robot Manipulation | —Unverified | 0 |
| Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets | Jul 11, 2024 | Image Generation, Informativeness | Code Available | 0 |
| Bayesian Detector Combination for Object Detection with Crowdsourced Annotations | Jul 10, 2024 | Object, object-detection | Code Available | 0 |
| Vegetable Peeling: A Case Study in Constrained Dexterous Manipulation | Jul 10, 2024 | Object | —Unverified | 0 |
| ActionVOS: Actions as Prompts for Video Object Segmentation | Jul 10, 2024 | Object, Referring Video Object Segmentation | Code Available | 1 |
| InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior | Jul 10, 2024 | Benchmarking, Decoder | Code Available | 2 |
| LSM: A Comprehensive Metric for Assessing the Safety of Lane Detection Systems in Autonomous Driving | Jul 10, 2024 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 |
| Learning Spatial-Semantic Features for Robust Video Object Segmentation | Jul 10, 2024 | Object, Semantic Segmentation | —Unverified | 0 |
| Sketch-Guided Scene Image Generation | Jul 9, 2024 | Image Generation, Object | —Unverified | 0 |
| Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Jul 9, 2024 | Decoder, Object | —Unverified | 0 |
| Universal Multi-view Black-box Attack against Object Detectors via Layout Optimization | Jul 9, 2024 | Object | —Unverified | 0 |
| Self-supervised visual learning from interactions with objects | Jul 9, 2024 | Object, Representation Learning | Code Available | 0 |
| Cue Point Estimation using Object Detection | Jul 9, 2024 | Object, object-detection | Code Available | 1 |
| Rethinking Image-to-Video Adaptation: An Object-centric Perspective | Jul 9, 2024 | Action Recognition, Object | —Unverified | 0 |
| GeoWATCH for Detecting Heavy Construction in Heterogeneous Time Series of Satellite Images | Jul 8, 2024 | Activity Recognition, image-classification | —Unverified | 0 |
| TARGO: Benchmarking Target-driven Object Grasping under Occlusions | Jul 8, 2024 | Benchmarking, Object | —Unverified | 0 |
| Submodular video object proposal selection for semantic object segmentation | Jul 8, 2024 | Object, Segmentation | —Unverified | 0 |
| Weakly Supervised Test-Time Domain Adaptation for Object Detection | Jul 8, 2024 | Domain Adaptation, Object | Code Available | 0 |
| Context Propagation from Proposals for Semantic Video Object Segmentation | Jul 8, 2024 | Object, Segmentation | —Unverified | 0 |
| Boosting 3D Object Detection with Semantic-Aware Multi-Branch Framework | Jul 8, 2024 | 3D Object Detection, Autonomous Driving | —Unverified | 0 |
| Towards Reflected Object Detection: A Benchmark | Jul 8, 2024 | Object, object-detection | —Unverified | 0 |
| CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation | Jul 8, 2024 | Decoder, Object | Code Available | 1 |
| Short-term Object Interaction Anticipation with Disentangled Object Detection @ Ego4D Short Term Object Interaction Anticipation Challenge | Jul 8, 2024 | Object, object-detection | Code Available | 0 |
| Addressing single object tracking in satellite imagery through prompt-engineered solutions | Jul 7, 2024 | Object, Object Tracking | —Unverified | 0 |
| Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image | Jul 7, 2024 | 3D Object Detection, Object | —Unverified | 0 |
| Zero-shot Object Counting with Good Exemplars | Jul 6, 2024 | Contrastive Learning, Object | Code Available | 1 |