| Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs | Aug 5, 2024 | Domain AdaptationSegmentation | CodeCode Available | 0 |
| A Two-Stage Progressive Pre-training using Multi-Modal Contrastive Masked Autoencoders | Aug 5, 2024 | Contrastive LearningDenoising | —Unverified | 0 |
| View-consistent Object Removal in Radiance Fields | Aug 4, 2024 | Image InpaintingObject | —Unverified | 0 |
| PanicleNeRF: low-cost, high-precision in-field phenotypingof rice panicles with smartphone | Aug 4, 2024 | 3D ReconstructionImage Segmentation | —Unverified | 0 |
| Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation | Aug 4, 2024 | Domain AdaptationObject | CodeCode Available | 0 |
| Challenge Summary U-MedSAM: Uncertainty-aware MedSAM for Medical Image Segmentation | Aug 3, 2024 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| MedUHIP: Towards Human-In-the-Loop Medical Segmentation | Aug 3, 2024 | Image SegmentationMedical Image Segmentation | —Unverified | 0 |
| NuLite -- Lightweight and Fast Model for Nuclei Instance Segmentation and Classification | Aug 3, 2024 | Cell DetectionCell Segmentation | CodeCode Available | 1 |
| LAM3D: Leveraging Attention for Monocular 3D Object Detection | Aug 3, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Bayesian Active Learning for Semantic Segmentation | Aug 3, 2024 | Active LearningSemantic Segmentation | —Unverified | 0 |
| A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection | Aug 3, 2024 | Semantic Segmentation | —Unverified | 0 |
| Music2P: A Multi-Modal AI-Driven Tool for Simplifying Album Cover Design | Aug 3, 2024 | Code GenerationImage Segmentation | CodeCode Available | 0 |
| Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation | Aug 3, 2024 | Autonomous Vehiclesgraph construction | —Unverified | 0 |
| SHARP-Net: A Refined Pyramid Network for Deficiency Segmentation in Culverts and Sewer Pipes | Aug 2, 2024 | Semantic Segmentation | —Unverified | 0 |
| Multi-Unit Floor Plan Recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans | Aug 2, 2024 | Semantic Segmentation | —Unverified | 0 |
| Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation | Aug 2, 2024 | class-incremental learningClass Incremental Learning | —Unverified | 0 |
| StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation | Aug 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| A Robotics-Inspired Scanpath Model Reveals the Importance of Uncertainty and Semantic Object Cues for Gaze Guidance in Dynamic Scenes | Aug 2, 2024 | FoveationObject | CodeCode Available | 0 |
| Amodal Segmentation for Laparoscopic Surgery Video Instruments | Aug 2, 2024 | Instance SegmentationSegmentation | —Unverified | 0 |
| Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model | Aug 2, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Aug 2, 2024 | cross-modal alignmentMultiple Object Tracking | CodeCode Available | 2 |
| Extracting Object Heights From LiDAR & Aerial Imagery | Aug 2, 2024 | ObjectSemantic Segmentation | —Unverified | 0 |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Aug 1, 2024 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation | Aug 1, 2024 | 3D Semantic SegmentationDenoising | CodeCode Available | 0 |
| A Simple Background Augmentation Method for Object Detection with Diffusion Model | Aug 1, 2024 | Data AugmentationDiversity | —Unverified | 0 |