| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Nov 24, 2023 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 |
| SegVol: Universal and Interactive Volumetric Medical Image Segmentation | Nov 22, 2023 | Computed Tomography (CT)Image Segmentation | CodeCode Available | 2 |
| Open-Vocabulary Camouflaged Object Segmentation | Nov 19, 2023 | Camouflaged Object SegmentationImage Segmentation | CodeCode Available | 2 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question AnsweringImage Captioning | CodeCode Available | 2 |
| Medical Image Segmentation with Domain Adaptation: A Survey | Nov 3, 2023 | Domain AdaptationImage Segmentation | CodeCode Available | 2 |
| SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images | Oct 23, 2023 | 3D ArchitectureImage Segmentation | CodeCode Available | 2 |
| Is Weakly-supervised Action Segmentation Ready For Human-Robot Interaction? No, Let's Improve It With Action-union Learning | Oct 22, 2023 | Action RecognitionAction Segmentation | CodeCode Available | 2 |
| You Only Look at Once for Real-time and Generic Multi-Task | Oct 2, 2023 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance | Sep 29, 2023 | Few-Shot LearningHeart Segmentation | CodeCode Available | 2 |