SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 501550 of 2262 papers

TitleStatusHype
Graph Relation Distillation for Efficient Biomedical Instance SegmentationCode1
Improving the Detection of Small Oriented Objects in Aerial ImagesCode1
PartSTAD: 2D-to-3D Part Segmentation Task AdaptationCode2
Multi-scale attention-based instance segmentation for measuring crystals with large size variation0
ENSTRECT: A Stage-based Approach to 2.5D Structural Damage DetectionCode0
ODIN: A Single Model for 2D and 3D SegmentationCode2
DIOD: Self-Distillation Meets Object DiscoveryCode1
Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Multi-Scale Aggregation and Anthropic Prior Knowledge0
Edge-Aware 3D Instance Segmentation Network with Intelligent Semantic Prior0
Mudslide: A Universal Nuclear Instance Segmentation Method0
FISBe: A Real-World Benchmark Dataset for Instance Segmentation of Long-Range Thin Filamentous Structures0
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes0
Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation0
Generalized Mask-aware IoU for Anchor Assignment for Real-time Instance Segmentation0
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language ModelCode4
Unsupervised Universal Image SegmentationCode2
Semantic-aware SAM for Point-Prompted Instance SegmentationCode1
DVIS++: Improved Decoupled Framework for Universal Video SegmentationCode1
Cached Transformers: Improving Transformers with Differentiable Memory CacheCode1
SoftCTM: Cell detection by soft instance segmentation and consideration of cell-tissue interactionCode1
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion ProcessCode2
The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and BenchmarkCode1
All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes0
Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical RepresentationCode1
SAI3D: Segment Any Instance in 3D Scenes0
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask GuidanceCode1
Semantic-Aware Autoregressive Image Modeling for Visual Representation LearningCode1
General Object Foundation Model for Images and Videos at ScaleCode3
CattleEyeView: A Multi-task Top-down View Cattle Dataset for Smarter Precision Livestock FarmingCode1
Offshore Wind Plant Instance Segmentation Using Sentinel-1 Time Series, GIS, and Semantic Segmentation Models0
SAM-guided Graph Cut for 3D Instance Segmentation0
Comparing YOLOv8 and Mask RCNN for object segmentation in complex orchard environments0
Automated Behavioral Analysis Using Instance SegmentationCode0
MWSIS: Multimodal Weakly Supervised Instance Segmentation with 2D Box Annotations for Autonomous DrivingCode0
MaxQ: Multi-Axis Query for N:M Sparsity NetworkCode1
DFGET: Displacement-Field Assisted Graph Energy Transmitter for Gland Instance Segmentation0
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance SegmentationCode1
ASF-YOLO: A Novel YOLO Model with Attentional Scale Sequence Fusion for Cell Instance SegmentationCode1
EipFormer: Emphasizing Instance Positions in 3D Instance Segmentation0
Immature Green Apple Detection and Sizing in Commercial Orchards using YOLOv8 and Shape Fitting Techniques0
VISAGE: Video Instance Segmentation with Appearance-Guided EnhancementCode1
Bottom-Up Instance Segmentation of Catheters for Chest X-Rays0
Uni3DL: Unified Model for 3D and Language Understanding0
PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood EstimationCode1
Panoptica -- instance-wise evaluation of 3D semantic and instance segmentation mapsCode1
A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing0
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors0
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment AnythingCode4
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language AlignmentCode3
CellMixer: Annotation-free Semantic Cell Segmentation of Heterogeneous Cell Populations0
Show:102550
← PrevPage 11 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified