Instance Segmentation

Instance segmentation is a computer vision task that identifies and separates the individual objects in an image: it delineates the boundary of each object and assigns each object a unique label. The goal is a pixel-wise segmentation map of the image in which each pixel is assigned to a specific object instance.
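As a rough illustration of the pixel-wise map described above, the sketch below merges per-object binary masks into a single instance map. The function name `masks_to_instance_map`, the convention that 0 means background, and the rule that later masks overwrite earlier ones where they overlap are all assumptions made for this example, not part of any particular benchmark or library.

```python
def masks_to_instance_map(masks):
    """Merge per-instance binary masks into one label map.

    masks: list of H x W nested lists with truthy values on object pixels.
    Returns an H x W nested list where 0 is background (assumed convention)
    and the i-th mask is written with the instance ID i + 1.
    Where masks overlap, the later mask wins (an arbitrary choice here).
    """
    height, width = len(masks[0]), len(masks[0][0])
    instance_map = [[0] * width for _ in range(height)]
    for instance_id, mask in enumerate(masks, start=1):
        for y in range(height):
            for x in range(width):
                if mask[y][x]:
                    instance_map[y][x] = instance_id
    return instance_map


# Two toy objects of the same class on a 4 x 4 image.
mask_a = [[1, 1, 0, 0],
          [1, 1, 0, 0],
          [0, 0, 0, 0],
          [0, 0, 0, 0]]
mask_b = [[0, 0, 0, 0],
          [0, 0, 0, 0],
          [0, 0, 1, 1],
          [0, 0, 1, 1]]

instance_map = masks_to_instance_map([mask_a, mask_b])
for row in instance_map:
    print(row)
```

Both objects would share one class label in a semantic segmentation map; here they receive distinct IDs (1 and 2), which is what separates instance segmentation from semantic segmentation.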

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 301-350 of 2262 papers

Title | Status | Hype
WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces | Code | 1
SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology | Code | 1
Towards accurate instance segmentation in large-scale LiDAR point clouds | Code | 1
MobileViG: Graph-Based Sparse Attention for Mobile Vision Applications | Code | 1
High-Quality Unknown Object Instance Segmentation via Quadruple Boundary Error Refinement | Code | 1
PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation | Code | 1
Segmentation and Tracking of Vegetable Plants by Exploiting Vegetable Shape Feature for Precision Spray of Agricultural Robots | Code | 1
Inter-Instance Similarity Modeling for Contrastive Learning | Code | 1
Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation | Code | 1
VISION Datasets: A Benchmark for Vision-based InduStrial InspectiON | Code | 1
Revisiting Token Pruning for Object Detection and Instance Segmentation | Code | 1
PhenoBench -- A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain | Code | 1
Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion | Code | 1
DFormer: Diffusion-guided Transformer for Universal Image Segmentation | Code | 1
DVIS: Decoupled Video Instance Segmentation Framework | Code | 1
Asymmetric Patch Sampling for Contrastive Learning | Code | 1
Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation | Code | 1
A Robust Feature Downsampling Module for Remote Sensing Visual Tasks | Code | 1
OpenVIS: Open-vocabulary Video Instance Segmentation | Code | 1
Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation | Code | 1
Explain Any Concept: Segment Anything Meets Concept-Based Explanation | Code | 1
A Comparative Evaluation of Deep Learning Techniques for Photovoltaic Panel Detection from Aerial Images | Code | 1
Thermal Bridges on Building Rooftops | Code | 1
FreePoint: Unsupervised Point Cloud Instance Segmentation | Code | 1
SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation | Code | 1
CLUSTSEG: Clustering for Universal Segmentation | Code | 1
LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation | Code | 1
RT-K-Net: Revisiting K-Net for Real-Time Panoptic Segmentation | Code | 1
MARS: Mask Attention Refinement with Sequential Quadtree Nodes for Car Damage Instance Segmentation | Code | 1
A Review of Panoptic Segmentation for Mobile Mapping Point Clouds | Code | 1
EDAPS: Enhanced Domain-Adaptive Panoptic Segmentation | Code | 1
Instance Segmentation in the Dark | Code | 1
Zero-shot Unsupervised Transfer Instance Segmentation | Code | 1
Fully Sparse Fusion for 3D Object Detection | Code | 1
AutoFocusFormer: Image Segmentation off the Grid | Code | 1
MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer | Code | 1
Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation Models | Code | 1
An Instance Segmentation Dataset of Yeast Cells in Microstructures | Code | 1
Improving Segmentation of Objects with Varying Sizes in Biomedical Images using Instance-wise and Center-of-Instance Segmentation Loss Function | Code | 1
Instance Neural Radiance Field | Code | 1
Towards Open-Vocabulary Video Instance Segmentation | Code | 1
FinnWoodlands Dataset | Code | 1
JacobiNeRF: NeRF Shaping with Mutual Information Gradients | Code | 1
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer | Code | 1
The Devil is in the Points: Weakly Semi-Supervised Instance Segmentation via Point-Guided Mask Representation | Code | 1
Hi4D: 4D Instance Segmentation of Close Human Interaction | Code | 1
BoxVIS: Video Instance Segmentation with Box Annotations | Code | 1
You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding | Code | 1
DoNet: Deep De-overlapping Network for Cytology Instance Segmentation | Code | 1
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H | AP50 | 80.8 | | Unverified
2 | ResNeSt-200 (multi-scale) | AP50 | 70.2 | | Unverified
3 | CenterMask + VoVNetV2-99 (multi-scale) | AP50 | 66.2 | | Unverified
4 | CenterMask + VoVNetV2-57 (single-scale) | AP50 | 60.8 | | Unverified
5 | Co-DETR | mask AP | 57.1 | | Unverified
6 | CBNetV2 (EVA02, single-scale) | mask AP | 56.1 | | Unverified
7 | ISDA (ResNet-50) | APL | 55.7 | | Unverified
8 | EVA | mask AP | 55.5 | | Unverified
9 | FD-SwinV2-G | mask AP | 55.4 | | Unverified
10 | Mask Frozen-DETR | mask AP | 55.3 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | InternImage-B | GFLOPs | 501 | | Unverified
2 | Co-DETR | mask AP | 56.6 | | Unverified
3 | ViT-CoMer-L (Mask RCNN, DINOv2) | mask AP | 55.9 | | Unverified
4 | InternImage-H | mask AP | 55.4 | | Unverified
5 | EVA | mask AP | 55 | | Unverified
6 | Mask Frozen-DETR | mask AP | 54.9 | | Unverified
7 | Mask DINO (SwinL, multi-scale) | mask AP | 54.5 | | Unverified
8 | GLEE-Pro | mask AP | 54.2 | | Unverified
9 | ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale) | mask AP | 54.2 | | Unverified
10 | SwinV2-G (HTC++) | mask AP | 53.7 | | Unverified