SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 801850 of 2262 papers

TitleStatusHype
Instance-level Few-shot Learning with Class Hierarchy MiningCode0
An Instance Segmentation Dataset of Yeast Cells in MicrostructuresCode1
DynaMITe: Dynamic Query Bootstrapping for Multi-object Interactive Segmentation Transformer0
SpectFormer: Frequency and Attention is what you need in a Vision Transformer0
Improving Segmentation of Objects with Varying Sizes in Biomedical Images using Instance-wise and Center-of-Instance Segmentation Loss FunctionCode1
HaDR: Applying Domain Randomization for Generating Synthetic Multimodal Dataset for Hand Instance Segmentation in Cluttered Industrial EnvironmentsCode0
Instance Neural Radiance FieldCode1
DPPD: Deformable Polar Polygon Object Detection0
Towards Open-Vocabulary Video Instance SegmentationCode1
FinnWoodlands DatasetCode1
LIGHT: Joint Individual Building Extraction and Height Estimation from Satellite Images through a Unified Multitask Learning Network0
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene UnderstandingCode2
Video Instance Segmentation in an Open-WorldCode0
JacobiNeRF: NeRF Shaping with Mutual Information GradientsCode1
INoD: Injected Noise Discriminator for Self-Supervised Representation Learning in Agricultural Fields0
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision TransformerCode1
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation0
Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning0
MobileInst: Video Instance Segmentation on the Mobile0
Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations0
ARMBench: An Object-centric Benchmark Dataset for Robotic Manipulation0
AutoKary2022: A Large-Scale Densely Annotated Dataset for Chromosome Instance SegmentationCode0
Mask-Free Video Instance SegmentationCode2
OpenInst: A Simple Query-Based Method for Open-World Instance Segmentation0
The Devil is in the Points: Weakly Semi-Supervised Instance Segmentation via Point-Guided Mask RepresentationCode1
Hi4D: 4D Instance Segmentation of Close Human InteractionCode1
You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene UnderstandingCode1
BoxVIS: Video Instance Segmentation with Box AnnotationsCode1
UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes0
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging VideosCode1
SUDS: Scalable Urban Dynamic Scenes0
DoNet: Deep De-overlapping Network for Cytology Instance SegmentationCode1
A Simple and Generic Framework for Feature Distillation via Channel-wise Transformation0
Position-Guided Point Cloud Panoptic Segmentation TransformerCode1
Uncertainty Aware Active Learning for Reconfiguration of Pre-trained Deep Object-Detection Networks for New Target Domains0
On Domain-Specific Pre-Training for Effective Semantic Perception in Agricultural Robotics0
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images0
BoxSnake: Polygonal Instance Segmentation with Box SupervisionCode1
3D Mitochondria Instance Segmentation with Spatio-Temporal TransformersCode1
Weakly-Supervised Text Instance Segmentation0
Bimodal SegNet: Instance Segmentation Fusing Events and RGB Frames for Robotic GraspingCode0
Making Vision Transformers Efficient from A Token Sparsification ViewCode1
FastInst: A Simple Query-Based Model for Real-Time Instance SegmentationCode2
Panoptic One-Click Segmentation: Applied to Agricultural Data0
SIM: Semantic-aware Instance Mask Generation for Box-Supervised Instance SegmentationCode1
A Simple Framework for Open-Vocabulary Segmentation and DetectionCode3
DynaMask: Dynamic Mask Selection for Instance SegmentationCode1
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale AttentionCode2
Three Guidelines You Should Know for Universally Slimmable Self-Supervised LearningCode1
OSIS: Efficient One-stage Network for 3D Instance Segmentation0
Show:102550
← PrevPage 17 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified