SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 751800 of 2262 papers

TitleStatusHype
Asymmetric Patch Sampling for Contrastive LearningCode1
OAFormer: Learning Occlusion Distinguishable Feature for Amodal Instance Segmentation0
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model0
A Robust Feature Downsampling Module for Remote Sensing Visual TasksCode1
Hiera: A Hierarchical Vision Transformer without the Bells-and-WhistlesCode0
ConES: Concept Embedding Search for Parameter Efficient Tuning Large Vision Language Models0
Independent Component Alignment for Multi-Task Learning0
Human Body Shape Classification Based on a Single Image0
ZeroPose: CAD-Prompted Zero-shot Object 6D Pose Estimation in Cluttered Scenes0
Rate-Distortion Theory in Coding for Machines and its Application0
Linear Object Detection in Document Images using Multiple Object Tracking0
GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance SegmentationCode0
OpenVIS: Open-vocabulary Video Instance SegmentationCode1
Sampling-based Uncertainty Estimation for an Instance Segmentation Network0
Streaming Object Detection on Fisheye Cameras for Automatic Parking0
Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance SegmentationCode1
Explain Any Concept: Segment Anything Meets Concept-Based ExplanationCode1
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode0
ROI-based Deep Image Compression with Swin Transformers0
A Comparative Evaluation of Deep Learning Techniques for Photovoltaic Panel Detection from Aerial ImagesCode1
FreePoint: Unsupervised Point Cloud Instance SegmentationCode1
Thermal Bridges on Building RooftopsCode1
Self-Supervised Instance Segmentation by Grasping0
Real-time instance segmentation with polygons using an Intersection-over-Union lossCode0
SwinDocSegmenter: An End-to-End Unified Domain Adaptive Transformer for Document Instance SegmentationCode1
Segmentation of the veterinary cytological images for fast neoplastic tumors diagnosis0
UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation0
HAISTA-NET: Human Assisted Instance Segmentation Through Attention0
Point2Tree(P2T) -- framework for parameter tuning of semantic and instance segmentation used with mobile laser scanning data in coniferous forest0
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything ModelCode2
CLUSTSEG: Clustering for Universal SegmentationCode1
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN0
LineFormer: Rethinking Line Chart Data Extraction as Instance SegmentationCode1
RT-K-Net: Revisiting K-Net for Real-Time Panoptic SegmentationCode1
MARS: Mask Attention Refinement with Sequential Quadtree Nodes for Car Damage Instance SegmentationCode1
Sensor Equivariance by LiDAR Projection Images0
Instance Segmentation in the DarkCode1
A Review of Panoptic Segmentation for Mobile Mapping Point CloudsCode1
Zero-shot Unsupervised Transfer Instance SegmentationCode1
EDAPS: Enhanced Domain-Adaptive Panoptic SegmentationCode1
Methods and datasets for segmentation of minimally invasive surgical instruments in endoscopic images and videos: A review of the state of the art0
AutoFocusFormer: Image Segmentation off the GridCode1
MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision TransformerCode1
Fully Sparse Fusion for 3D Object DetectionCode1
Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation ModelsCode1
Ensembling Instance and Semantic Segmentation for Panoptic Segmentation0
Baybayin Character Instance Detection0
Perceive, Excavate and Purify: A Novel Object Mining Framework for Instance Segmentation0
UDTIRI: An Online Open-Source Intelligent Road Inspection Benchmark Suite0
Self-Supervised Learning from Non-Object Centric Images with a Geometric Transformation Sensitive ArchitectureCode0
Show:102550
← PrevPage 16 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified