SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 21012150 of 2262 papers

TitleStatusHype
Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning0
Lebanon Solar Rooftop Potential Assessment using Buildings Segmentation from Aerial Images0
SOLO: A Simple Framework for Instance Segmentation0
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration0
SOS: Segment Object System for Open-World Instance Segmentation With Object Priors0
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning0
Spatial Attention Pyramid Network for Unsupervised Domain Adaptation0
Spatial Sampling Network for Fast Scene Understanding0
Spatio-temporal Human Action Localisation and Instance Segmentation in Temporally Untrimmed Videos0
SpectFormer: Frequency and Attention is what you need in a Vision Transformer0
Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds0
SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds0
StandardSim: A Synthetic Dataset For Retail Environments0
Stateless actor-critic for instance segmentation with high-level priors0
Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation0
STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation0
STEAM: Squeeze and Transform Enhanced Attention Module0
Streaming Object Detection on Fisheye Cameras for Automatic Parking0
Structured Model Pruning for Efficient Inference in Computational Pathology0
Structure-Preserving Instance Segmentation via Skeleton-Aware Distance Transform0
SUDS: Scalable Urban Dynamic Scenes0
SUGAR: Pre-training 3D Visual Representations for Robotics0
SUNet: Scale-aware Unified Network for Panoptic Segmentation0
supervised adptive threshold network for instance segmentation0
SVIRO: Synthetic Vehicle Interior Rear Seat Occupancy Dataset and Benchmark0
Symmetric masking strategy enhances the performance of Masked Image Modeling0
Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities0
UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation0
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency0
Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation0
TA-Net: Topology-Aware Network for Gland Segmentation0
Task-Specific Data Augmentation and Inference Processing for VIPriors Instance Segmentation Challenge0
Team PFDet's Methods for Open Images Challenge 20190
Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Anthropic Prior Knowledge0
Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Multi-Scale Aggregation and Anthropic Prior Knowledge0
Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs0
Temporal Prediction and Evaluation of Brassica Growth in the Field using Conditional Generative Adversarial Networks0
Temporal RoI Align for Video Object Recognition0
TESA: Tensor Element Self-Attention via Matricization0
Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation0
TextMountain: Accurate Scene Text Detection via Instance Segmentation0
The Best of Both Modes: Separately Leveraging RGB and Depth for Unseen Object Instance Segmentation0
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation0
Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings0
The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes0
The MIS Check-Dam Dataset for Object Detection and Instance Segmentation Tasks0
The ParallelEye Dataset: Constructing Large-Scale Artificial Scenes for Traffic Vision Research0
The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning0
The Role of Regularization in Shaping Weight and Node Pruning Dependency and Dynamics0
The Runner-up Solution for YouTube-VIS Long Video Challenge 20220
Show:102550
← PrevPage 43 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified