SOTAVerified

Instance Segmentation

Instance segmentation is a computer vision task that identifies and separates individual objects within an image: it detects the boundaries of each object and assigns each object a unique label. The goal is to produce a pixel-wise segmentation map of the image in which each object pixel is assigned to a specific object instance.
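
To make the idea of a pixel-wise instance map concrete, here is a minimal sketch in plain Python. It assumes a toy label map in which 0 marks background and each positive integer marks one object instance. The grid values and the helper names (`split_instances`, `mask_area`, `bounding_box`) are illustrative assumptions, not part of any particular library or benchmark.

```python
# Hypothetical 4x6 instance map: 0 = background, 1 and 2 = two object instances.
instance_map = [
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 2, 2],
    [0, 0, 0, 0, 2, 2],
    [0, 0, 0, 0, 2, 0],
]


def split_instances(label_map, background=0):
    """Return {instance_id: binary mask} for every non-background id."""
    ids = sorted({v for row in label_map for v in row if v != background})
    return {
        i: [[1 if v == i else 0 for v in row] for row in label_map]
        for i in ids
    }


def mask_area(mask):
    """Count the pixels that belong to the instance."""
    return sum(sum(row) for row in mask)


def bounding_box(mask):
    """Tight box as (row_min, col_min, row_max, col_max), inclusive."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for c in range(len(mask[0])) if any(row[c] for row in mask)]
    return rows[0], cols[0], rows[-1], cols[-1]


masks = split_instances(instance_map)
for instance_id, mask in masks.items():
    print(instance_id, mask_area(mask), bounding_box(mask))
```

Each instance gets its own binary mask, so two objects of the same class remain separable. This is what distinguishes instance segmentation from semantic segmentation, where such objects would share a single label.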

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 951-1000 of 2262 papers

Title | Status | Hype
Deep learning approaches to building rooftop thermal bridge detection from aerial images | Code | 1
Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation | - | 0
Robust Perception through Equivariance | Code | 0
Benchmarking Self-Supervised Learning on Diverse Pathology Datasets | Code | 1
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion | Code | 1
AsyInst: Asymmetric Affinity with DepthGrad and Color for Box-Supervised Instance Segmentation | - | 0
LWSIS: LiDAR-guided Weakly Supervised Instance Segmentation for Autonomous Driving | Code | 1
MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy | Code | 1
Framework-agnostic Semantically-aware Global Reasoning for Segmentation | - | 0
Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sections | Code | 1
DiffusionInst: Diffusion Model for Instance Segmentation | Code | 2
Med-Query: Steerable Parsing of 9-DoF Medical Anatomies with Query Embedding | Code | 1
Box2Mask: Box-supervised Instance Segmentation via Level-set Evolution | Code | 2
3D Segmentation of Humans in Point Clouds with Synthetic Data | - | 0
Uncertainty-Aware Contour Proposal Networks for Cell Segmentation in Multi-Modality High-Resolution Microscopy Images | Code | 1
AIO-P: Expanding Neural Performance Predictors Beyond Image Classification | Code | 1
Growing Instance Mask on Leaf | - | 0
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Code | 2
Superpoint Transformer for 3D Scene Instance Segmentation | Code | 1
FsaNet: Frequency Self-attention for Semantic Segmentation | Code | 1
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments | - | 0
EasyMLServe: Easy Deployment of REST Machine Learning Services | Code | 0
Automating Cobb Angle Measurement for Adolescent Idiopathic Scoliosis using Instance Segmentation | - | 0
Language-Assisted 3D Feature Learning for Semantic Scene Understanding | Code | 1
A Benchmark of Long-tailed Instance Segmentation with Noisy Labels | Code | 0
How do Cross-View and Cross-Modal Alignment Affect Representations in Contrastive Learning? | - | 0
EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data | Code | 0
DETRs with Collaborative Hybrid Assignments Training | Code | 3
Rethinking Implicit Neural Representations for Vision Learners | - | 0
Task-Specific Data Augmentation and Inference Processing for VIPriors Instance Segmentation Challenge | - | 0
Mean Shift Mask Transformer for Unseen Object Instance Segmentation | Code | 1
PIDray: A Large-scale X-ray Benchmark for Real-World Prohibited Item Detection | Code | 1
SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification | - | 0
The Runner-up Solution for YouTube-VIS Long Video Challenge 2022 | - | 0
TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation | - | 0
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation | - | 0
A Generalized Framework for Video Instance Segmentation | Code | 1
Robust Online Video Instance Segmentation with Track Queries | Code | 0
Label-Efficient Object Detection via Region Proposal Network Pre-Training | - | 0
Forecasting Future Instance Segmentation with Learned Optical Flow and Warping | - | 0
PAI3D: Painting Adaptive Instance-Prior for 3D Object Detection | - | 0
Deep Instance Segmentation and Visual Servoing to Play Jenga with a Cost-Effective Robotic System | - | 0
Recursive Cross-View: Use Only 2D Detectors to Achieve 3D Object Detection without 3D Annotations | - | 0
MR-NOM: Multi-scale Resolution of Neuronal cells in Nissl-stained histological slices via deliberate Over-segmentation and Merging | - | 0
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Code | 0
OneFormer: One Transformer to Rule Universal Image Segmentation | Code | 3
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions | Code | 4
Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding | - | 0
MogaNet: Multi-order Gated Aggregation Network | Code | 2
BriFiSeg: a deep learning-based method for semantic and instance segmentation of nuclei in brightfield images | Code | 0
Page 20 of 46

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H | AP50 | 80.8 | - | Unverified
2 | ResNeSt-200 (multi-scale) | AP50 | 70.2 | - | Unverified
3 | CenterMask + VoVNetV2-99 (multi-scale) | AP50 | 66.2 | - | Unverified
4 | CenterMask + VoVNetV2-57 (single-scale) | AP50 | 60.8 | - | Unverified
5 | Co-DETR | mask AP | 57.1 | - | Unverified
6 | CBNetV2 (EVA02, single-scale) | mask AP | 56.1 | - | Unverified
7 | ISDA (ResNet-50) | APL | 55.7 | - | Unverified
8 | EVA | mask AP | 55.5 | - | Unverified
9 | FD-SwinV2-G | mask AP | 55.4 | - | Unverified
10 | Mask Frozen-DETR | mask AP | 55.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-B | GFLOPs | 501 | - | Unverified
2 | Co-DETR | mask AP | 56.6 | - | Unverified
3 | ViT-CoMer-L (Mask RCNN, DINOv2) | mask AP | 55.9 | - | Unverified
4 | InternImage-H | mask AP | 55.4 | - | Unverified
5 | EVA | mask AP | 55 | - | Unverified
6 | Mask Frozen-DETR | mask AP | 54.9 | - | Unverified
7 | Mask DINO (SwinL, multi-scale) | mask AP | 54.5 | - | Unverified
8 | ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale) | mask AP | 54.2 | - | Unverified
9 | GLEE-Pro | mask AP | 54.2 | - | Unverified
10 | SwinV2-G (HTC++) | mask AP | 53.7 | - | Unverified