SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 901950 of 2262 papers

TitleStatusHype
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked ModelingCode3
TarViS: A Unified Approach for Target-based Video SegmentationCode1
The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed ManipulationCode1
InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation0
All in Tokens: Unifying Output Space of Visual Tasks via Soft TokenCode1
Object Segmentation with Audio Context0
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance SegmentationCode1
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance SegmentationCode1
LoTE-Animal: A Long Time-span Dataset for Endangered Animal Behavior Understanding0
MUVA: A New Large-Scale Benchmark for Multi-View Amodal Instance Segmentation in the Shopping Scenario0
Re:PolyWorld - A Graph Neural Network for Polygonal Scene Parsing0
Class-incremental Continual Learning for Instance Segmentation with Image-level Weak SupervisionCode1
Exploring the Sim2Real Gap Using Digital Twins0
AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts0
3D Instance Segmentation via Enhanced Spatial and Semantic Supervision0
Learning Cross-Representation Affinity Consistency for Sparsely Supervised Biomedical Instance SegmentationCode1
Query Refinement Transformer for 3D Instance Segmentation0
TopoSeg: Topology-Aware Nuclear Instance SegmentationCode0
Semantic Information in Contrastive LearningCode0
WaterMask: Instance Segmentation for Underwater ImageryCode1
X3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection0
1% VS 100%: Parameter-Efficient Low Rank Adapter for Dense Predictions0
Context-Aware Relative Object Queries To Unify Video Instance and Panoptic SegmentationCode1
Tree Instance Segmentation With Temporal Contour Graph0
Exemplar-FreeSOLO: Enhancing Unsupervised Instance Segmentation With Exemplars0
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode1
PartDistillation: Learning Parts From Instance SegmentationCode1
AttentionShift: Iteratively Estimated Part-Based Attention Map for Pointly Supervised Instance Segmentation0
Mask-Guided Matting in the Wild0
Camouflaged Instance Segmentation via Explicit De-Camouflaging0
Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security CheckpointsCode0
PanDepth: Joint Panoptic Segmentation and Depth CompletionCode1
CellTranspose: Few-shot Domain Adaptation for Cellular Instance Segmentation0
Brain Cancer Segmentation Using YOLOv5 Deep Neural Network0
PMODE: Prototypical Mask based Object Dimension Estimation0
A Close Look at Spatial Modeling: From Attention to ConvolutionCode1
Precise Location Matching Improves Dense Contrastive Learning in Digital PathologyCode0
SupeRGB-D: Zero-shot Instance Segmentation in Cluttered Indoor EnvironmentsCode1
Generalized Decoding for Pixel, Image, and LanguageCode3
Eff-3DPSeg: 3D organ-level plant shoot segmentation using annotation-efficient point clouds0
Which Pixel to Annotate: a Label-Efficient Nuclei Segmentation FrameworkCode1
Building Height Prediction with Instance Segmentation0
An annotated instance segmentation XXL-CT data-set from a historic airplane0
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration0
Unsupervised Object Localization: Observing the Background to Discover ObjectsCode1
EM-Paste: EM-guided Cut-Paste with DALL-E Augmentation for Image-level Weakly Supervised Instance SegmentationCode1
RTMDet: An Empirical Study of Designing Real-Time Object DetectorsCode4
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group PropagationCode1
CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion0
Look Before You Match: Instance Understanding Matters in Video Object Segmentation0
Show:102550
← PrevPage 19 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified