SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 13011350 of 2262 papers

TitleStatusHype
VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance SegmentationCode1
Geometry-Aware Fruit Grasping Estimation for Robotic Harvesting in Orchards0
Deep Level Set for Box-supervised Instance Segmentation in Aerial ImagesCode0
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding0
DIY Graphics Tab: A Cost-Effective Alternative to Graphics Tablet for Educators0
Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation0
Learning to Detect Every Thing in an Open World0
Artificial Intelligence-driven Image Analysis of Bacterial Cells and BiofilmsCode0
Masked-attention Mask Transformer for Universal Image SegmentationCode2
MViTv2: Improved Multiscale Vision Transformers for Classification and DetectionCode1
DenseCLIP: Language-Guided Dense Prediction with Context-Aware PromptingCode1
The Second Place Solution for ICCV2021 VIPriors Instance Segmentation Challenge0
SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency0
Dimensions of Motion: Monocular Prediction through Flow Subspaces0
Putting 3D Spatially Sparse Networks on a Diet0
DeepSportLab: a Unified Framework for Ball Detection, Player Instance Segmentation and Pose Estimation in Team Sports ScenesCode0
Container: Context Aggregation NetworksCode1
The MIS Check-Dam Dataset for Object Detection and Instance Segmentation Tasks0
PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite ImagesCode1
Point Cloud Instance Segmentation with Semi-supervised Bounding-Box MiningCode0
End-to-End Referring Video Object Segmentation with Multimodal TransformersCode1
Mask Transfiner for High-Quality Instance SegmentationCode1
Contrastive Object-level Pre-training with Spatial Noise Curriculum LearningCode1
Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware AssociationCode0
BoxeR: Box-Attention for 2D and 3D TransformersCode1
Conditional Object-Centric Learning from VideoCode1
Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-LabelingCode1
Bounding Box-Free Instance Segmentation Using Semi-Supervised Learning for Generating a City-Scale Vehicle Dataset0
Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for Challenging Tasks in 3D Mapping0
Depth-aware Object Segmentation and Grasp Detection for Robotic Picking TasksCode1
Lebanon Solar Rooftop Potential Assessment using Buildings Segmentation from Aerial Images0
Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing ImagesCode1
HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning0
Panoptic Segmentation: A ReviewCode1
Swin Transformer V2: Scaling Up Capacity and ResolutionCode1
TransMix: Attend to Mix for Vision TransformersCode1
Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion0
Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance SegmentationCode0
iBOT: Image BERT Pre-Training with Online TokenizerCode1
Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge0
Deep-learning in the bioimaging wild: Handling ambiguous data with deepflash2Code1
Evaluation of Deep Learning Topcoders Method for Neuron Individualization in Histological Macaque Brain Section0
Unsupervised Spiking Instance Segmentation on Event Data using STDP0
Real-time Instance Segmentation of Surgical Instruments using Attention and Multi-scale Feature Fusion0
Segmentation of Multiple Myeloma Plasma Cells in Microscopy Images with Noisy Labels0
ROFT: Real-Time Optical Flow-Aided 6D Object Pose and Velocity TrackingCode1
LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation0
Towards Panoptic 3D Parsing for Single Image in the Wild0
Panoptic 3D Scene Reconstruction From a Single RGB ImageCode1
CPSeg: Cluster-free Panoptic Segmentation of 3D LiDAR Point Clouds0
Show:102550
← PrevPage 27 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified