| Stochastic positional embeddings improve masked image modeling | Jul 31, 2023 | Language Modelling, Masked Language Modeling | Code Available | 1 |
| XMem++: Production-level Video Segmentation From Few Annotated Frames | Jul 29, 2023 | Segmentation, Semantic Segmentation | Code Available | 2 |
| Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation | Jul 15, 2023 | Decoder, Segmentation | Code Available | 0 |
| Rectifying Noisy Labels with Sequential Prior: Multi-Scale Temporal Feature Affinity Learning for Robust Video Segmentation | Jul 12, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 0 |
| Segment Anything Meets Point Tracking | Jul 3, 2023 | Interactive Video Object Segmentation, Object | Code Available | 3 |
| A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering | May 12, 2023 | Edge Detection, model | —Unverified | 0 |
| Automatic Interaction and Activity Recognition from Videos of Human Manual Demonstrations with Application to Anomaly Detection | Apr 19, 2023 | Activity Recognition, Anomaly Detection | —Unverified | 0 |
| MED-VT++: Unifying Multimodal Learning with a Multiscale Encoder-Decoder Video Transformer | Apr 12, 2023 | Action Segmentation, Decoder | —Unverified | 0 |
| Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation | Mar 22, 2023 | Contrastive Learning, Segmentation | Code Available | 1 |
| Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation | Mar 17, 2023 | Segmentation, Self-Supervised Learning | Code Available | 0 |