| Stochastic positional embeddings improve masked image modeling | Jul 31, 2023 | Language ModellingMasked Language Modeling | CodeCode Available | 1 |
| XMem++: Production-level Video Segmentation From Few Annotated Frames | Jul 29, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation | Jul 15, 2023 | DecoderSegmentation | CodeCode Available | 0 |
| Rectifying Noisy Labels with Sequential Prior: Multi-Scale Temporal Feature Affinity Learning for Robust Video Segmentation | Jul 12, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 0 |
| Segment Anything Meets Point Tracking | Jul 3, 2023 | Interactive Video Object SegmentationObject | CodeCode Available | 3 |
| A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering | May 12, 2023 | Edge Detectionmodel | —Unverified | 0 |
| Automatic Interaction and Activity Recognition from Videos of Human Manual Demonstrations with Application to Anomaly Detection | Apr 19, 2023 | Activity RecognitionAnomaly Detection | —Unverified | 0 |
| MED-VT++: Unifying Multimodal Learning with a Multiscale Encoder-Decoder Video Transformer | Apr 12, 2023 | Action SegmentationDecoder | —Unverified | 0 |
| Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation | Mar 22, 2023 | Contrastive LearningSegmentation | CodeCode Available | 1 |
| Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation | Mar 17, 2023 | SegmentationSelf-Supervised Learning | CodeCode Available | 0 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Mar 16, 2023 | Knowledge DistillationOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| InstMove: Instance Motion for Object-centric Video Segmentation | Mar 14, 2023 | ObjectOptical Flow Estimation | CodeCode Available | 2 |
| A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented, Temporal and Depth-aware design | Mar 8, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Learning to Adapt to Online Streams with Distribution Shifts | Mar 2, 2023 | BenchmarkingMeta-Learning | —Unverified | 0 |
| Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation | Feb 22, 2023 | DecoderImage Segmentation | CodeCode Available | 1 |
| PolyFormer: Referring Image Segmentation as Sequential Polygon Generation | Feb 14, 2023 | DecoderImage Segmentation | CodeCode Available | 1 |
| Approximating DTW with a convolutional neural network on EEG data | Jan 30, 2023 | Anomaly DetectionComputational Efficiency | —Unverified | 0 |
| A Comprehensive Review of Modern Object Segmentation Approaches | Jan 13, 2023 | Image SegmentationObject | —Unverified | 0 |
| TarViS: A Unified Approach for Target-based Video Segmentation | Jan 6, 2023 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 1 |
| Object Segmentation with Audio Context | Jan 4, 2023 | audio-visual learningDecoder | —Unverified | 0 |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance SegmentationMulti-Object Tracking | CodeCode Available | 1 |
| NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation | Jan 1, 2023 | Video SegmentationVideo Semantic Segmentation | CodeCode Available | 0 |
| Video Segmentation Learning Using Cascade Residual Convolutional Neural Network | Dec 20, 2022 | Action RecognitionAnomaly Detection | —Unverified | 0 |
| Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation | Dec 9, 2022 | Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION | —Unverified | 0 |
| Robust Online Video Instance Segmentation with Track Queries | Nov 16, 2022 | Image SegmentationInstance Segmentation | CodeCode Available | 0 |
| Generalized Product-of-Experts for Learning Multimodal Representations in Noisy Environments | Nov 7, 2022 | 3D Hand Pose EstimationHand Pose Estimation | —Unverified | 0 |
| EISeg: An Efficient Interactive Segmentation Tool based on PaddlePaddle | Oct 17, 2022 | Image SegmentationInteractive Segmentation | CodeCode Available | 0 |
| Motion-inductive Self-supervised Object Discovery in Videos | Oct 1, 2022 | ObjectObject Discovery | —Unverified | 0 |
| EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Sep 26, 2022 | ObjectSegmentation | CodeCode Available | 1 |
| Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward | Sep 25, 2022 | DecoderVideo Editing | CodeCode Available | 1 |
| Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation | Aug 24, 2022 | Hierarchical Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Efficient Heterogeneous Video Segmentation at the Edge | Aug 24, 2022 | CPUGPU | —Unverified | 0 |
| Visual Subtitle Feature Enhanced Video Outline Generation | Aug 24, 2022 | ArticlesHeadline Generation | —Unverified | 0 |
| Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations | Jul 18, 2022 | object-detectionObject Detection | CodeCode Available | 1 |
| Personalized PCA: Decoupling Shared and Unique Features | Jul 17, 2022 | Video SegmentationVideo Semantic Segmentation | CodeCode Available | 0 |
| MAC-DO: An Efficient Output-Stationary GEMM Accelerator for CNNs Using DRAM Technology | Jul 16, 2022 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Domain Adaptive Video Segmentation via Temporal Pseudo Supervision | Jul 6, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Segmenting Moving Objects via an Object-Centric Layered Representation | Jul 5, 2022 | Instance SegmentationMotion Segmentation | CodeCode Available | 1 |
| Towards Robust Video Object Segmentation with Adaptive Object Calibration | Jul 2, 2022 | ObjectSegmentation | CodeCode Available | 1 |
| 5th Place Solution for YouTube-VOS Challenge 2022: Video Object Segmentation | Jun 20, 2022 | ObjectSegmentation | —Unverified | 0 |
| Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation | Jun 20, 2022 | Autonomous DrivingNetwork Pruning | —Unverified | 0 |
| An Image Processing Pipeline for Camera Trap Time-Lapse Recordings | Jun 10, 2022 | BIG-bench Machine LearningVideo Segmentation | CodeCode Available | 0 |
| A Machine Learning-based Segmentation Approach for Measuring Similarity between Sign Languages | Jun 1, 2022 | Machine TranslationVideo Segmentation | —Unverified | 0 |
| Differentiable Soft-Masked Attention | Jun 1, 2022 | ObjectSegmentation | CodeCode Available | 1 |
| TubeFormer-DeepLab: Video Mask Transformer | May 30, 2022 | Panoptic SegmentationSegmentation | —Unverified | 0 |
| Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion | May 16, 2022 | Image SegmentationOptical Flow Estimation | —Unverified | 0 |
| 3D Convolutional Networks for Action Recognition: Application to Sport Gesture Recognition | Apr 13, 2022 | Action RecognitionClassification | —Unverified | 0 |
| Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation | Apr 10, 2022 | Image SegmentationInstance Segmentation | CodeCode Available | 1 |
| Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation | Apr 6, 2022 | Optical Flow EstimationReferring Expression Segmentation | CodeCode Available | 1 |
| Human Instance Segmentation and Tracking via Data Association and Single-stage Detector | Mar 31, 2022 | Human Instance SegmentationInstance Segmentation | —Unverified | 0 |