| LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 | 5 |
| Semantic-SAM: Segment and Recognize Anything at Any Granularity | Jul 10, 2023 | Image SegmentationSegmentation | CodeCode Available | 4 | 5 |
| Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation | Mar 29, 2022 | Binary ClassificationSegmentation | CodeCode Available | 4 | 5 |
| Highly Accurate Dichotomous Image Segmentation | Mar 6, 2022 | 2k3D Reconstruction | CodeCode Available | 4 | 5 |
| Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering | Jan 12, 2024 | 3D Panoptic Segmentation3D Semantic Segmentation | CodeCode Available | 4 | 5 |
| LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model | Dec 28, 2023 | Instance SegmentationLanguage Modeling | CodeCode Available | 4 | 5 |
| SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree | Oct 21, 2024 | Heuristic SearchObject | CodeCode Available | 4 | 5 |
| SAMPart3D: Segment Any Part in 3D Objects | Nov 11, 2024 | 3D Generation3D Part Segmentation | CodeCode Available | 4 | 5 |
| SegGPT: Segmenting Everything In Context | Apr 6, 2023 | Few-Shot Semantic SegmentationIn-Context Learning | CodeCode Available | 4 | 5 |
| PVUW 2024 Challenge on Complex Video Understanding: Methods and Results | Jun 24, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 4 | 5 |