| SAM 2: Segment Anything in Images and Videos | Aug 1, 2024 | Image SegmentationRobot Manipulation Generalization | CodeCode Available | 11 | 5 |
| Grounded SAM: Assembling Open-World Models for Diverse Visual Tasks | Jan 25, 2024 | Segmentation | CodeCode Available | 9 | 5 |
| Efficient MedSAMs: Segment Anything in Medical Images on Laptop | Dec 20, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 7 | 5 |
| Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion | Feb 21, 2022 | BinarizationModel Optimization | CodeCode Available | 7 | 5 |
| Segment Anything in Medical Images and Videos: Benchmark and Deployment | Aug 6, 2024 | BenchmarkingSegmentation | CodeCode Available | 7 | 5 |
| Efficient Track Anything | Nov 28, 2024 | ObjectSegmentation | CodeCode Available | 7 | 5 |
| U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation | Nov 29, 2023 | Computational EfficiencyDecoder | CodeCode Available | 6 | 5 |
| FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Mar 15, 2024 | Depth EstimationDepth Prediction | CodeCode Available | 5 | 5 |
| Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model | Jun 27, 2024 | MambaSegmentation | CodeCode Available | 5 | 5 |
| OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding | Jun 27, 2024 | DecoderSegmentation | CodeCode Available | 5 | 5 |