| DINOv2 based Self Supervised Learning For Few Shot Medical Image Segmentation | Mar 5, 2024 | Image SegmentationMedical Image Analysis | CodeCode Available | 1 |
| Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning | Mar 5, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| End-to-End Human Instance Matting | Mar 3, 2024 | Image MattingInstance Segmentation | CodeCode Available | 1 |
| AIO2: Online Correction of Object Labels for Deep Learning with Incomplete Annotation in Remote Sensing Image Segmentation | Mar 3, 2024 | Earth ObservationImage Segmentation | CodeCode Available | 1 |
| Benchmarking Segmentation Models with Mask-Preserved Attribute Editing | Mar 2, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| VideoMAC: Video Masked Autoencoders Meet ConvNets | Feb 29, 2024 | Pose TrackingRepresentation Learning | CodeCode Available | 1 |
| FedLPPA: Learning Personalized Prompt and Aggregation for Federated Weakly-supervised Medical Image Segmentation | Feb 27, 2024 | DecoderFederated Learning | CodeCode Available | 1 |
| Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label | Feb 27, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation | Feb 27, 2024 | Semantic SegmentationWeakly supervised Semantic Segmentation | CodeCode Available | 1 |
| BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM | Feb 26, 2024 | Image SegmentationSegmentation | CodeCode Available | 1 |
| Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Feb 26, 2024 | Anomaly SegmentationSegmentation | CodeCode Available | 1 |
| LLMBind: A Unified Modality-Task Integration Framework | Feb 22, 2024 | AI AgentAudio Generation | CodeCode Available | 1 |
| DeiSAM: Segment Anything with Deictic Prompting | Feb 21, 2024 | Image SegmentationSegmentation | CodeCode Available | 1 |
| BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery | Feb 21, 2024 | Body DetectionCloud Detection | CodeCode Available | 1 |
| Object-level Geometric Structure Preserving for Natural Image Stitching | Feb 20, 2024 | Image StitchingObject | CodeCode Available | 1 |
| LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Feb 19, 2024 | Explainable artificial intelligenceExplainable Artificial Intelligence (XAI) | CodeCode Available | 1 |
| Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers | Feb 19, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation Strategies | Feb 17, 2024 | Data AugmentationDiversity | CodeCode Available | 1 |
| ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Feb 17, 2024 | Diversityimage-classification | CodeCode Available | 1 |
| ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models | Feb 17, 2024 | Earth ObservationImage Captioning | CodeCode Available | 1 |
| LSRFormer: Efficient Transformer Supply Convolutional Neural Networks with Global Information for Aerial Image Segmentation | Feb 16, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Lester: rotoscope animation through video object segmentation and tracking | Feb 15, 2024 | 3D Human Pose EstimationObject | CodeCode Available | 1 |
| MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | Feb 15, 2024 | Contrastive LearningImage Clustering | CodeCode Available | 1 |
| MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding | Feb 15, 2024 | 3D Part Segmentation3D Semantic Segmentation | CodeCode Available | 1 |
| TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Feb 14, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |