| Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation | Apr 4, 2024 | Segmentation, Semantic Segmentation | Unverified | 0 |
| CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks | Apr 4, 2024 | Autonomous Driving, Instance Segmentation | Unverified | 0 |
| HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion | Apr 4, 2024 | Scene Parsing, Semantic Segmentation | Code Available | 0 |
| OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Apr 4, 2024 | Image Segmentation, NeRF | Unverified | 0 |
| EndoViT: pretraining vision transformers on a large collection of endoscopic images | Apr 3, 2024 | Action Triplet Recognition, Segmentation | Code Available | 1 |
| Enhancing crop segmentation in satellite image time-series with transformer networks | Apr 3, 2024 | Segmentation, Semantic Segmentation | Code Available | 0 |
| Investigation of Energy-efficient AI Model Architectures and Compression Techniques for "Green" Fetal Brain Segmentation | Apr 3, 2024 | Brain Segmentation, Image Segmentation | Unverified | 0 |
| Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression | Apr 3, 2024 | Autonomous Vehicles, Data Compression | Unverified | 0 |
| HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Apr 3, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation | Apr 3, 2024 | Active Learning, Semantic Segmentation | Unverified | 0 |
| GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation | Apr 3, 2024 | GPU, Semantic Segmentation | Unverified | 0 |
| A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task | Apr 3, 2024 | Semantic Segmentation | Unverified | 0 |
| Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation | Apr 3, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation | Apr 3, 2024 | Attribute, Semantic Segmentation | Code Available | 1 |
| RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation | Apr 3, 2024 | Long-range modeling, Mamba | Unverified | 0 |
| RS-Mamba for Large Remote Sensing Image Dense Prediction | Apr 3, 2024 | Building change detection for remote sensing images, Change Detection | Code Available | 3 |
| Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs | Apr 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss | Apr 2, 2024 | image-classification, Image Classification | Code Available | 2 |
| Contextual Embedding Learning to Enhance 2D Networks for Volumetric Image Segmentation | Apr 2, 2024 | Image Segmentation, Segmentation | Code Available | 0 |
| Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Apr 2, 2024 | Decoder, Mamba | Code Available | 2 |
| Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation | Apr 2, 2024 | Semantic Segmentation, Semi-Supervised Semantic Segmentation | Unverified | 0 |
| Adaptive Feature Fusion Neural Network for Glaucoma Segmentation on Unseen Fundus Images | Apr 2, 2024 | Decoder, Domain Generalization | Unverified | 0 |
| Event-assisted Low-Light Video Object Segmentation | Apr 2, 2024 | Object, Semantic Segmentation | Code Available | 1 |
| Synthetic Data for Robust Stroke Segmentation | Apr 2, 2024 | Lesion Segmentation, Segmentation | Code Available | 1 |
| Improving Bird's Eye View Semantic Segmentation by Task Decomposition | Apr 2, 2024 | Autonomous Driving, BEV Segmentation | Unverified | 0 |
| Red-Teaming Segment Anything Model | Apr 2, 2024 | Image Segmentation, model | Code Available | 0 |
| Segment Any 3D Object with Language | Apr 2, 2024 | 3D Instance Segmentation, Decoder | Unverified | 0 |
| Versatile Navigation under Partial Observability via Value-guided Diffusion Policy | Apr 1, 2024 | Autonomous Driving, Semantic Segmentation | Unverified | 0 |
| Diffusion based Zero-shot Medical Image-to-Image Translation for Cross Modality Segmentation | Apr 1, 2024 | Image Segmentation, Image-to-Image Translation | Unverified | 0 |
| Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation | Apr 1, 2024 | object-detection, Object Detection | Code Available | 0 |
| Instance-Aware Group Quantization for Vision Transformers | Apr 1, 2024 | image-classification, Image Classification | Unverified | 0 |
| SUGAR: Pre-training 3D Visual Representations for Robotics | Apr 1, 2024 | 3D Instance Segmentation, 3D Object Recognition | Unverified | 0 |
| PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation | Apr 1, 2024 | Decoder, Knowledge Distillation | Code Available | 1 |
| Language Guided Domain Generalized Medical Image Segmentation | Apr 1, 2024 | Contrastive Learning, Domain Adaptation | Code Available | 1 |
| GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields | Apr 1, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 1 |
| Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Anthropic Prior Knowledge | Apr 1, 2024 | Image Segmentation, Instance Segmentation | Unverified | 0 |
| What is Point Supervision Worth in Video Instance Segmentation? | Apr 1, 2024 | Instance Segmentation, Object | Unverified | 0 |
| T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation | Apr 1, 2024 | Image Segmentation, Mamba | Code Available | 2 |
| OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation | Apr 1, 2024 | Image Segmentation, Image to text | Unverified | 0 |
| Medical Visual Prompting (MVP): A Unified Framework for Versatile and High-Quality Medical Image Segmentation | Apr 1, 2024 | Image Segmentation, Medical Image Segmentation | Unverified | 0 |
| Training-Free Semantic Segmentation via LLM-Supervision | Mar 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Mar 31, 2024 | Image Segmentation, Interactive Segmentation | Code Available | 2 |
| MugenNet: A Novel Combined Convolution Neural Network and Transformer Network with its Application for Colonic Polyp Image Segmentation | Mar 31, 2024 | Computational Efficiency, Image Segmentation | Unverified | 0 |
| LAESI: Leaf Area Estimation with Synthetic Imagery | Mar 31, 2024 | Semantic Segmentation | Unverified | 0 |
| Deep Instruction Tuning for Segment Anything Model | Mar 31, 2024 | Decoder, Image Segmentation | Code Available | 1 |
| Attention-based Shape-Deformation Networks for Artifact-Free Geometry Reconstruction of Lumbar Spine from MR Images | Mar 30, 2024 | Image Segmentation, Semantic Segmentation | Code Available | 0 |
| Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation | Mar 30, 2024 | Attribute, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | Mar 30, 2024 | Multi-Label Text Classification, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation | Mar 30, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 |
| AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation | Mar 29, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 |