| ShareCMP: Polarization-Aware RGB-P Semantic Segmentation | Dec 6, 2023 | Semantic Segmentation | Code Available | 1 |
| AI-SAM: Automatic and Interactive Segment Anything Model | Dec 5, 2023 | Medical Image Segmentation, Semantic Segmentation | Code Available | 1 |
| Uni3DL: Unified Model for 3D and Language Understanding | Dec 5, 2023 | Cross-Modal Retrieval, Instance Segmentation | Unverified | 0 |
| PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood Estimation | Dec 5, 2023 | 3D Instance Segmentation, 3D Part Segmentation | Code Available | 1 |
| DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control | Dec 5, 2023 | Autonomous Driving, Domain Generalization | Unverified | 0 |
| Towards More Unified In-context Visual Understanding | Dec 5, 2023 | Decoder, Image Captioning | Unverified | 0 |
| Graph Information Bottleneck for Remote Sensing Segmentation | Dec 5, 2023 | Change Detection, Contrastive Learning | Unverified | 0 |
| Towards Granularity-adjusted Pixel-level Semantic Annotation | Dec 5, 2023 | Semantic Segmentation | Unverified | 0 |
| Panoptica -- instance-wise evaluation of 3D semantic and instance segmentation maps | Dec 5, 2023 | Instance Segmentation, Segmentation | Code Available | 1 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Dec 5, 2023 | Model Optimization, Novel Concepts | Code Available | 2 |
| Breast Cancer Detection Using Deep Learning Technique Based On Ultrasound Image | Dec 4, 2023 | Breast Cancer Detection, Deep Learning | Unverified | 0 |
| Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding | Dec 4, 2023 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation | Dec 4, 2023 | Benchmarking, Contrastive Learning | Unverified | 0 |
| SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference | Dec 4, 2023 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Class-Discriminative Attention Maps for Vision Transformers | Dec 4, 2023 | Computed Tomography (CT), Feature Importance | Unverified | 0 |
| Learning Efficient Unsupervised Satellite Image-based Building Damage Detection | Dec 4, 2023 | Building Damage Assessment, Damaged Building Detection | Code Available | 1 |
| MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation | Dec 4, 2023 | Image Segmentation, Inductive Bias | Code Available | 1 |
| SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution | Dec 4, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 0 |
| Strong but simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer Learning | Dec 4, 2023 | Domain Generalization, object-detection | Code Available | 1 |
| Instance-guided Cartoon Editing with a Large-scale Dataset | Dec 4, 2023 | Image Segmentation, Segmentation | Unverified | 0 |
| Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation | Dec 4, 2023 | Diversity, Domain Adaptation | Code Available | 1 |
| Few Clicks Suffice: Active Test-Time Adaptation for Semantic Segmentation | Dec 4, 2023 | Active Learning, Semantic Segmentation | Unverified | 0 |
| ResEnsemble-DDPM: Residual Denoising Diffusion Probabilistic Models for Ensemble Learning | Dec 4, 2023 | Denoising, Ensemble Learning | Unverified | 0 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Dec 4, 2023 | 3D Human Pose Estimation, Action Recognition | Code Available | 2 |
| SANeRF-HQ: Segment Anything for NeRF in High Quality | Dec 3, 2023 | NeRF, Novel View Synthesis | Unverified | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Dec 3, 2023 | Caption Generation, Decoder | Code Available | 0 |
| G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training | Dec 3, 2023 | object-detection, Object Detection | Code Available | 0 |
| A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | Dec 3, 2023 | Active Learning, Instance Segmentation | Unverified | 0 |
| A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing | Dec 3, 2023 | Autonomous Navigation, Data Augmentation | Unverified | 0 |
| T3D: Advancing 3D Medical Vision-Language Pre-training by Learning Multi-View Visual Consistency | Dec 3, 2023 | Clinical Knowledge, Contrastive Learning | Unverified | 0 |
| TranSegPGD: Improving Transferability of Adversarial Examples on Semantic Segmentation | Dec 3, 2023 | Adversarial Attack, image-classification | Unverified | 0 |
| Semantic segmentation of SEM images of lower bainitic and tempered martensitic steels | Dec 2, 2023 | Deep Learning, Semantic Segmentation | Unverified | 0 |
| Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels | Dec 2, 2023 | object-detection, Object Detection | Code Available | 1 |
| EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything | Dec 1, 2023 | Decoder, image-classification | Code Available | 4 |
| Improve Supervised Representation Learning with Masked Image Modeling | Dec 1, 2023 | Decoder, Image Retrieval | Unverified | 0 |
| Towards Generalizable Referring Image Segmentation via Target Prompt and Visual Coherence | Dec 1, 2023 | Image Segmentation, Semantic Segmentation | Unverified | 0 |
| SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers | Dec 1, 2023 | Decoder, Object | Code Available | 1 |
| Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Dec 1, 2023 | Image Retrieval, Object Localization | Code Available | 1 |
| A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing | Dec 1, 2023 | All, Semantic Segmentation | Unverified | 0 |
| Improving Normalization with the James-Stein Estimator | Dec 1, 2023 | 3D Object Classification, image-classification | Unverified | 0 |
| A Recent Survey of Vision Transformers for Medical Image Segmentation | Dec 1, 2023 | Image Segmentation, Inductive Bias | Unverified | 0 |
| Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Dec 1, 2023 | Decoder, object-detection | Code Available | 1 |
| SCHEME: Scalable Channel Mixer for Vision Transformers | Dec 1, 2023 | image-classification, Image Classification | Unverified | 0 |
| Generative Parameter-Efficient Fine-Tuning | Dec 1, 2023 | Arithmetic Reasoning, Fine-Grained Image Classification | Code Available | 1 |
| CellMixer: Annotation-free Semantic Cell Segmentation of Heterogeneous Cell Populations | Dec 1, 2023 | Cell Segmentation, Instance Segmentation | Unverified | 0 |
| Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Dec 1, 2023 | Contrastive Learning, Few-Shot Learning | Code Available | 3 |
| Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals | Dec 1, 2023 | Image Segmentation, Language Modeling | Unverified | 0 |
| Learning Part Segmentation from Synthetic Animals | Nov 30, 2023 | Domain Adaptation, Pseudo Label | Unverified | 0 |
| InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation | Nov 30, 2023 | Image Captioning, Referring Expression | Code Available | 0 |
| SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation | Nov 30, 2023 | Object, object-detection | Unverified | 0 |