| PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery Documentation | Dec 16, 2024 | Instance Segmentation, Semantic Segmentation | Code Available | 0 |
| Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation | Dec 16, 2024 | Adversarial Robustness, Mixture-of-Experts | Code Available | 0 |
| Adapting Segment Anything Model (SAM) to Experimental Datasets via Fine-Tuning on GAN-based Simulation: A Case Study in Additive Manufacturing | Dec 16, 2024 | Computational Efficiency, Generative Adversarial Network | Code Available | 0 |
| SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation | Dec 16, 2024 | Decoder, Semantic Segmentation | Code Available | 3 |
| MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation | Dec 16, 2024 | Image Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation | Dec 16, 2024 | Anatomy, Computed Tomography (CT) | Unverified | 0 |
| Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation | Dec 16, 2024 | Diversity, Semantic Segmentation | Code Available | 1 |
| SAMIC: Segment Anything with In-Context Spatial Prompt Engineering | Dec 16, 2024 | Few-Shot Learning, Prompt Engineering | Unverified | 0 |
| Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment | Dec 15, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 0 |
| MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation | Dec 15, 2024 | Semantic Segmentation, Weakly supervised Semantic Segmentation | Code Available | 1 |
| Classification Drives Geographic Bias in Street Scene Segmentation | Dec 15, 2024 | Classification, Diversity | Unverified | 0 |
| SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation | Dec 15, 2024 | Decoder, Few-shot Instance Segmentation | Unverified | 0 |
| OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving | Dec 14, 2024 | Autonomous Driving, Semantic Segmentation | Unverified | 0 |
| Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation | Dec 14, 2024 | Anomaly Segmentation, regression | Code Available | 0 |
| MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance | Dec 14, 2024 | Decoder, Depth Estimation | Unverified | 0 |
| CATALOG: A Camera Trap Language-guided Contrastive Learning Model | Dec 14, 2024 | Contrastive Learning, image-classification | Code Available | 0 |
| DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting | Dec 14, 2024 | 3D Reconstruction, Segmentation | Code Available | 1 |
| RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone | Dec 14, 2024 | image-classification, Image Classification | Code Available | 1 |
| SegACIL: Solving the Stability-Plasticity Dilemma in Class-Incremental Semantic Segmentation | Dec 14, 2024 | Class-Incremental Semantic Segmentation, Continual Learning | Code Available | 0 |
| Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer | Dec 13, 2024 | Image Segmentation, Segmentation | Unverified | 0 |
| Object-Focused Data Selection for Dense Prediction Tasks | Dec 13, 2024 | Object, object-detection | Unverified | 0 |
| A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation | Dec 13, 2024 | Domain Adaptation, Semantic Segmentation | Unverified | 0 |
| SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians | Dec 13, 2024 | GPU, Object Localization | Unverified | 0 |
| SPT: Sequence Prompt Transformer for Interactive Image Segmentation | Dec 13, 2024 | Image Segmentation, Interactive Segmentation | Unverified | 0 |
| DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations | Dec 12, 2024 | image-classification, Image Classification | Unverified | 0 |
| ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation | Dec 12, 2024 | Phrase Grounding, Question Answering | Unverified | 0 |
| On the effectiveness of Rotation-Equivariance in U-Net: A Benchmark for Image Segmentation | Dec 12, 2024 | Image Segmentation, Segmentation | Unverified | 0 |
| STEAM: Squeeze and Transform Enhanced Attention Module | Dec 12, 2024 | image-classification, Image Classification | Unverified | 0 |
| Towards Open-Vocabulary Video Semantic Segmentation | Dec 12, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 |
| FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation | Dec 12, 2024 | Cross-Domain Few-Shot, Domain Generalization | Code Available | 2 |
| Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis | Dec 12, 2024 | All, Classification | Unverified | 0 |
| VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation | Dec 12, 2024 | Domain Adaptation, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| MaskTerial: A Foundation Model for Automated 2D Material Flake Detection | Dec 12, 2024 | Instance Segmentation, Semantic Segmentation | Code Available | 2 |
| Automatic Image Annotation for Mapped Features Detection | Dec 11, 2024 | Autonomous Driving, Image Segmentation | Unverified | 0 |
| A feature refinement module for light-weight semantic segmentation network | Dec 11, 2024 | Segmentation, Semantic Segmentation | Unverified | 0 |
| SegFace: Face Segmentation of Long-Tail Classes | Dec 11, 2024 | Face Parsing, Face Swapping | Code Available | 2 |
| Unified HT-CNNs Architecture: Transfer Learning for Segmenting Diverse Brain Tumors in MRI from Gliomas to Pediatric Tumors | Dec 11, 2024 | Brain Tumor Segmentation, Image Segmentation | Unverified | 0 |
| ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement | Dec 11, 2024 | Decoder, Image Segmentation | Code Available | 2 |
| Utilizing Multi-step Loss for Single Image Reflection Removal | Dec 11, 2024 | Depth Estimation, Image Segmentation | Code Available | 0 |
| Post-Hoc MOTS: Exploring the Capabilities of Time-Symmetric Multi-Object Tracking | Dec 11, 2024 | Multi-Object Tracking, Object Tracking | Unverified | 0 |
| Structured IB: Improving Information Bottleneck with Structured Feature Learning | Dec 11, 2024 | Image Segmentation, Semantic Communication | Unverified | 0 |
| Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin | Dec 11, 2024 | Image Segmentation, Semantic Segmentation | Unverified | 0 |
| Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Dec 11, 2024 | Attribute, Benchmarking | Code Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | Decoder, GPU | Code Available | 1 |
| Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Dec 11, 2024 | Autonomous Driving, Contrastive Learning | Unverified | 0 |
| Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction | Dec 11, 2024 | 3D Semantic Occupancy Prediction, LIDAR Semantic Segmentation | Unverified | 0 |
| A Deep Semantic Segmentation Network with Semantic and Contextual Refinements | Dec 11, 2024 | Segmentation, Semantic Segmentation | Unverified | 0 |
| Annotation-Efficient Task Guidance for Medical Segment Anything | Dec 11, 2024 | Computed Tomography (CT), Image Segmentation | Code Available | 0 |
| Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Dec 11, 2024 | GPU, Image Segmentation | Code Available | 0 |
| Stable Mean Teacher for Semi-supervised Video Action Detection | Dec 10, 2024 | Action Detection, Semantic Segmentation | Code Available | 0 |