| Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation | Jun 17, 2024 | DecoderSegmentation | CodeCode Available | 2 |
| Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes | Jun 17, 2024 | Data AugmentationImage Generation | —Unverified | 0 |
| Multimodal Learning With Intraoperative CBCT & Variably Aligned Preoperative CT Data To Improve Segmentation | Jun 17, 2024 | SegmentationSemantic Segmentation | —Unverified | 0 |
| OoDIS: Anomaly Instance Segmentation Benchmark | Jun 17, 2024 | Anomaly Instance SegmentationAnomaly Segmentation | CodeCode Available | 1 |
| Visually Consistent Hierarchical Image Classification | Jun 17, 2024 | Classificationimage-classification | —Unverified | 0 |
| Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding | Jun 17, 2024 | 3D Object Detection3D Semantic Segmentation | —Unverified | 0 |
| SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation | Jun 17, 2024 | Semantic Segmentation | CodeCode Available | 0 |
| Boosting Medical Image Classification with Segmentation Foundation Model | Jun 16, 2024 | Classificationimage-classification | —Unverified | 0 |
| Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters | Jun 16, 2024 | BenchmarkingInstance Segmentation | CodeCode Available | 0 |
| ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model | Jun 16, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery | Jun 16, 2024 | DecoderEarth Observation | CodeCode Available | 5 |
| α-OCC: Uncertainty-Aware Camera-based 3D Semantic Occupancy Prediction | Jun 16, 2024 | 3D Semantic Occupancy Prediction3D Semantic Scene Completion | —Unverified | 0 |
| Microscopy Image Dataset for Deep Learning-Based Quantitative Assessment of Pulmonary Vascular Changes | Jun 15, 2024 | PrognosisSemantic Segmentation | —Unverified | 0 |
| Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation | Jun 15, 2024 | Image SegmentationMedical Image Analysis | —Unverified | 0 |
| GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR | Jun 15, 2024 | Autonomous DrivingDepth Estimation | —Unverified | 0 |
| A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection | Jun 15, 2024 | Change DetectionEarth Observation | CodeCode Available | 0 |
| MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | Jun 15, 2024 | Autonomous Drivingenergy management | CodeCode Available | 0 |
| The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences | Jun 14, 2024 | Depth EstimationImage Segmentation | —Unverified | 0 |
| Enhancing Incomplete Multi-modal Brain Tumor Segmentation with Intra-modal Asymmetry and Inter-modal Dependency | Jun 14, 2024 | AnatomyBrain Tumor Segmentation | CodeCode Available | 0 |
| Open-Vocabulary Semantic Segmentation with Image Embedding Balancing | Jun 14, 2024 | DecoderOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers | Jun 14, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video | Jun 14, 2024 | Depth EstimationDynamic Reconstruction | —Unverified | 0 |
| Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation | Jun 14, 2024 | Domain AdaptationOut-of-Distribution Generalization | CodeCode Available | 1 |
| Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions | Jun 14, 2024 | Few-Shot Semantic SegmentationSemantic Segmentation | CodeCode Available | 1 |
| RobustSAM: Segment Anything Robustly on Degraded Images | Jun 13, 2024 | DeblurringImage Dehazing | CodeCode Available | 3 |
| DenoiseRep: Denoising Model for Representation Learning | Jun 13, 2024 | DenoisingFine-Grained Image Classification | CodeCode Available | 1 |
| 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | Jun 13, 2024 | Instance Segmentationmultimodal generation | CodeCode Available | 5 |
| Instance-level quantitative saliency in multiple sclerosis lesion segmentation | Jun 13, 2024 | Date UnderstandingLesion Segmentation | CodeCode Available | 0 |
| A Labeled Array Distance Metric for Measuring Image Segmentation Quality | Jun 12, 2024 | Image SegmentationSegmentation | —Unverified | 0 |
| Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model | Jun 12, 2024 | Change DetectionImage Segmentation | —Unverified | 0 |
| Dataset Enhancement with Instance-Level Augmentations | Jun 12, 2024 | Data AugmentationObject | CodeCode Available | 1 |
| SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation | Jun 12, 2024 | Image SegmentationSegmentation | CodeCode Available | 0 |
| APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation | Jun 12, 2024 | Cross-Domain Few-ShotFew-Shot Semantic Segmentation | —Unverified | 0 |
| A^2-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | Jun 12, 2024 | Change Detectionimage-classification | —Unverified | 0 |
| GRU-Net: Gaussian Attention Aided Dense Skip Connection Based MultiResUNet for Breast Histopathology Image Segmentation | Jun 12, 2024 | DecoderDiagnostic | CodeCode Available | 0 |
| 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 12, 2024 | Instance SegmentationSemantic Segmentation | —Unverified | 0 |
| Real2Code: Reconstruct Articulated Objects via Code Generation | Jun 12, 2024 | Code GenerationImage Segmentation | —Unverified | 0 |
| RMem: Restricted Memory Banks Improve Video Object Segmentation | Jun 12, 2024 | ObjectSemantic Segmentation | —Unverified | 0 |
| Small Scale Data-Free Knowledge Distillation | Jun 12, 2024 | Data-free Knowledge DistillationGenerative Adversarial Network | CodeCode Available | 1 |
| OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Jun 12, 2024 | 3D Scene ReconstructionNeRF | CodeCode Available | 1 |
| Spatial-Frequency Dual Progressive Attention Network For Medical Image Segmentation | Jun 12, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion | Jun 11, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos | Jun 11, 2024 | ObjectObject Tracking | —Unverified | 0 |
| LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jun 11, 2024 | 3D Semantic SegmentationAutonomous Driving | —Unverified | 0 |
| Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph | Jun 11, 2024 | 3D Object Retrieval3D Semantic Segmentation | —Unverified | 0 |
| PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Jun 11, 2024 | 3D Instance Segmentation3D Scene Reconstruction | —Unverified | 0 |
| 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 11, 2024 | Referring Video Object SegmentationSegmentation | CodeCode Available | 1 |
| Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Jun 11, 2024 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| Dual Thinking and Logical Processing -- Are Multi-modal Large Language Models Closing the Gap with Human Vision ? | Jun 11, 2024 | Autonomous DrivingDeep Learning | CodeCode Available | 0 |
| UVIS: Unsupervised Video Instance Segmentation | Jun 11, 2024 | Instance SegmentationLanguage Modelling | —Unverified | 0 |