| DenoiseRep: Denoising Model for Representation Learning | Jun 13, 2024 | DenoisingFine-Grained Image Classification | CodeCode Available | 1 |
| 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | Jun 13, 2024 | Instance Segmentationmultimodal generation | CodeCode Available | 5 |
| Instance-level quantitative saliency in multiple sclerosis lesion segmentation | Jun 13, 2024 | Date UnderstandingLesion Segmentation | CodeCode Available | 0 |
| A Labeled Array Distance Metric for Measuring Image Segmentation Quality | Jun 12, 2024 | Image SegmentationSegmentation | —Unverified | 0 |
| Generalizable Disaster Damage Assessment via Change Detection with Vision Foundation Model | Jun 12, 2024 | Change DetectionImage Segmentation | —Unverified | 0 |
| Dataset Enhancement with Instance-Level Augmentations | Jun 12, 2024 | Data AugmentationObject | CodeCode Available | 1 |
| SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation | Jun 12, 2024 | Image SegmentationSegmentation | CodeCode Available | 0 |
| APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation | Jun 12, 2024 | Cross-Domain Few-ShotFew-Shot Semantic Segmentation | —Unverified | 0 |
| A^2-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder | Jun 12, 2024 | Change Detectionimage-classification | —Unverified | 0 |
| GRU-Net: Gaussian Attention Aided Dense Skip Connection Based MultiResUNet for Breast Histopathology Image Segmentation | Jun 12, 2024 | DecoderDiagnostic | CodeCode Available | 0 |
| 2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 12, 2024 | Instance SegmentationSemantic Segmentation | —Unverified | 0 |
| Real2Code: Reconstruct Articulated Objects via Code Generation | Jun 12, 2024 | Code GenerationImage Segmentation | —Unverified | 0 |
| RMem: Restricted Memory Banks Improve Video Object Segmentation | Jun 12, 2024 | ObjectSemantic Segmentation | —Unverified | 0 |
| Small Scale Data-Free Knowledge Distillation | Jun 12, 2024 | Data-free Knowledge DistillationGenerative Adversarial Network | CodeCode Available | 1 |
| OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Jun 12, 2024 | 3D Scene ReconstructionNeRF | CodeCode Available | 1 |
| Spatial-Frequency Dual Progressive Attention Network For Medical Image Segmentation | Jun 12, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion | Jun 11, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos | Jun 11, 2024 | ObjectObject Tracking | —Unverified | 0 |
| LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection | Jun 11, 2024 | 3D Semantic SegmentationAutonomous Driving | —Unverified | 0 |
| Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph | Jun 11, 2024 | 3D Object Retrieval3D Semantic Segmentation | —Unverified | 0 |
| PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving | Jun 11, 2024 | 3D Instance Segmentation3D Scene Reconstruction | —Unverified | 0 |
| 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 11, 2024 | Referring Video Object SegmentationSegmentation | CodeCode Available | 1 |
| Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach | Jun 11, 2024 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| Dual Thinking and Logical Processing -- Are Multi-modal Large Language Models Closing the Gap with Human Vision ? | Jun 11, 2024 | Autonomous DrivingDeep Learning | CodeCode Available | 0 |
| UVIS: Unsupervised Video Instance Segmentation | Jun 11, 2024 | Instance SegmentationLanguage Modelling | —Unverified | 0 |