| ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation | Dec 12, 2024 | Phrase GroundingQuestion Answering | —Unverified | 0 |
| On the effectiveness of Rotation-Equivariance in U-Net: A Benchmark for Image Segmentation | Dec 12, 2024 | Image SegmentationSegmentation | —Unverified | 0 |
| STEAM: Squeeze and Transform Enhanced Attention Module | Dec 12, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Towards Open-Vocabulary Video Semantic Segmentation | Dec 12, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation | Dec 12, 2024 | Cross-Domain Few-ShotDomain Generalization | CodeCode Available | 2 |
| Embeddings are all you need! Achieving High Performance Medical Image Classification through Training-Free Embedding Analysis | Dec 12, 2024 | AllClassification | —Unverified | 0 |
| VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation | Dec 12, 2024 | Domain AdaptationOpen Vocabulary Semantic Segmentation | —Unverified | 0 |
| MaskTerial: A Foundation Model for Automated 2D Material Flake Detection | Dec 12, 2024 | Instance SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Automatic Image Annotation for Mapped Features Detection | Dec 11, 2024 | Autonomous DrivingImage Segmentation | —Unverified | 0 |
| A feature refinement module for light-weight semantic segmentation network | Dec 11, 2024 | SegmentationSemantic Segmentation | —Unverified | 0 |
| SegFace: Face Segmentation of Long-Tail Classes | Dec 11, 2024 | Face ParsingFace Swapping | CodeCode Available | 2 |
| Unified HT-CNNs Architecture: Transfer Learning for Segmenting Diverse Brain Tumors in MRI from Gliomas to Pediatric Tumors | Dec 11, 2024 | Brain Tumor SegmentationImage Segmentation | —Unverified | 0 |
| ConDSeg: A General Medical Image Segmentation Framework via Contrast-Driven Feature Enhancement | Dec 11, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| Utilizing Multi-step Loss for Single Image Reflection Removal | Dec 11, 2024 | Depth EstimationImage Segmentation | CodeCode Available | 0 |
| Post-Hoc MOTS: Exploring the Capabilities of Time-Symmetric Multi-Object Tracking | Dec 11, 2024 | Multi-Object TrackingObject Tracking | —Unverified | 0 |
| Structured IB: Improving Information Bottleneck with Structured Feature Learning | Dec 11, 2024 | Image SegmentationSemantic Communication | —Unverified | 0 |
| Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin | Dec 11, 2024 | Image SegmentationSemantic Segmentation | —Unverified | 0 |
| Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Dec 11, 2024 | AttributeBenchmarking | CodeCode Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | DecoderGPU | CodeCode Available | 1 |
| Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation | Dec 11, 2024 | Autonomous DrivingContrastive Learning | —Unverified | 0 |
| Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction | Dec 11, 2024 | 3D Semantic Occupancy PredictionLIDAR Semantic Segmentation | —Unverified | 0 |
| A Deep Semantic Segmentation Network with Semantic and Contextual Refinements | Dec 11, 2024 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Annotation-Efficient Task Guidance for Medical Segment Anything | Dec 11, 2024 | Computed Tomography (CT)Image Segmentation | CodeCode Available | 0 |
| Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion | Dec 11, 2024 | GPUImage Segmentation | CodeCode Available | 0 |
| Stable Mean Teacher for Semi-supervised Video Action Detection | Dec 10, 2024 | Action DetectionSemantic Segmentation | CodeCode Available | 0 |