| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Dec 3, 2023 | Caption GenerationDecoder | CodeCode Available | 0 |
| G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training | Dec 3, 2023 | object-detectionObject Detection | CodeCode Available | 0 |
| A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | Dec 3, 2023 | Active LearningInstance Segmentation | —Unverified | 0 |
| A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing | Dec 3, 2023 | Autonomous NavigationData Augmentation | —Unverified | 0 |
| T3D: Advancing 3D Medical Vision-Language Pre-training by Learning Multi-View Visual Consistency | Dec 3, 2023 | Clinical KnowledgeContrastive Learning | —Unverified | 0 |
| TranSegPGD: Improving Transferability of Adversarial Examples on Semantic Segmentation | Dec 3, 2023 | Adversarial Attackimage-classification | —Unverified | 0 |
| Semantic segmentation of SEM images of lower bainitic and tempered martensitic steels | Dec 2, 2023 | Deep LearningSemantic Segmentation | —Unverified | 0 |
| Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels | Dec 2, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything | Dec 1, 2023 | Decoderimage-classification | CodeCode Available | 4 |
| Improve Supervised Representation Learning with Masked Image Modeling | Dec 1, 2023 | DecoderImage Retrieval | —Unverified | 0 |
| Towards Generalizable Referring Image Segmentation via Target Prompt and Visual Coherence | Dec 1, 2023 | Image SegmentationSemantic Segmentation | —Unverified | 0 |
| SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers | Dec 1, 2023 | DecoderObject | CodeCode Available | 1 |
| Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Dec 1, 2023 | Image RetrievalObject Localization | CodeCode Available | 1 |
| A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing | Dec 1, 2023 | AllSemantic Segmentation | —Unverified | 0 |
| Improving Normalization with the James-Stein Estimator | Dec 1, 2023 | 3D Object Classificationimage-classification | —Unverified | 0 |
| A Recent Survey of Vision Transformers for Medical Image Segmentation | Dec 1, 2023 | Image SegmentationInductive Bias | —Unverified | 0 |
| Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Dec 1, 2023 | Decoderobject-detection | CodeCode Available | 1 |
| SCHEME: Scalable Channel Mixer for Vision Transformers | Dec 1, 2023 | image-classificationImage Classification | —Unverified | 0 |
| Generative Parameter-Efficient Fine-Tuning | Dec 1, 2023 | Arithmetic ReasoningFine-Grained Image Classification | CodeCode Available | 1 |
| CellMixer: Annotation-free Semantic Cell Segmentation of Heterogeneous Cell Populations | Dec 1, 2023 | Cell SegmentationInstance Segmentation | —Unverified | 0 |
| Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Dec 1, 2023 | Contrastive LearningFew-Shot Learning | CodeCode Available | 3 |
| Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals | Dec 1, 2023 | Image SegmentationLanguage Modeling | —Unverified | 0 |
| Learning Part Segmentation from Synthetic Animals | Nov 30, 2023 | Domain AdaptationPseudo Label | —Unverified | 0 |
| InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation | Nov 30, 2023 | Image CaptioningReferring Expression | CodeCode Available | 0 |
| SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation | Nov 30, 2023 | Objectobject-detection | —Unverified | 0 |