| A Critical View of Vision-Based Long-Term Dynamics Prediction Under Environment Misalignment | May 12, 2023 | PredictionRegion Proposal | CodeCode Available | 0 |
| Event Camera as Region Proposal Network | May 1, 2023 | Region Proposal | —Unverified | 0 |
| Towards Precise Weakly Supervised Object Detection via Interactive Contrastive Learning of Context Information | Apr 27, 2023 | Contrastive LearningObject | —Unverified | 0 |
| [CLS] Token is All You Need for Zero-Shot Semantic Segmentation | Apr 13, 2023 | AllFew-Shot Semantic Segmentation | —Unverified | 0 |
| MOST: Multiple Object localization with Self-supervised Transformers for object discovery | Apr 11, 2023 | Objectobject-detection | —Unverified | 0 |
| Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA | Apr 4, 2023 | Answer GenerationLanguage Modelling | —Unverified | 0 |
| End-to-end Semantic Object Detection with Cross-Modal Alignment | Feb 10, 2023 | Contrastive Learningcross-modal alignment | —Unverified | 0 |
| s-Adaptive Decoupled Prototype for Few-Shot Object Detection | Jan 1, 2023 | Few-Shot Object DetectionMeta-Learning | —Unverified | 0 |
| Multi-level and multi-modal feature fusion for accurate 3D object detection in Connected and Automated Vehicles | Dec 15, 2022 | 3D Object Detectionobject-detection | —Unverified | 0 |
| Multimodal Query-guided Object Localization | Dec 1, 2022 | ObjectObject Localization | —Unverified | 0 |