| Multi-Scale and Detail-Enhanced Segment Anything Model for Salient Object Detection | Aug 8, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More | Aug 8, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 5 |
| Detecting Car Speed using Object Detection and Depth Estimation: A Deep Learning Framework | Aug 8, 2024 | Depth Estimationobject-detection | —Unverified | 0 |
| Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian | Aug 7, 2024 | Autonomous Drivingobject-detection | CodeCode Available | 1 |
| CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Aug 7, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Designing Extremely Memory-Efficient CNNs for On-device Vision Tasks | Aug 7, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model | Aug 7, 2024 | Image Generationobject-detection | CodeCode Available | 0 |
| PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation | Aug 7, 2024 | DecoderDense Captioning | CodeCode Available | 0 |
| Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Aug 7, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection | Aug 7, 2024 | 3D Object DetectionAutonomous Navigation | CodeCode Available | 2 |