| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Mar 1, 2024 | Objectobject-detection | CodeCode Available | 2 |
| FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Feb 29, 2024 | 3D Object ReconstructionInstance Segmentation | CodeCode Available | 2 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Feb 26, 2024 | DecoderGPU | CodeCode Available | 2 |
| EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Feb 23, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition | Feb 22, 2024 | Image-level Supervised Instance Segmentationobject-detection | CodeCode Available | 2 |
| MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection | Feb 18, 2024 | 3D Object DetectionDataset Generation | CodeCode Available | 2 |
| YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection | Feb 14, 2024 | Fracture detectionmedical image detection | CodeCode Available | 2 |
| FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Feb 7, 2024 | Instance SegmentationObject | CodeCode Available | 2 |
| Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection | Feb 6, 2024 | 3D Object DetectionDenoising | CodeCode Available | 2 |
| YOLOPoint Joint Keypoint and Object Detection | Feb 6, 2024 | Objectobject-detection | CodeCode Available | 2 |
| Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Feb 5, 2024 | Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection | CodeCode Available | 2 |
| HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Feb 5, 2024 | Objectobject-detection | CodeCode Available | 2 |
| SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Jan 29, 2024 | CPUGPU | CodeCode Available | 2 |
| MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection | Jan 29, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection | Jan 29, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration | Jan 23, 2024 | 3D Semantic SegmentationAutonomous Driving | CodeCode Available | 2 |
| Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | Jan 22, 2024 | Document Layout AnalysisDocument Summarization | CodeCode Available | 2 |
| Removal then Selection: A Coarse-to-Fine Fusion Perspective for RGB-Infrared Object Detection | Jan 19, 2024 | Multispectral Object DetectionObject | CodeCode Available | 2 |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Jan 18, 2024 | Instance SegmentationInteractive Segmentation | CodeCode Available | 2 |
| Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Jan 17, 2024 | GPUImage Classification | CodeCode Available | 2 |
| Fine-Grained Prototypes Distillation for Few-Shot Object Detection | Jan 15, 2024 | Few-Shot Object DetectionMeta-Learning | CodeCode Available | 2 |
| MS-DETR: Efficient DETR Training with Mixed Supervision | Jan 8, 2024 | DecoderObject | CodeCode Available | 2 |
| RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAM | Jan 8, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Jan 8, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| Exploring Orthogonality in Open World Object Detection | Jan 1, 2024 | Incremental LearningObject | CodeCode Available | 2 |