| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Jun 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| BN-DRISHTI: Bangla Document Recognition through Instance-level Segmentation of Handwritten Text Images | May 31, 2023 | Handwriting RecognitionHandwritten Line Segmentation | CodeCode Available | 1 |
| Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN | May 31, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits | May 31, 2023 | 3D Point Cloud Classificationobject-detection | —Unverified | 0 |
| Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast | May 31, 2023 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 1 |
| LOWA: Localize Objects in the Wild with Attributes | May 31, 2023 | AttributeObject | —Unverified | 0 |
| Cross-Domain Car Detection Model with Integrated Convolutional Block Attention Mechanism | May 31, 2023 | Autonomous Drivingobject-detection | —Unverified | 0 |
| Multi-modal Queried Object Detection in the Wild | May 30, 2023 | Few-Shot Object DetectionObject | CodeCode Available | 2 |
| Table Detection for Visually Rich Document Images | May 30, 2023 | document understandingobject-detection | CodeCode Available | 0 |
| VVC Extension Scheme for Object Detection Using Contrast Reduction | May 30, 2023 | DecoderObject | —Unverified | 0 |
| UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving | May 30, 2023 | 3D Object Detection3D Scene Reconstruction | CodeCode Available | 2 |
| Sit Back and Relax: Learning to Drive Incrementally in All Weather Conditions | May 30, 2023 | AllAutonomous Driving | CodeCode Available | 0 |
| Generating Driving Scenes with Diffusion | May 29, 2023 | object-detectionObject Detection | —Unverified | 0 |
| PaLI-X: On Scaling up a Multilingual Vision and Language Model | May 29, 2023 | Chart Question Answeringdocument understanding | CodeCode Available | 1 |
| Fashion Object Detection for Tops & Bottoms | May 29, 2023 | Objectobject-detection | —Unverified | 0 |
| Monocular 2D Camera-based Proximity Monitoring for Human-Machine Collision Warning on Construction Sites | May 29, 2023 | 3D Object DetectionManagement | CodeCode Available | 0 |
| CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models | May 29, 2023 | DenoisingObject | CodeCode Available | 1 |
| View-to-Label: Multi-View Consistency for Self-Supervised 3D Object Detection | May 29, 2023 | 3D Object DetectionAutonomous Vehicles | —Unverified | 0 |
| VCVW-3D: A Virtual Construction Vehicles and Workers Dataset with 3D Annotations | May 29, 2023 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 0 |
| Mining Negative Temporal Contexts For False Positive Suppression In Real-Time Ultrasound Lesion Detection | May 29, 2023 | Lesion Detectionobject-detection | CodeCode Available | 1 |
| Hierarchical Neural Memory Network for Low Latency Event Processing | May 29, 2023 | Event-based visionMonocular Depth Estimation | CodeCode Available | 0 |
| Contrastive Learning Based Recursive Dynamic Multi-Scale Network for Image Deraining | May 29, 2023 | Contrastive Learningobject-detection | —Unverified | 0 |
| Contextual Object Detection with Multimodal Large Language Models | May 29, 2023 | Cloze TestDecoder | CodeCode Available | 2 |
| Lighting and Rotation Invariant Real-time Vehicle Wheel Detector based on YOLOv5 | May 28, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Real-time Object Detection: YOLOv1 Re-Implementation in PyTorch | May 28, 2023 | Objectobject-detection | —Unverified | 0 |