| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Jun 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| BN-DRISHTI: Bangla Document Recognition through Instance-level Segmentation of Handwritten Text Images | May 31, 2023 | Handwriting RecognitionHandwritten Line Segmentation | CodeCode Available | 1 |
| Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN | May 31, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits | May 31, 2023 | 3D Point Cloud Classificationobject-detection | —Unverified | 0 |
| Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast | May 31, 2023 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 1 |
| Cross-Domain Car Detection Model with Integrated Convolutional Block Attention Mechanism | May 31, 2023 | Autonomous Drivingobject-detection | —Unverified | 0 |
| LOWA: Localize Objects in the Wild with Attributes | May 31, 2023 | AttributeObject | —Unverified | 0 |
| Multi-modal Queried Object Detection in the Wild | May 30, 2023 | Few-Shot Object DetectionObject | CodeCode Available | 2 |
| Table Detection for Visually Rich Document Images | May 30, 2023 | document understandingobject-detection | CodeCode Available | 0 |
| Sit Back and Relax: Learning to Drive Incrementally in All Weather Conditions | May 30, 2023 | AllAutonomous Driving | CodeCode Available | 0 |
| VVC Extension Scheme for Object Detection Using Contrast Reduction | May 30, 2023 | DecoderObject | —Unverified | 0 |
| UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving | May 30, 2023 | 3D Object Detection3D Scene Reconstruction | CodeCode Available | 2 |
| Fashion Object Detection for Tops & Bottoms | May 29, 2023 | Objectobject-detection | —Unverified | 0 |
| Generating Driving Scenes with Diffusion | May 29, 2023 | object-detectionObject Detection | —Unverified | 0 |
| PaLI-X: On Scaling up a Multilingual Vision and Language Model | May 29, 2023 | Chart Question Answeringdocument understanding | CodeCode Available | 1 |
| Mining Negative Temporal Contexts For False Positive Suppression In Real-Time Ultrasound Lesion Detection | May 29, 2023 | Lesion Detectionobject-detection | CodeCode Available | 1 |
| View-to-Label: Multi-View Consistency for Self-Supervised 3D Object Detection | May 29, 2023 | 3D Object DetectionAutonomous Vehicles | —Unverified | 0 |
| CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models | May 29, 2023 | DenoisingObject | CodeCode Available | 1 |
| Monocular 2D Camera-based Proximity Monitoring for Human-Machine Collision Warning on Construction Sites | May 29, 2023 | 3D Object DetectionManagement | CodeCode Available | 0 |
| Hierarchical Neural Memory Network for Low Latency Event Processing | May 29, 2023 | Event-based visionMonocular Depth Estimation | CodeCode Available | 0 |
| VCVW-3D: A Virtual Construction Vehicles and Workers Dataset with 3D Annotations | May 29, 2023 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 0 |
| Contrastive Learning Based Recursive Dynamic Multi-Scale Network for Image Deraining | May 29, 2023 | Contrastive Learningobject-detection | —Unverified | 0 |
| Contextual Object Detection with Multimodal Large Language Models | May 29, 2023 | Cloze TestDecoder | CodeCode Available | 2 |
| Lighting and Rotation Invariant Real-time Vehicle Wheel Detector based on YOLOv5 | May 28, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Real-time Object Detection: YOLOv1 Re-Implementation in PyTorch | May 28, 2023 | Objectobject-detection | —Unverified | 0 |
| Adversarial Attack On Yolov5 For Traffic And Road Sign Detection | May 27, 2023 | Adversarial Attackobject-detection | CodeCode Available | 1 |
| MLOps: A Step Forward to Enterprise Machine Learning | May 27, 2023 | object-detectionObject Detection | —Unverified | 0 |
| FishEye8K: A Benchmark and Dataset for Fisheye Camera Object Detection | May 27, 2023 | 2D Object Detectionobject-detection | CodeCode Available | 1 |
| Learning from Children: Improving Image-Caption Pretraining via Curriculum | May 27, 2023 | image-classificationImage Classification | CodeCode Available | 0 |
| Modularized Zero-shot VQA with Pre-trained Models | May 27, 2023 | object-detectionObject Detection | CodeCode Available | 0 |
| On the Importance of Backbone to the Adversarial Robustness of Object Detectors | May 27, 2023 | Adversarial RobustnessAutonomous Driving | CodeCode Available | 0 |
| Radar Enlighten the Dark: Enhancing Low-Visibility Perception for Automated Vehicles with Camera-Radar Fusion | May 27, 2023 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| DeepSeaNet: Improving Underwater Object Detection using EfficientDet | May 26, 2023 | Objectobject-detection | —Unverified | 0 |
| Rate-Distortion Theory in Coding for Machines and its Application | May 26, 2023 | Instance Segmentationobject-detection | —Unverified | 0 |
| Linear Object Detection in Document Images using Multiple Object Tracking | May 26, 2023 | Instance SegmentationMultiple Object Tracking | —Unverified | 0 |
| TFDet: Target-Aware Fusion for RGB-T Pedestrian Detection | May 26, 2023 | Multispectral Object Detectionobject-detection | CodeCode Available | 1 |
| Modulate Your Spectrum in Self-Supervised Learning | May 26, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| FSD: Fully-Specialized Detector via Neural Architecture Search | May 26, 2023 | Lesion DetectionNeural Architecture Search | —Unverified | 0 |
| Knowledge Diffusion for Distillation | May 25, 2023 | Denoisingimage-classification | CodeCode Available | 1 |
| FollowNet: A Comprehensive Benchmark for Car-Following Behavior Modeling | May 25, 2023 | Autonomous Vehiclesobject-detection | CodeCode Available | 1 |
| Investigation of UAV Detection in Images with Complex Backgrounds and Rainy Artifacts | May 25, 2023 | Benchmarkingobject-detection | CodeCode Available | 0 |
| Learning Occupancy for Monocular 3D Object Detection | May 25, 2023 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 1 |
| Leveraging object detection for the identification of lung cancer | May 25, 2023 | Computational EfficiencyMedical Image Analysis | —Unverified | 0 |
| Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos | May 25, 2023 | 3D ReconstructionObject | —Unverified | 0 |
| Language-Guided 3D Object Detection in Point Cloud for Autonomous Driving | May 25, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification | May 25, 2023 | 3D ClassificationClassification | —Unverified | 0 |
| Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance | May 25, 2023 | Decoderimage-classification | —Unverified | 0 |
| Towards Large-scale Single-shot Millimeter-wave Imaging for Low-cost Security Inspection | May 25, 2023 | Image Reconstructionobject-detection | —Unverified | 0 |
| Improved Multi-Scale Grid Rendering of Point Clouds for Radar Object Detection Networks | May 25, 2023 | Descriptiveobject-detection | —Unverified | 0 |
| RC-BEVFusion: A Plug-In Module for Radar-Camera Bird's Eye View Feature Fusion | May 25, 2023 | 3D Object Detectionobject-detection | —Unverified | 0 |