| LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Feb 19, 2024 | Explainable artificial intelligence, Explainable Artificial Intelligence (XAI) | Code Available | 1 |
| Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Feb 19, 2024 | image-classification, Image Classification | Code Available | 1 |
| SDGE: Stereo Guided Depth Estimation for 360° Camera Sets | Feb 19, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Reinforcement Learning as a Parsimonious Alternative to Prediction Cascades: A Case Study on Image Segmentation | Feb 19, 2024 | Image Segmentation, object-detection | Code Available | 0 |
| UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Feb 19, 2024 | Autonomous Driving, Multi-Object Tracking | Code Available | 1 |
| LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection | Feb 18, 2024 | 3D Object Detection, object-detection | Code Available | 1 |
| A Multispectral Automated Transfer Technique (MATT) for machine-driven image labeling utilizing the Segment Anything Model (SAM) | Feb 18, 2024 | Multispectral Object Detection, object-detection | Unverified | 0 |
| MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection | Feb 18, 2024 | 3D Object Detection, Dataset Generation | Code Available | 2 |
| GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation | Feb 17, 2024 | Knowledge Distillation, object-detection | Code Available | 1 |
| ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Feb 17, 2024 | Diversity, image-classification | Code Available | 1 |
| Modular Graph Extraction for Handwritten Circuit Diagram Images | Feb 16, 2024 | object-detection, Object Detection | Unverified | 0 |
| CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes | Feb 16, 2024 | Domain Adaptation, object-detection | Code Available | 0 |
| STF: Spatio-Temporal Fusion Module for Improving Video Object Detection | Feb 16, 2024 | object-detection, Object Detection | Code Available | 0 |
| AutoGPT+P: Affordance-based Task Planning with Large Language Models | Feb 16, 2024 | object-detection, Object Detection | Unverified | 0 |
| SAWEC: Sensing-Assisted Wireless Edge Computing | Feb 15, 2024 | Benchmarking, Edge-computing | Code Available | 0 |
| LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Feb 15, 2024 | Grounded Multimodal Named Entity Recognition, Multi-modal Named Entity Recognition | Code Available | 1 |
| A Comprehensive Review on Computer Vision Analysis of Aerial Data | Feb 15, 2024 | Change Detection, object-detection | Unverified | 0 |
| Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Feb 14, 2024 | object-detection, Object Detection | Code Available | 1 |
| YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection | Feb 14, 2024 | Fracture detection, medical image detection | Code Available | 2 |
| Switch EMA: A Free Lunch for Better Flatness and Sharpness | Feb 14, 2024 | Attribute, image-classification | Code Available | 1 |
| Few-Shot Object Detection with Sparse Context Transformers | Feb 14, 2024 | Few-Shot Object Detection, Object | Unverified | 0 |
| TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Feb 14, 2024 | Instance Segmentation, object-detection | Code Available | 1 |
| Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss | Feb 13, 2024 | object-detection, Object Detection | Unverified | 0 |
| Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection | Feb 13, 2024 | Contrastive Learning, Object | Unverified | 0 |
| Object Detection in Thermal Images Using Deep Learning for Unmanned Aerial Vehicles | Feb 13, 2024 | object-detection, Object Detection | Unverified | 0 |
| AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Feb 12, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO | Feb 12, 2024 | Autonomous Vehicles, object-detection | Code Available | 1 |
| Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems | Feb 12, 2024 | GPU, object-detection | Code Available | 0 |
| A Flow-based Credibility Metric for Safety-critical Pedestrian Detection | Feb 12, 2024 | object-detection, Object Detection | Unverified | 0 |
| Semantic Object-level Modeling for Robust Visual Camera Relocalization | Feb 10, 2024 | Camera Relocalization, Object | Unverified | 0 |
| Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance | Feb 10, 2024 | Computational Efficiency, Knowledge Distillation | Code Available | 0 |
| Transfer learning with generative models for object detection on limited datasets | Feb 9, 2024 | Geophysics, Object | Unverified | 0 |
| Event-to-Video Conversion for Overhead Object Detection | Feb 9, 2024 | Object, object-detection | Unverified | 0 |
| Neural Rendering based Urban Scene Reconstruction for Autonomous Driving | Feb 9, 2024 | 3D Object Detection, 3D Reconstruction | Unverified | 0 |
| Scrapping The Web For Early Wildfire Detection: A New Annotated Dataset of Images and Videos of Smoke Plumes In-the-wild | Feb 8, 2024 | Diversity, object-detection | Unverified | 0 |
| InstaGen: Enhancing Object Detection by Training on Synthetic Dataset | Feb 8, 2024 | Object, object-detection | Unverified | 0 |
| Using YOLO v7 to Detect Kidney in Magnetic Resonance Imaging | Feb 8, 2024 | Contrastive Learning, object-detection | Unverified | 0 |
| Streamlined Hybrid Annotation Framework using Scalable Codestream for Bandwidth-Restricted UAV Object Detection | Feb 7, 2024 | Decision Making, object-detection | Unverified | 0 |
| G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection | Feb 7, 2024 | Domain Generalization, Neural Architecture Search | Code Available | 1 |
| Shape-biased Texture Agnostic Representations for Improved Textureless and Metallic Object Detection and 6D Pose Estimation | Feb 7, 2024 | 6D Pose Estimation, Object | Code Available | 0 |
| Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration | Feb 7, 2024 | 3D Object Detection, Denoising | Unverified | 0 |
| FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Feb 7, 2024 | Instance Segmentation, Object | Code Available | 2 |
| LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors | Feb 7, 2024 | image-classification, Image Classification | Unverified | 0 |
| 0-1 laws for pattern occurrences in phylogenetic trees and networks | Feb 7, 2024 | 10-shot image generation | Unverified | 0 |
| Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources | Feb 6, 2024 | 3D Object Detection, object-detection | Code Available | 0 |
| YOLOPoint Joint Keypoint and Object Detection | Feb 6, 2024 | Object, object-detection | Code Available | 2 |
| Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection | Feb 6, 2024 | 3D Object Detection, Denoising | Code Available | 2 |
| Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective | Feb 5, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| ActiveAnno3D -- An Active Learning Framework for Multi-Modal 3D Object Detection | Feb 5, 2024 | 3D Object Detection, Active Learning | Code Available | 4 |
| Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Feb 5, 2024 | Cross-Domain Few-Shot, Cross-Domain Few-Shot Object Detection | Code Available | 2 |