| TransGOP: Transformer-Based Gaze Object Prediction | Feb 21, 2024 | Gaze EstimationObject | CodeCode Available | 1 |
| LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Feb 19, 2024 | Explainable artificial intelligenceExplainable Artificial Intelligence (XAI) | CodeCode Available | 1 |
| UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Feb 19, 2024 | Autonomous DrivingMulti-Object Tracking | CodeCode Available | 1 |
| Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Feb 19, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection | Feb 18, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Feb 17, 2024 | Diversityimage-classification | CodeCode Available | 1 |
| GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation | Feb 17, 2024 | Knowledge Distillationobject-detection | CodeCode Available | 1 |
| LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Feb 15, 2024 | Grounded Multimodal Named Entity RecognitionMulti-modal Named Entity Recognition | CodeCode Available | 1 |
| Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Feb 14, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Feb 14, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Switch EMA: A Free Lunch for Better Flatness and Sharpness | Feb 14, 2024 | Attributeimage-classification | CodeCode Available | 1 |
| AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Feb 12, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO | Feb 12, 2024 | Autonomous Vehiclesobject-detection | CodeCode Available | 1 |
| G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection | Feb 7, 2024 | Domain GeneralizationNeural Architecture Search | CodeCode Available | 1 |
| Spatio-temporal Prompting Network for Robust Video Feature Extraction | Feb 4, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous Driving | Feb 3, 2024 | Autonomous DrivingMultimodal Deep Learning | CodeCode Available | 1 |
| RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | Feb 3, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| SU-SAM: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes | Jan 31, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection | Jan 29, 2024 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 1 |
| pLitterStreet: Street Level Plastic Litter Detection and Mapping | Jan 26, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty | Jan 23, 2024 | Autonomous VehiclesObject Detection | CodeCode Available | 1 |
| Rethinking Centered Kernel Alignment in Knowledge Distillation | Jan 22, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Focaler-IoU: More Focused Intersection over Union Loss | Jan 19, 2024 | Objectobject-detection | CodeCode Available | 1 |
| BlenDA: Domain Adaptive Object Detection through diffusion-based blending | Jan 18, 2024 | Domain AdaptationImage-to-Image Translation | CodeCode Available | 1 |
| MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection | Jan 18, 2024 | Mambaobject-detection | CodeCode Available | 1 |