| TransGOP: Transformer-Based Gaze Object Prediction | Feb 21, 2024 | Gaze Estimation, Object | Code Available | 1 |
| UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Feb 19, 2024 | Autonomous Driving, Multi-Object Tracking | Code Available | 1 |
| LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Feb 19, 2024 | Explainable artificial intelligence, Explainable Artificial Intelligence (XAI) | Code Available | 1 |
| Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Feb 19, 2024 | image-classification, Image Classification | Code Available | 1 |
| LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection | Feb 18, 2024 | 3D Object Detection, object-detection | Code Available | 1 |
| GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation | Feb 17, 2024 | Knowledge Distillation, object-detection | Code Available | 1 |
| ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Feb 17, 2024 | Diversity, image-classification | Code Available | 1 |
| LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Feb 15, 2024 | Grounded Multimodal Named Entity Recognition, Multi-modal Named Entity Recognition | Code Available | 1 |
| Switch EMA: A Free Lunch for Better Flatness and Sharpness | Feb 14, 2024 | Attribute, image-classification | Code Available | 1 |
| TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Feb 14, 2024 | Instance Segmentation, object-detection | Code Available | 1 |
| Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Feb 14, 2024 | object-detection, Object Detection | Code Available | 1 |
| MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO | Feb 12, 2024 | Autonomous Vehicles, object-detection | Code Available | 1 |
| AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Feb 12, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection | Feb 7, 2024 | Domain Generalization, Neural Architecture Search | Code Available | 1 |
| Spatio-temporal Prompting Network for Robust Video Feature Extraction | Feb 4, 2024 | Instance Segmentation, object-detection | Code Available | 1 |
| RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | Feb 3, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous Driving | Feb 3, 2024 | Autonomous Driving, Multimodal Deep Learning | Code Available | 1 |
| SU-SAM: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes | Jan 31, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| SGV3D: Towards Scenario Generalization for Vision-based Roadside 3D Object Detection | Jan 29, 2024 | 3D Object Detection, Autonomous Vehicles | Code Available | 1 |
| pLitterStreet: Street Level Plastic Litter Detection and Mapping | Jan 26, 2024 | object-detection, Object Detection | Code Available | 1 |
| MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty | Jan 23, 2024 | Autonomous Vehicles, Object Detection | Code Available | 1 |
| Rethinking Centered Kernel Alignment in Knowledge Distillation | Jan 22, 2024 | image-classification, Image Classification | Code Available | 1 |
| Focaler-IoU: More Focused Intersection over Union Loss | Jan 19, 2024 | Object, object-detection | Code Available | 1 |
| BlenDA: Domain Adaptive Object Detection through diffusion-based blending | Jan 18, 2024 | Domain Adaptation, Image-to-Image Translation | Code Available | 1 |
| MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection | Jan 18, 2024 | Mamba, object-detection | Code Available | 1 |
| Trapped in texture bias? A large scale comparison of deep instance segmentation | Jan 17, 2024 | Data Augmentation, Instance Segmentation | Code Available | 1 |
| SAMF: Small-Area-Aware Multi-focus Image Fusion for Object Detection | Jan 16, 2024 | Multi Focus Image Fusion, object-detection | Code Available | 1 |
| DCDet: Dynamic Cross-based 3D Object Detector | Jan 14, 2024 | 3D Object Detection, Object | Code Available | 1 |
| Improving the Detection of Small Oriented Objects in Aerial Images | Jan 12, 2024 | Instance Segmentation, object-detection | Code Available | 1 |
| CLIP-Guided Source-Free Object Detection in Aerial Images | Jan 10, 2024 | Domain Adaptation, Object | Code Available | 1 |
| Generic Knowledge Boosted Pre-training For Remote Sensing Images | Jan 9, 2024 | Change Detection, Deep Learning | Code Available | 1 |
| A Flying Bird Object Detection Method for Surveillance Video | Jan 8, 2024 | Object, object-detection | Code Available | 1 |
| What, How, and When Should Object Detectors Update in Continually Changing Test Domains? | Jan 1, 2024 | object-detection, Object Detection | Code Available | 1 |
| Dispel Darkness for Better Fusion: A Controllable Visual Enhancer based on Cross-modal Conditional Adversarial Learning | Jan 1, 2024 | object-detection, Object Detection | Code Available | 1 |
| Towards Robust 3D Object Detection with LiDAR and 4D Radar Fusion in Various Weather Conditions | Jan 1, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training | Jan 1, 2024 | Adversarial Attack, image-classification | Code Available | 1 |
| CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images | Jan 1, 2024 | 3D Object Detection, 3D Reconstruction | Code Available | 1 |
| PairDETR: Joint Detection and Association of Human Bodies and Faces | Jan 1, 2024 | Object, object-detection | Code Available | 1 |
| Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes | Jan 1, 2024 | object-detection, Object Detection | Code Available | 1 |
| CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection | Jan 1, 2024 | 3D Object Detection, Knowledge Distillation | Code Available | 1 |
| Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective | Jan 1, 2024 | Decoder, object-detection | Code Available | 1 |
| HINTED: Hard Instance Enhanced Detector with Mixed-Density Feature Fusion for Sparsely-Supervised 3D Object Detection | Jan 1, 2024 | 3D Object Detection, object-detection | Code Available | 1 |
| Referring Expression Counting | Jan 1, 2024 | 8k, object-detection | Code Available | 1 |
| Shape-IoU: More Accurate Metric considering Bounding Box Shape and Scale | Dec 29, 2023 | object-detection, Object Detection | Code Available | 1 |
| DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection | Dec 25, 2023 | 3D Object Detection, object-detection | Code Available | 1 |
| MonoLSS: Learnable Sample Selection For Monocular 3D Detection | Dec 22, 2023 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection | Dec 22, 2023 | Attribute, object-detection | Code Available | 1 |
| DECO: Query-Based End-to-End Object Detection with ConvNets | Dec 21, 2023 | Decoder, Object | Code Available | 1 |
| Universal Noise Annotation: Unveiling the Impact of Noisy annotation on Object Detection | Dec 21, 2023 | image-classification, Image Classification | Code Available | 1 |
| SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection | Dec 21, 2023 | 3D Object Detection, object-detection | Code Available | 1 |