| YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors | Jul 6, 2022 | 2D Object DetectionGPU | CodeCode Available | 7 |
| YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications | Sep 7, 2022 | GPUObject Detection | CodeCode Available | 5 |
| Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks | Mar 30, 2023 | Human ParsingPedestrian Attribute Recognition | CodeCode Available | 3 |
| When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset | Jul 14, 2024 | 3D Object DetectionMultispectral Object Detection | CodeCode Available | 2 |
| UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks via Adapter Tuning | Apr 26, 2024 | Multispectral Object DetectionPedestrian Detection | CodeCode Available | 2 |
| Removal then Selection: A Coarse-to-Fine Fusion Perspective for RGB-Infrared Object Detection | Jan 19, 2024 | Multispectral Object DetectionObject | CodeCode Available | 2 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Dec 4, 2023 | 3D Human Pose EstimationAction Recognition | CodeCode Available | 2 |
| HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining | Mar 10, 2023 | AttributeAutonomous Driving | CodeCode Available | 2 |
| CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers | Mar 9, 2022 | 3D Object DetectionAutonomous Vehicles | CodeCode Available | 2 |
| F2DNet: Fast Focal Detection Network for Pedestrian Detection | Mar 4, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond | Jan 10, 2022 | AttributeAutonomous Driving | CodeCode Available | 2 |
| Detection in Crowded Scenes: One Proposal, Multiple Predictions | Mar 20, 2020 | Object DetectionPedestrian Detection | CodeCode Available | 2 |
| Focal Loss for Dense Object Detection | Aug 7, 2017 | 2D Object DetectionDense Object Detection | CodeCode Available | 2 |
| Feature Pyramid Networks for Object Detection | Dec 9, 2016 | GPUObject | CodeCode Available | 2 |
| Fast Algorithms for Convolutional Neural Networks | Sep 30, 2015 | GPUPedestrian Detection | CodeCode Available | 2 |
| Multispectral Pedestrian Detection with Sparsely Annotated Label | Jan 5, 2025 | object-detectionObject Detection | CodeCode Available | 1 |
| MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion | Aug 28, 2024 | Pedestrian Detection | CodeCode Available | 1 |
| MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection | Aug 2, 2024 | Autonomous DrivingMamba | CodeCode Available | 1 |
| AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection | May 21, 2024 | Knowledge DistillationPedestrian Detection | CodeCode Available | 1 |
| MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection | Apr 29, 2024 | Autonomous DrivingMultispectral Object Detection | CodeCode Available | 1 |
| Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments | Mar 20, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection | Mar 2, 2024 | Pedestrian Detection | CodeCode Available | 1 |
| INSANet: INtra-INter Spectral Attention Network for Effective Feature Fusion of Multispectral Pedestrian Detection | Feb 10, 2024 | Multispectral Object DetectionPedestrian Detection | CodeCode Available | 1 |
| Paved2Paradise: Cost-Effective and Scalable LiDAR Simulation by Factoring the Real World | Dec 2, 2023 | Human DetectionPedestrian Detection | CodeCode Available | 1 |
| DDAM-PS: Diligent Domain Adaptive Mixer for Person Search | Oct 31, 2023 | Domain AdaptationPedestrian Detection | CodeCode Available | 1 |
| HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information | Oct 7, 2023 | Human Detectionobject-detection | CodeCode Available | 1 |
| Nonlinear optical encoding enabled by recurrent linear scattering | Jul 17, 2023 | Data CompressionDecoder | CodeCode Available | 1 |
| TFDet: Target-Aware Fusion for RGB-T Pedestrian Detection | May 26, 2023 | Multispectral Object Detectionobject-detection | CodeCode Available | 1 |
| CARLA-BSP: a simulated dataset with pedestrians | Apr 29, 2023 | Pedestrian DetectionPose Estimation | CodeCode Available | 1 |
| VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision | Apr 6, 2023 | Autonomous DrivingPedestrian Detection | CodeCode Available | 1 |
| UniHCP: A Unified Model for Human-Centric Perceptions | Mar 6, 2023 | 2D Pose EstimationAttribute | CodeCode Available | 1 |
| MS-DETR: Multispectral Pedestrian Detection Transformer with Loosely Coupled Fusion and Modality-Balanced Optimization | Feb 1, 2023 | DecoderPedestrian Detection | CodeCode Available | 1 |
| Comparison Of Deep Object Detectors On A New Vulnerable Pedestrian Dataset | Dec 12, 2022 | Autonomous Drivingobject-detection | CodeCode Available | 1 |
| Untargeted Backdoor Attack against Object Detection | Nov 2, 2022 | Backdoor Attackimage-classification | CodeCode Available | 1 |
| CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection | Sep 27, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| LMOT: Efficient Light-Weight Detection and Tracking in Crowds | Aug 8, 2022 | 2D Object DetectionMulti-Object Tracking | CodeCode Available | 1 |
| Domain Adaptive Person Search | Jul 25, 2022 | Pedestrian DetectionPerson Re-Identification | CodeCode Available | 1 |
| 3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization | Jul 22, 2022 | Data AugmentationMultiview Detection | CodeCode Available | 1 |
| PedRecNet: Multi-task deep neural network for full 3D human pose and orientation estimation | Apr 25, 2022 | 3D Human Pose EstimationFace Recognition | CodeCode Available | 1 |
| STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes | Apr 3, 2022 | 3D Object DetectionPedestrian Detection | CodeCode Available | 1 |
| Graph Neural Networks for Cross-Camera Data Association | Jan 17, 2022 | 3D Pose EstimationGraph Matching | CodeCode Available | 1 |
| Accurate and Real-time 3D Pedestrian Detection Using an Efficient Attentive Pillar Network | Dec 31, 2021 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Cross-Modality Fusion Transformer for Multispectral Object Detection | Oct 30, 2021 | Multispectral Object DetectionObject | CodeCode Available | 1 |
| Bringing Generalization to Deep Multi-View Pedestrian Detection | Sep 24, 2021 | multi-view detectionMultiview Detection | CodeCode Available | 1 |
| LGD: Label-guided Self-distillation for Object Detection | Sep 23, 2021 | Instance SegmentationObject | CodeCode Available | 1 |
| Efficient and Effective Generation of Test Cases for Pedestrian Detection -- Search-based Software Testing of Baidu Apollo in SVL | Sep 16, 2021 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| LLVIP: A Visible-infrared Paired Dataset for Low-light Vision | Aug 24, 2021 | Image GenerationImage Registration | CodeCode Available | 1 |
| MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking? | Aug 21, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| BlockCopy: High-Resolution Video Processing with Block-Sparse Feature Propagation and Online Policies | Aug 20, 2021 | Instance SegmentationPedestrian Detection | CodeCode Available | 1 |
| MLPD: Multi-Label Pedestrian Detector in Multispectral Domain | Jul 26, 2021 | Multi-Label LearningMultispectral Object Detection | CodeCode Available | 1 |