SOTAVerified

3D Object Detection

3D Object Detection is a computer vision task in which the goal is to identify objects in a 3D scene and estimate each one's location, size, and orientation, typically as an oriented 3D bounding box. Many applications, such as autonomous vehicles, robotics, and augmented reality, need these detections in real time.
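
As a rough illustration of what "location, shape, and orientation" can mean in practice, the sketch below represents one detection as a 7-parameter box (center, dimensions, heading) and expands it into its eight corner points. The class name `Box3D`, its field names, and the axis convention (z up, yaw about the z axis) are assumptions made for this example; datasets and libraries differ on these conventions.

```python
import math
from dataclasses import dataclass


@dataclass
class Box3D:
    """Hypothetical 7-parameter 3D box: center, size, and heading."""
    x: float       # box center
    y: float
    z: float
    length: float  # extent along the heading direction
    width: float   # extent perpendicular to the heading, in the ground plane
    height: float  # extent along the z (up) axis
    yaw: float     # rotation about the z axis, in radians

    def corners(self):
        """Return the 8 corner points as (x, y, z) tuples."""
        c, s = math.cos(self.yaw), math.sin(self.yaw)
        points = []
        for dx in (-0.5, 0.5):
            for dy in (-0.5, 0.5):
                for dz in (-0.5, 0.5):
                    # Corner offset in the box's own frame.
                    lx, ly, lz = dx * self.length, dy * self.width, dz * self.height
                    # Rotate about z by yaw, then translate to the box center.
                    points.append((
                        self.x + c * lx - s * ly,
                        self.y + s * lx + c * ly,
                        self.z + lz,
                    ))
        return points


# Example: a car-sized box rotated 90 degrees about the vertical axis.
box = Box3D(x=10.0, y=2.0, z=0.8, length=4.0, width=2.0, height=1.6, yaw=math.pi / 2)
corner_points = box.corners()
```

A detector would typically output one such box per object, together with a class label and a confidence score.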


Papers

Showing 751–775 of 1576 papers

| Title | Status | Hype |
| --- | --- | --- |
| Cooperative Perception for 3D Object Detection in Driving Scenarios using Infrastructure Sensors | Code | 0 |
| Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation | Code | 0 |
| Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception | Code | 0 |
| MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection | Code | 0 |
| Multimodal 3D Object Detection from Simulated Pretraining | Code | 0 |
| MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception | Code | 0 |
| Attention-Based Depth Distillation with 3D-Aware Positional Encoding for Monocular 3D Object Detection | Code | 0 |
| Computer Vision Aided mmWave Beam Alignment in V2X Communications | Code | 0 |
| FusionRCNN: LiDAR-Camera Fusion for Two-stage 3D Object Detection | Code | 0 |
| Attentional PointNet for 3D-Object Detection in Point Clouds | Code | 0 |
| MonoSIM: Simulating Learning Behaviors of Heterogeneous Point Cloud Object Detectors for Monocular 3D Object Detection | Code | 0 |
| Comparative study of subset selection methods for rapid prototyping of 3D object detection algorithms | Code | 0 |
| KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Code | 0 |
| Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud | Code | 0 |
| MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization | Code | 0 |
| Frustum ConvNet: Sliding Frustums to Aggregate Local Point-Wise Features for Amodal 3D Object Detection | Code | 0 |
| MLVSNet: Multi-Level Voting Siamese Network for 3D Visual Tracking | Code | 0 |
| M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object Detection | Code | 0 |
| Focal Loss in 3D Object Detection | Code | 0 |
| FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection | Code | 0 |
| Monocular 2D Camera-based Proximity Monitoring for Human-Machine Collision Warning on Construction Sites | Code | 0 |
| MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection | Code | 0 |
| MDT3D: Multi-Dataset Training for LiDAR 3D Object Detection Generalization | Code | 0 |
| MEDL-U: Uncertainty-aware 3D Automatic Annotation based on Evidential Deep Learning | Code | 0 |
| FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors | Code | 0 |

Benchmark Results

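| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | EA-LSS | NDS | 0.78 | | Unverified |
| 2 | MMFusion-e | NDS | 0.77 | | Unverified |
| 3 | MegFusion | NDS | 0.77 | | Unverified |
| 4 | RacoonPower | NDS | 0.76 | | Unverified |
| 5 | BEVFusion-e | NDS | 0.76 | | Unverified |
| 6 | DeepInteraction-large | NDS | 0.76 | | Unverified |
| 7 | DeepInteraction-e | NDS | 0.76 | | Unverified |
| 8 | DAA | NDS | 0.75 | | Unverified |
| 9 | FusionVPE | NDS | 0.75 | | Unverified |
| 10 | CenterPoint-Fusion | NDS | 0.75 | | Unverified |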
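
The page does not define NDS. Assuming it denotes the nuScenes Detection Score, the metric combines mean average precision (mAP) with five mean true-positive error terms (translation, scale, orientation, velocity, attribute), each clipped to 1 and converted to a score. The sketch below computes it under that assumption; the function name `nds` and the input values are hypothetical and are not taken from any model listed here.

```python
def nds(mean_ap, tp_errors):
    """Compute a nuScenes-style detection score (assumed meaning of NDS).

    mean_ap:   mean average precision, in [0, 1]
    tp_errors: the five mean true-positive errors
               (mATE, mASE, mAOE, mAVE, mAAE); lower is better
    """
    if len(tp_errors) != 5:
        raise ValueError("expected exactly five true-positive error terms")
    # Clip each error to at most 1, then turn it into a score in [0, 1].
    tp_scores = [1.0 - min(1.0, err) for err in tp_errors]
    # mAP carries weight 5; each of the five error scores carries weight 1.
    return (5.0 * mean_ap + sum(tp_scores)) / 10.0


# Made-up inputs, purely for illustration.
example_score = nds(0.70, [0.30, 0.25, 0.40, 0.30, 0.15])
```

Because mAP accounts for half of the total weight, two models with similar mAP can still differ in NDS if one estimates box orientation or velocity noticeably better.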