SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 126150 of 1576 papers

TitleStatusHype
Unifying Voxel-based Representation with Transformer for 3D Object DetectionCode2
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-trainingCode2
BEVFusion: A Simple and Robust LiDAR-Camera Fusion FrameworkCode2
BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous DrivingCode2
PillarNet: Real-Time and High-Performance Pillar-based 3D Object DetectionCode2
Surface Representation for Point CloudsCode2
TJ4DRadSet: A 4D Radar Dataset for Autonomous DrivingCode2
Focal Sparse Convolutional Networks for 3D Object DetectionCode2
DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft DatasetCode2
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving DataCode2
LiDAR Snowfall Simulation for Robust 3D Object DetectionCode2
MonoDETR: Depth-guided Transformer for Monocular 3D Object DetectionCode2
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with TransformersCode2
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision TransformerCode2
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point CloudsCode2
Sparse Fuse Dense: Towards High Quality 3D Detection with Depth CompletionCode2
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with TransformersCode2
SoftGroup for 3D Instance Segmentation on Point CloudsCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose AnnotationsCode2
Searching Efficient 3D Architectures with Sparse Point-Voxel ConvolutionCode2
Generative Sparse Detection Networks for 3D Single-shot Object DetectionCode2
Center-based 3D Object Detection and TrackingCode2
Star-convex Polyhedra for 3D Object Detection and Segmentation in MicroscopyCode2
Show:102550
← PrevPage 6 of 64Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MegFusionNDS0.77Unverified
3MMFusion-eNDS0.77Unverified
4BEVFusion-eNDS0.76Unverified
5RacoonPowerNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8FusionVPENDS0.75Unverified
9FocalFormer3D-FNDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified