SOTAVerified

3D Object Detection

3D object detection is a computer vision task whose goal is to identify objects in a 3D environment and estimate their shape, location, and orientation. It involves detecting the presence of objects and determining their positions in 3D space, often in real time. The task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

(Image credit: AVOD)

Papers

Showing 701-750 of 1576 papers

| Title | Status | Hype |
| --- | --- | --- |
| DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving | | 0 |
| VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection | Code | 1 |
| CRN: Camera Radar Net for Accurate, Robust, Efficient 3D Perception | Code | 1 |
| EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Code | 2 |
| IC-FPS: Instance-Centroid Faster Point Sampling Module for 3D Point-base Object Detection | | 0 |
| SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer | Code | 1 |
| Robo3D: Towards Robust and Reliable 3D Perception against Corruptions | | 0 |
| BEVFusion4D: Learning LiDAR-Camera Fusion Under Bird's-Eye-View via Cross-Modality Guidance and Temporal Aggregation | | 0 |
| Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving | Code | 1 |
| DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking | Code | 1 |
| SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection | Code | 1 |
| LinK: Linear Kernel for LiDAR-based 3D Perception | Code | 1 |
| Learning to Zoom and Unzoom | | 0 |
| UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View | Code | 1 |
| Unsupervised Adaptation from Repeated Traversals for Autonomous Driving | Code | 0 |
| Viewpoint Equivariance for Multi-View 3D Object Detection | Code | 1 |
| BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Code | 3 |
| MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation | | 0 |
| MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training | Code | 1 |
| MonoATT: Online Monocular 3D Object Detection with Adaptive Token Transformer | | 0 |
| EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation | Code | 3 |
| OcTr: Octree-based Transformer for 3D Object Detection | | 0 |
| Spherical Transformer for LiDAR-based 3D Recognition | Code | 2 |
| Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Code | 2 |
| DR.CPO: Diversified and Realistic 3D Augmentation via Iterative Construction, Random Placement, and HPR Occlusion | Code | 0 |
| VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking | Code | 2 |
| Constructing Metric-Semantic Maps using Floor Plan Priors for Long-Term Indoor Localization | Code | 1 |
| VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object Detection | Code | 1 |
| GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding | Code | 1 |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving | Code | 0 |
| Augment and Criticize: Exploring Informative Samples for Semi-Supervised Monocular 3D Object Detection | | 0 |
| Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction | Code | 1 |
| CAPE: Camera View Position Embedding for Multi-View 3D Object Detection | Code | 1 |
| GOOD: General Optimization-based Fusion for 3D Object Detection via LiDAR-Camera Object Candidates | | 0 |
| A Simple Framework for 3D Occupancy Estimation in Autonomous Driving | Code | 2 |
| Among Us: Adversarially Robust Collaborative Perception by Consensus | Code | 1 |
| SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving | Code | 3 |
| MSF: Motion-guided Sequential Fusion for Efficient 3D Object Detection from Point Cloud Sequences | Code | 1 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Code | 2 |
| Weakly Supervised Monocular 3D Object Detection using Multi-View Projection and Direction Consistency | Code | 0 |
| BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection | Code | 2 |
| V2V4Real: A Real-world Large-scale Dataset for Vehicle-to-Vehicle Cooperative Perception | Code | 2 |
| PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection | Code | 1 |
| Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection | | 0 |
| Enhanced K-Radar: Optimal Density Reduction to Improve Detection Performance and Accessibility of 4D Radar Tensor-based Object Detection | | 0 |
| ReBound: An Open-Source 3D Bounding Box Annotation Tool for Active Learning | Code | 1 |
| Bi3D: Bi-domain Active Learning for Cross-domain 3D Object Detection | | 0 |
| Efficient Transformer-based 3D Object Detection with Dynamic Token Halting | | 0 |
| DDS3D: Dense Pseudo-Labels with Dynamic Threshold for Semi-Supervised 3D Object Detection | Code | 1 |
| Calibration-free BEV Representation for Infrastructure Perception | Code | 1 |

Page 15 of 32

Benchmark Results

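| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | EA-LSS | NDS | 0.78 | | Unverified |
| 2 | MMFusion-e | NDS | 0.77 | | Unverified |
| 3 | MegFusion | NDS | 0.77 | | Unverified |
| 4 | RacoonPower | NDS | 0.76 | | Unverified |
| 5 | BEVFusion-e | NDS | 0.76 | | Unverified |
| 6 | DeepInteraction-large | NDS | 0.76 | | Unverified |
| 7 | DeepInteraction-e | NDS | 0.76 | | Unverified |
| 8 | DAA | NDS | 0.75 | | Unverified |
| 9 | FusionVPE | NDS | 0.75 | | Unverified |
| 10 | CenterPoint-Fusion | NDS | 0.75 | | Unverified |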