SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 51100 of 1576 papers

TitleStatusHype
MonoCD: Monocular 3D Object Detection with Complementary DepthsCode2
Monocular 3D Object Detection with Depth from MotionCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error PriorsCode2
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal StereoCode2
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object DetectionCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
nuScenes: A multimodal dataset for autonomous drivingCode2
UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous DrivingCode2
OctFormer: Octree-based Transformers for 3D Point CloudsCode2
OneFormer3D: One Transformer for Unified Point Cloud SegmentationCode2
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object DetectionCode2
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving DataCode2
Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object DetectionCode2
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesCode2
LiDAR Snowfall Simulation for Robust 3D Object DetectionCode2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
FocalFormer3D: Focusing on Hard Instance for 3D Object DetectionCode2
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision TransformerCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object DetectionCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
Generative Sparse Detection Networks for 3D Single-shot Object DetectionCode2
LinK3D: Linear Keypoints Representation for 3D LiDAR Point CloudCode2
A Simple Framework for 3D Occupancy Estimation in Autonomous DrivingCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and BeyondCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object DetectionCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
Focal Sparse Convolutional Networks for 3D Object DetectionCode2
Fully Sparse 3D Object DetectionCode2
Fully Test-Time Adaptation for Monocular 3D Object DetectionCode2
3D Object Detection for Autonomous Driving: A Comprehensive SurveyCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view CamerasCode2
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object DetectionCode2
DeepInteraction: 3D Object Detection via Modality InteractionCode2
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object DetectionCode2
L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object DetectionCode2
BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous DrivingCode2
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation ModelsCode2
Argoverse 2: Next Generation Datasets for Self-Driving Perception and ForecastingCode2
Commonsense Prototype for Outdoor Unsupervised 3D Object DetectionCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Show:102550
← PrevPage 2 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MegFusionNDS0.77Unverified
3MMFusion-eNDS0.77Unverified
4BEVFusion-eNDS0.76Unverified
5RacoonPowerNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8FusionVPENDS0.75Unverified
9FocalFormer3D-FNDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified