SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 2650 of 1576 papers

TitleStatusHype
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown ObjectsCode3
Geometric-aware Pretraining for Vision-centric 3D Object DetectionCode3
PETR: Position Embedding Transformation for Multi-View 3D Object DetectionCode3
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation ModelsCode3
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
A Simple Framework for 3D Occupancy Estimation in Autonomous DrivingCode2
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation ModelsCode2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object DetectionCode2
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and BeyondCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object DetectionCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
Commonsense Prototype for Outdoor Unsupervised 3D Object DetectionCode2
DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy PredictionCode2
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersCode2
Argoverse 2: Next Generation Datasets for Self-Driving Perception and ForecastingCode2
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with TransformersCode2
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object DetectionCode2
Show:102550
← PrevPage 2 of 64Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MMFusion-eNDS0.77Unverified
3MegFusionNDS0.77Unverified
4RacoonPowerNDS0.76Unverified
5BEVFusion-eNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8DAANDS0.75Unverified
9FusionVPENDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified