SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 501550 of 1576 papers

TitleStatusHype
HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point CloudsCode1
Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision0
ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object DetectionCode0
DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection0
MVFAN: Multi-View Feature Assisted Network for 4D Radar Object Detection0
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object DetectionCode3
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object DetectionCode1
Pre-Training LiDAR-Based 3D Object Detectors Through ColorizationCode0
Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS0
EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye ViewCode1
RTNH+: Enhanced 4D Radar Object Detection Network using Combined CFAR-based Two-level Preprocessing and Vertical Encoding0
MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation CoefficientCode1
Towards Generalizable Multi-Camera 3D Object Detection via Perspective DebiasingCode1
Open-CRB: Towards Open World Active Learning for 3D Object DetectionCode1
Multimodal Object Query Initialization for 3D Object Detection0
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training ParadigmCode2
UniPAD: A Universal Pre-training Paradigm for Autonomous DrivingCode2
GraphAlign: Enhancing Accurate Feature Alignment by Graph matching for Multi-Modal 3D Object Detection0
Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous DrivingCode1
3DS-SLAM: A 3D Object Detection based Semantic SLAM towards Dynamic Indoor Environments0
V2X-AHD:Vehicle-to-Everything Cooperation Perception via Asymmetric Heterogenous Distillation NetworkCode0
Uni3DETR: Unified 3D Detection TransformerCode1
CoBEVFusion: Cooperative Perception with LiDAR-Camera Bird's-Eye View Fusion0
Towards Fair and Comprehensive Comparisons for Image-Based 3D Object DetectionCode0
Anyview: Generalizable Indoor 3D Object Detection with Variable Frames0
Care3D: An Active 3D Object Detection Dataset of Real Robotic-Care EnvironmentsCode0
Rotation Matters: Generalized Monocular 3D Object Detection for Various Camera Systems0
Joint object detection and re-identification for 3D obstacle multi-camera systems0
QE-BEV: Query Evolution for Bird's Eye View Object Detection in Varied ContextsCode0
Towards Long-Range 3D Object Detection for Autonomous Vehicles0
WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object DetectionCode1
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object DetectionCode2
MagicDrive: Street View Generation with Diverse 3D Geometry ControlCode3
CoBEV: Elevating Roadside 3D Object Detection with Depth and Height ComplementarityCode1
Every Dataset Counts: Scaling up Monocular 3D Object Detection with Joint Datasets Training0
Towards Robust 3D Object Detection In Rainy Conditions0
Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection0
LS-VOS: Identifying Outliers in 3D Object Detections Using Latent Space Virtual Outlier Synthesis0
MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings0
Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature AugmentationCode1
LEF: Late-to-Early Temporal Fusion for LiDAR 3D Object Detection0
BEVHeight++: Toward Robust Visual Centric 3D Object Detection0
M^2SODAI: Multi-Modal Maritime Object Detection Dataset With RGB and Hyperspectral Image Sensors0
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge DistillationCode1
UniBEV: Multi-modal 3D Object Detection with Uniform BEV Encoders for Robustness against Missing Sensor ModalitiesCode1
Towards Robust Robot 3D Perception in Urban Environments: The UT Campus Object DatasetCode1
Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile DatasetCode1
MonoUNI: A Unified Vehicle and Infrastructure-side Monocular 3D Object Detection Network with Sufficient Depth CluesCode1
Unsupervised Domain Adaptation for Self-Driving from Past Traversal FeaturesCode0
FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object DetectionCode1
Show:102550
← PrevPage 11 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MegFusionNDS0.77Unverified
3MMFusion-eNDS0.77Unverified
4BEVFusion-eNDS0.76Unverified
5RacoonPowerNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8FusionVPENDS0.75Unverified
9FocalFormer3D-FNDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified