SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 101150 of 1576 papers

TitleStatusHype
FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection0
UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles0
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features0
V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object DetectionCode1
RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging RadarCode1
MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception0
GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection0
Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection0
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object DetectorCode1
V2X-R: Cooperative LiDAR-4D Radar Fusion with Denoising Diffusion for 3D Object Detection0
FSHNet: Fully Sparse Hybrid Network for 3D Object Detection0
ViKIENet: Towards Efficient 3D Object Detection with Virtual Key Instance Enhanced Network0
Leveraging Temporal Cues for Semi-Supervised Multi-View 3D Object Detection0
PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram0
CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes0
TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning DistillationCode1
Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement0
TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object DetectionCode0
HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object DetectionCode0
SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object DetectionCode0
A New Adversarial Perspective for LiDAR-based 3D Object Detection0
PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts0
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion0
RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object DetectionCode1
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations0
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against CorruptionsCode1
PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud CompletionCode1
Physical Informed Driving World Model0
Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis0
Real-Time 3D Object Detection Using InnovizOne LiDAR and Low-Power Hailo-8 AI AcceleratorCode0
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object DetectionCode1
Cubify Anything: Scaling Indoor 3D Object DetectionCode3
Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure0
TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception0
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and PrunableCode0
MCBLT: Multi-Camera Multi-Object 3D Tracking in Long Videos0
Bootstraping Clustering of Gaussians for View-consistent 3D Scene UnderstandingCode1
SpaRC: Sparse Radar-Camera Fusion for 3D Object DetectionCode0
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
Open Vocabulary Monocular 3D Object DetectionCode2
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data0
VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving0
MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving0
MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection0
VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation0
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection0
Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance0
V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising DiffusionCode2
Show:102550
← PrevPage 3 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MMFusion-eNDS0.77Unverified
3MegFusionNDS0.77Unverified
4RacoonPowerNDS0.76Unverified
5BEVFusion-eNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8DAANDS0.75Unverified
9FusionVPENDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified