SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 151200 of 1576 papers

TitleStatusHype
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object DetectionCode2
Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft DatasetCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object DetectionCode2
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
ARM3D: Attention-based relation module for indoor 3D object detectionCode1
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D DataCode1
FastPillars: A Deployment-friendly Pillar-based 3D DetectorCode1
FASTer: Focal Token Acquiring-and-Scaling Transformer for Long-term 3D Object DetectionCode1
FCOS3D: Fully Convolutional One-Stage Monocular 3D Object DetectionCode1
FADet: A Multi-sensor 3D Object Detection Network based on Local Featured AttentionCode1
4D-Net for Learned Multi-Modal AlignmentCode1
Far3D: Expanding the Horizon for Surround-view 3D Object DetectionCode1
Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D SceneCode1
3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object DetectionCode1
CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object DetectionCode1
Faraway-Frustum: Dealing with Lidar Sparsity for 3D Object Detection using FusionCode1
FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object DetectionCode1
Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event CamerasCode1
ES6D: A Computation Efficient and Symmetry-Aware 6D Pose Regression FrameworkCode1
Evaluating Adversarial Attacks on Driving Safety in Vision-Based Autonomous VehiclesCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
Exploring Active 3D Object Detection from a Generalization PerspectiveCode1
Enhancing 3D Object Detection with 2D Detection-Guided Query AnchorsCode1
End-to-End Pseudo-LiDAR for Image-Based 3D Object DetectionCode1
EPNet++: Cascade Bi-directional Fusion for Multi-Modal 3D Object DetectionCode1
Calibration-free BEV Representation for Infrastructure PerceptionCode1
3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance SegmentationCode1
End-To-End Optimization of LiDAR Beam Configuration for 3D Object Detection and LocalizationCode1
EPNet: Enhancing Point Features with Image Semantics for 3D Object DetectionCode1
FGR: Frustum-Aware Geometric Reasoning for Weakly Supervised 3D Vehicle DetectionCode1
EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye ViewCode1
3D Spatial Recognition without Spatially Labeled 3DCode1
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object DetectionCode1
An End-to-End Transformer Model for 3D Object DetectionCode1
3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance SegmentationCode1
SeSame: Simple, Easy 3D Object Detection with Point-Wise SemanticsCode1
Anchor-free 3D Single Stage Detector with Mask-Guided Attention for Point CloudCode1
3DRM:Pair-wise relation module for 3D object detectionCode1
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object DetectionCode1
3D Cascade RCNN: High Quality Object Detection in Point CloudsCode1
DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera ScenariosCode1
Among Us: Adversarially Robust Collaborative Perception by ConsensusCode1
3DPPE: 3D Point Positional Encoding for Transformer-based Multi-Camera 3D Object DetectionCode1
3D Bounding Box Estimation Using Deep Learning and GeometryCode1
ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels OnlyCode1
Aligning Bird-Eye View Representation of Point Cloud Sequences using Scene FlowCode1
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection TransformersCode1
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward GuidanceCode1
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic FusionCode1
Show:102550
← PrevPage 4 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MegFusionNDS0.77Unverified
3MMFusion-eNDS0.77Unverified
4BEVFusion-eNDS0.76Unverified
5RacoonPowerNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8FusionVPENDS0.75Unverified
9FocalFormer3D-FNDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified