SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 351400 of 1576 papers

TitleStatusHype
MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues0
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object DetectionCode1
DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation0
MonoCD: Monocular 3D Object Detection with Complementary DepthsCode2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view CamerasCode2
Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection0
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance FieldsCode2
Weak-to-Strong 3D Object Detection with X-Ray DistillationCode0
Accurate Cutting-point Estimation for Robotic Lychee Harvesting through Geometry-aware Learning0
VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object DetectionCode1
SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large ObjectsCode2
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality PropagationCode2
CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation0
SSF3D: Strict Semi-Supervised 3D Object Detection with Switching Filter0
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection0
UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain GapsCode1
Is Your LiDAR Placement Optimized for 3D Scene Understanding?Code2
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object DetectionCode3
Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks0
Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection0
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object DetectionCode3
CR3DT: Camera-RADAR Fusion for 3D Detection and TrackingCode1
3D Object Detection from Point Cloud via Voting Step DiffusionCode0
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban EnvironmentsCode1
SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model0
EffiPerception: an Efficient Framework for Various Perception Tasks0
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection0
Just Add $100 More: Augmenting NeRF-based Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem0
V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions0
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception0
SimPB: A Single Model for 2D and 3D Object Detection from Multiple CamerasCode1
RCooper: A Real-world Large-scale Dataset for Roadside Cooperative PerceptionCode2
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest0
Improving Distant 3D Object Detection Using 2D Box Supervision0
CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow0
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object DetectionCode0
SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection0
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D PerceptionCode1
LISO: Lidar-only Self-Supervised 3D Object DetectionCode2
SeSame: Simple, Easy 3D Object Detection with Point-Wise SemanticsCode1
Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object DetectionCode1
Enhancing 3D Object Detection with 2D Detection-Guided Query AnchorsCode1
Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving0
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object DetectionCode2
RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR FeaturesCode1
LVIC: Multi-modality segmentation by Lifting Visual Info as Cue0
ActFormer: Scalable Collaborative Perception via Active Queries0
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view ImagesCode1
CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection0
Show:102550
← PrevPage 8 of 32Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MegFusionNDS0.77Unverified
3MMFusion-eNDS0.77Unverified
4BEVFusion-eNDS0.76Unverified
5RacoonPowerNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8FusionVPENDS0.75Unverified
9FocalFormer3D-FNDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified