SOTAVerified

3D Object Detection

3D Object Detection is a computer vision task whose goal is to identify and locate objects in a 3D environment, estimating each object's shape, location, and orientation. It involves detecting the presence of objects and determining their position in 3D space, often under real-time constraints. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

(Image credit: AVOD)
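To make "shape, location, and orientation" concrete, here is a minimal sketch of one common way to represent a detected object: a 7-parameter box with a center, a size, and a heading angle about the vertical axis. The class name `Box3D`, its field names, and the axis convention are assumptions for illustration only; individual datasets and detectors define their own conventions.

```python
import math
from dataclasses import dataclass


@dataclass
class Box3D:
    """Hypothetical 7-DoF box: center (location), size (shape), yaw (orientation)."""
    x: float
    y: float
    z: float
    length: float  # extent along the heading direction
    width: float   # extent perpendicular to the heading, in the ground plane
    height: float  # extent along the vertical z axis
    yaw: float     # radians, counter-clockwise about +z (assumed convention)

    def corners(self):
        """Return the 8 corner points as (x, y, z) tuples in world coordinates."""
        c, s = math.cos(self.yaw), math.sin(self.yaw)
        pts = []
        for dx in (-0.5, 0.5):
            for dy in (-0.5, 0.5):
                for dz in (-0.5, 0.5):
                    # Offset of this corner in the box's own frame.
                    lx, ly, lz = dx * self.length, dy * self.width, dz * self.height
                    # Rotate about z by yaw, then translate to the box center.
                    pts.append((self.x + c * lx - s * ly,
                                self.y + s * lx + c * ly,
                                self.z + lz))
        return pts


# Illustrative values only: a car-sized box rotated 90 degrees.
box = Box3D(x=10.0, y=2.0, z=0.8, length=4.0, width=2.0, height=1.6, yaw=math.pi / 2)
corners = box.corners()
```

With a 90 degree yaw the box's length runs along the world y axis, so the corners span 2 m in x and 4 m in y.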

Papers

Showing 151-200 of 1576 papers

| Title | Status | Hype |
|---|---|---|
| Objects as Points | Code | 2 |
| nuScenes: A multimodal dataset for autonomous driving | Code | 2 |
| PointPillars: Fast Encoders for Object Detection from Point Clouds | Code | 2 |
| SECOND: Sparsely Embedded Convolutional Detection | Code | 2 |
| Complex-YOLO: Real-time 3D Object Detection on Point Clouds | Code | 2 |
| Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Code | 1 |
| DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Code | 1 |
| A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Code | 1 |
| Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Code | 1 |
| Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Code | 1 |
| Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection | Code | 1 |
| DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios | Code | 1 |
| State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Code | 1 |
| RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Code | 1 |
| Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Code | 1 |
| Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Code | 1 |
| SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts | Code | 1 |
| DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Code | 1 |
| FASTer: Focal Token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection | Code | 1 |
| Spiideo SoccerNet SynLoc: Single Frame World Coordinate Athlete Detection and Localization with Synthetic Data | Code | 1 |
| Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Code | 1 |
| CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Code | 1 |
| SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Code | 1 |
| CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection | Code | 1 |
| AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Code | 1 |
| V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection | Code | 1 |
| RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar | Code | 1 |
| GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Code | 1 |
| TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation | Code | 1 |
| RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection | Code | 1 |
| DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions | Code | 1 |
| PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion | Code | 1 |
| Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Code | 1 |
| Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Code | 1 |
| LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Code | 1 |
| CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Code | 1 |
| Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection | Code | 1 |
| MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Code | 1 |
| Real-time Stereo-based 3D Object Detection for Streaming Perception | Code | 1 |
| TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Code | 1 |
| LEROjD: Lidar Extended Radar-Only Object Detection | Code | 1 |
| GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection | Code | 1 |
| PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View | Code | 1 |
| A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Code | 1 |
| Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Code | 1 |
| Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection | Code | 1 |
| Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection | Code | 1 |
| InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios | Code | 1 |
| WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection | Code | 1 |
| Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble | Code | 1 |

Benchmark Results

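| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | EA-LSS | NDS | 0.78 | | Unverified |
| 2 | MMFusion-e | NDS | 0.77 | | Unverified |
| 3 | MegFusion | NDS | 0.77 | | Unverified |
| 4 | RacoonPower | NDS | 0.76 | | Unverified |
| 5 | BEVFusion-e | NDS | 0.76 | | Unverified |
| 6 | DeepInteraction-large | NDS | 0.76 | | Unverified |
| 7 | DeepInteraction-e | NDS | 0.76 | | Unverified |
| 8 | DAA | NDS | 0.75 | | Unverified |
| 9 | FusionVPE | NDS | 0.75 | | Unverified |
| 10 | CenterPoint-Fusion | NDS | 0.75 | | Unverified |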
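The page does not say which benchmark these results come from. Assuming NDS here is the nuScenes Detection Score, it combines mean average precision (mAP) with five mean true-positive error terms (translation, scale, orientation, velocity, and attribute), each clipped to 1 and converted to a score: NDS = (5 * mAP + sum of (1 - min(1, error))) / 10. The sketch below shows that arithmetic; the function name `nds` and the input values are hypothetical and are not taken from any model listed above.

```python
def nds(m_ap, tp_errors):
    """Compute the nuScenes Detection Score from mAP and five mean TP errors.

    tp_errors is expected to hold mATE, mASE, mAOE, mAVE, and mAAE, in any order.
    """
    if len(tp_errors) != 5:
        raise ValueError("expected exactly five true-positive error terms")
    # Each error is clipped to 1 and turned into a score in [0, 1].
    tp_scores = [1.0 - min(1.0, err) for err in tp_errors]
    # mAP carries weight 5; each of the five TP scores carries weight 1.
    return (5.0 * m_ap + sum(tp_scores)) / 10.0


# Hypothetical inputs, chosen only to illustrate the calculation.
score = nds(0.70, [0.25, 0.25, 0.30, 0.25, 0.15])
```

Because mAP takes half of the total weight, two models with the same NDS can still differ noticeably in localization, orientation, or velocity error.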