SOTAVerified

3D Object Detection

3D object detection is a computer vision task whose goal is to identify objects in a 3D environment and estimate each object's location, shape, and orientation. It involves detecting the presence of objects and determining where they are in 3D space, often in real time. The task is central to applications such as autonomous vehicles, robotics, and augmented reality.
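
To make the "shape, location, and orientation" output concrete, here is a minimal sketch of one common way to encode a single detection: a box with a center, three extents, and a heading angle about the vertical axis. The `Box3D` class and its field names are hypothetical, chosen for illustration only; individual datasets and papers listed on this page may use different conventions (axis order, angle sign, or full 3D rotations).

```python
import math
from dataclasses import dataclass


@dataclass
class Box3D:
    """Hypothetical 7-parameter box: center, size, and yaw about the z axis."""
    x: float       # center coordinates
    y: float
    z: float
    length: float  # extent along the heading direction
    width: float   # extent perpendicular to the heading, in the ground plane
    height: float  # extent along the vertical (z) axis
    yaw: float     # heading angle in radians, counter-clockwise about z

    def corners(self):
        """Return the 8 box corners as (x, y, z) tuples in world coordinates."""
        c, s = math.cos(self.yaw), math.sin(self.yaw)
        pts = []
        for dx in (-0.5, 0.5):
            for dy in (-0.5, 0.5):
                for dz in (-0.5, 0.5):
                    # Corner offset in the box's local frame.
                    lx, ly, lz = dx * self.length, dy * self.width, dz * self.height
                    # Rotate the offset about z, then translate to the center.
                    pts.append((self.x + c * lx - s * ly,
                                self.y + s * lx + c * ly,
                                self.z + lz))
        return pts

    def volume(self):
        """Volume of the box; independent of position and heading."""
        return self.length * self.width * self.height


# Usage: a car-sized box rotated 90 degrees about the vertical axis.
box = Box3D(x=0.0, y=0.0, z=0.0, length=4.0, width=2.0, height=1.5, yaw=math.pi / 2)
corner_points = box.corners()
```

A detector's output is then typically a list of such boxes, each paired with a class label and a confidence score.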

Papers

Showing 151-175 of 1576 papers

| Title | Status | Hype |
| --- | --- | --- |
| Objects as Points | Code | 2 |
| nuScenes: A multimodal dataset for autonomous driving | Code | 2 |
| PointPillars: Fast Encoders for Object Detection from Point Clouds | Code | 2 |
| SECOND: Sparsely Embedded Convolutional Detection | Code | 2 |
| Complex-YOLO: Real-time 3D Object Detection on Point Clouds | Code | 2 |
| Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Code | 1 |
| DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Code | 1 |
| A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Code | 1 |
| Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Code | 1 |
| Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Code | 1 |
| Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection | Code | 1 |
| DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios | Code | 1 |
| State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Code | 1 |
| RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Code | 1 |
| Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Code | 1 |
| Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Code | 1 |
| SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts | Code | 1 |
| DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Code | 1 |
| FASTer: Focal Token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection | Code | 1 |
| Spiideo SoccerNet SynLoc: Single Frame World Coordinate Athlete Detection and Localization with Synthetic Data | Code | 1 |
| Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Code | 1 |
| CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Code | 1 |
| SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset | Code | 1 |
| CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection | Code | 1 |
| AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data | Code | 1 |

Benchmark Results

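| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | EA-LSS | NDS | 0.78 | | Unverified |
| 2 | MMFusion-e | NDS | 0.77 | | Unverified |
| 3 | MegFusion | NDS | 0.77 | | Unverified |
| 4 | RacoonPower | NDS | 0.76 | | Unverified |
| 5 | BEVFusion-e | NDS | 0.76 | | Unverified |
| 6 | DeepInteraction-large | NDS | 0.76 | | Unverified |
| 7 | DeepInteraction-e | NDS | 0.76 | | Unverified |
| 8 | DAA | NDS | 0.75 | | Unverified |
| 9 | FusionVPE | NDS | 0.75 | | Unverified |
| 10 | CenterPoint-Fusion | NDS | 0.75 | | Unverified |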