SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 151200 of 2454 papers

TitleStatusHype
Behind the Scenes: Density Fields for Single View ReconstructionCode2
EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic CameraCode2
DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving ApplicationsCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Enforcing geometric constraints of virtual normal for depth predictionCode2
MonoCD: Monocular 3D Object Detection with Complementary DepthsCode2
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesCode2
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth EstimationCode2
MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D QueriesCode2
TIPS: Text-Image Pretraining with Spatial AwarenessCode2
Deeper into Self-Supervised Monocular Indoor Depth EstimationCode1
Deep Multi-view Depth Estimation with Predicted UncertaintyCode1
Deeper Depth Prediction with Fully Convolutional Residual NetworksCode1
DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and TrackingCode1
Deep Depth Estimation from Visual-Inertial SLAMCode1
A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view StereoCode1
Deep Multi-View Stereo gone wildCode1
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and ChallengesCode1
A Concise but High-performing Network for Image Guided Depth Completion in Autonomous DrivingCode1
Deep Depth From FocusCode1
Domain Adaptive Semantic Segmentation with Self-Supervised Depth EstimationCode1
DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost VolumeCode1
Deep Color Mismatch Correction In Stereoscopic 3D ImagesCode1
Always Clear Depth: Robust Monocular Depth Estimation under Adverse WeatherCode1
DiverseDepth: Affine-invariant Depth Prediction Using Diverse DataCode1
Deconstructing Self-Supervised Monocular Reconstruction: The Design Decisions that MatterCode1
altiro3D: Scene representation from single image and novel view synthesisCode1
All in Tokens: Unifying Output Space of Visual Tasks via Soft TokenCode1
DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine DomainCode1
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual PerceptionCode1
2.5D Visual Relationship DetectionCode1
DaRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth AdaptationCode1
Does it work outside this benchmark? Introducing the Rigid Depth Constructor tool, depth validation dataset construction in rigid scenes for the massesCode1
Dual Pixel Exploration: Simultaneous Depth Estimation and Image RestorationCode1
A Light and Tuning-free Method for Simulating Camera Motion in Video GenerationCode1
Cylin-Painting: Seamless 360 Panoramic Image Outpainting and BeyondCode1
Disparity Estimation Using a Quad-Pixel SensorCode1
AI Playground: Unreal Engine-based Data Ablation Tool for Deep LearningCode1
Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular DepthCode1
Automated Distance Estimation for Wildlife Camera TrappingCode1
A Hybrid Sparse-Dense Monocular SLAM System for Autonomous DrivingCode1
CutDepth:Edge-aware Data Augmentation in Depth EstimationCode1
Discrete Cosine Transform Network for Guided Depth Map Super-ResolutionCode1
Curvature-guided dynamic scale networks for Multi-view StereoCode1
ACDNet: Adaptively Combined Dilated Convolution for Monocular Panorama Depth EstimationCode1
DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic ModelsCode1
Discrete Time Convolution for Fast Event-Based StereoCode1
A geometry-aware deep network for depth estimation in monocular endoscopyCode1
A General and Adaptive Robust Loss FunctionCode1
Cross-modal transformers for infrared and visible image fusionCode1
Show:102550
← PrevPage 4 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified