SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 701750 of 2454 papers

TitleStatusHype
A Concise but High-performing Network for Image Guided Depth Completion in Autonomous DrivingCode1
Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian SplattingCode7
Range-Agnostic Multi-View Depth Estimation With Keyframe SelectionCode1
FoVA-Depth: Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization0
InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy PredictionCode1
Icy Moon Surface Simulation and Stereo Depth Estimation for Sampling AutonomyCode0
EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene ReconstructionCode0
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View StereoCode3
Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency0
Depth Anything: Unleashing the Power of Large-Scale Unlabeled DataCode9
TDC-less Direct Time-of-Flight Imaging Using Spiking Neural Networks0
Self-supervised Event-based Monocular Depth Estimation using Cross-modal Consistency0
Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic SurgeryCode2
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal EndoscopyCode1
InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes0
RadarCam-Depth: Radar-Camera Fusion for Depth Estimation with Learned Metric ScaleCode2
NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation0
Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping0
Denoising Vision TransformersCode3
Depth-discriminative Metric Learning for Monocular 3D Object Detection0
Flexible Depth Completion for Sparse and Varying Point Densities0
iToF-flow-based High Frame Rate Depth ImagingCode0
Diffusion Handles Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D0
Global and Hierarchical Geometry Consistency Priors for Few-shot NeRFs in Indoor ScenesCode1
Coherence As Texture - Passive Textureless 3D Reconstruction by Self-interference0
Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View StereoCode1
Manydepth2: Motion-Aware Self-Supervised Multi-Frame Monocular Depth Estimation in Dynamic ScenesCode1
Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarizationCode1
Harnessing Diffusion Models for Visual Perception with Meta PromptsCode1
Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers0
DUSt3R: Geometric 3D Vision Made EasyCode5
NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields0
PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation0
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model0
DDOS: The Drone Depth and Obstacle Segmentation Dataset0
Atlantis: Enabling Underwater Depth Estimation with Stable DiffusionCode1
Long-Tailed 3D Detection via Multi-Modal FusionCode0
From-Ground-To-Objects: Coarse-to-Fine Self-supervised Monocular Depth Estimation of Dynamic Objects with Ground Contact Prior0
CT-MVSNet: Efficient Multi-View Stereo with Cross-scale TransformerCode1
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free EnvironmentsCode2
Text2Immersion: Generative Immersive Scene with 3D Gaussians0
ReCoRe: Regularized Contrastive Representation Learning of World Model0
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text AlignmentCode1
Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning0
Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops0
GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding0
Camera-based 3D Semantic Scene Completion with Sparse Guidance NetworkCode1
Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation0
Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation0
Artificial Neural Network for Estimation of Physical Parameters of Sea Water using LiDAR Waveforms0
Show:102550
← PrevPage 15 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified