SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 251300 of 2454 papers

TitleStatusHype
RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis0
LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty GuidanceCode1
Scaling 4D Representations0
Flowing from Words to Pixels: A Framework for Cross-Modality Evolution0
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion0
Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation0
Prompting Depth Anything for 4K Resolution Accurate Metric Depth EstimationCode5
PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts0
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations0
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video0
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy PredictionCode1
MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance0
Cross-View Completion Models are Zero-shot Correspondence Estimators0
Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos0
T-SVG: Text-Driven Stereoscopic Video Generation0
BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation0
Utilizing Multi-step Loss for Single Image Reflection RemovalCode0
Dense Depth from Event Focal Stack0
Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation0
SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception0
On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events0
Driv3R: Learning Dense 4D Reconstruction for Autonomous DrivingCode2
Event fields: Capturing light fields at high speed, resolution, and dynamic range0
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction0
GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion0
TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-ActionCode2
PanoDreamer: Optimization-Based Single Image to 360 3D Scene With DiffusionCode2
SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB ImagesCode0
MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic VideosCode5
DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction0
MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction0
LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation0
MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction0
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos0
Dense Scene Reconstruction from Light-Field Images Affected by Rolling ShutterCode0
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models0
GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos0
Amodal Depth Anything: Amodal Depth Estimation in the Wild0
Dual Exposure Stereo for Extended Dynamic Range 3D Imaging0
Single-Shot Metric Depth from Focused Plenoptic Cameras0
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving0
AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation0
STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation0
Mutli-View 3D Reconstruction using Knowledge DistillationCode0
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation0
SpaRC: Sparse Radar-Camera Fusion for 3D Object DetectionCode0
Gaussian Splashing: Direct Volumetric Rendering Underwater0
MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications0
Video Depth without Video Models0
360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images0
Show:102550
← PrevPage 6 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified