Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 2454 papers

Title	Date	Tasks	Status	Hype	Score
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion	Feb 23, 2023	3D geometry3D Semantic Scene Completion	CodeCode Available	3	5
Unifying Flow, Stereo and Depth Estimation	Nov 10, 2022	Depth EstimationOptical Flow Estimation	CodeCode Available	3	5
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation	Mar 27, 2024	Depth EstimationDepth Prediction	CodeCode Available	3	5
DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Nov 26, 2024	Camera CalibrationDepth Estimation	CodeCode Available	3	5
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video	Mar 27, 2025	Camera Pose EstimationDepth Estimation	CodeCode Available	3	5
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?	Mar 10, 2024	Depth EstimationImage Matting	CodeCode Available	3	5
SimpleRecon: 3D Reconstruction Without 3D Convolutions	Aug 31, 2022	3D ReconstructionDepth Estimation	CodeCode Available	3	5
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Jan 9, 2025	Depth EstimationMonocular Depth Estimation	CodeCode Available	3	5
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos	May 3, 2024	Depth EstimationDepth Prediction	CodeCode Available	3	5
RoadBEV: Road Surface Reconstruction in Bird's Eye View	Apr 9, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	3	5
Towards Accurate Reconstruction of 3D Scene Shape from A Single Monocular Image	Aug 28, 2022	Depth EstimationDepth Prediction	CodeCode Available	3	5
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera	Jan 5, 2025	Data AugmentationDepth Estimation	CodeCode Available	3	5
Denoising Vision Transformers	Jan 5, 2024	DenoisingDepth Estimation	CodeCode Available	3	5
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation	Dec 4, 2023	Depth Estimation	CodeCode Available	3	5
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Jan 16, 2025	Depth EstimationDisparity Estimation	CodeCode Available	3	5
PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Oct 29, 2024	3DGS3D Reconstruction	CodeCode Available	3	5
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer	Jul 2, 2019	Depth EstimationMonocular Depth Estimation	CodeCode Available	3	5
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection	Jul 27, 2023	3D geometry3D Object Detection	CodeCode Available	2	5
Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motion	Aug 15, 2020	Depth EstimationMotion Estimation	CodeCode Available	2	5
CompletionFormer: Depth Completion with Convolutions and Vision Transformers	Apr 25, 2023	Depth CompletionDepth Estimation	CodeCode Available	2	5
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection	Jun 21, 2022	3D Object DetectionDepth Estimation	CodeCode Available	2	5
MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries	May 2, 2022	Autonomous DrivingDepth Estimation	CodeCode Available	2	5
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation	Mar 3, 2022	DecoderDepth Estimation	CodeCode Available	2	5
Behind the Scenes: Density Fields for Single View Reconstruction	Jan 18, 2023	Depth EstimationDepth Prediction	CodeCode Available	2	5
Multi-Task Learning as Multi-Objective Optimization	Oct 10, 2018	Depth EstimationGeneral Classification	CodeCode Available	2	5
Consistent Video Depth Estimation	Apr 30, 2020	Depth EstimationMonocular Reconstruction	CodeCode Available	2	5
MultiMAE: Multi-modal Multi-task Masked Autoencoders	Apr 4, 2022	Depth Estimationimage-classification	CodeCode Available	2	5
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments	Dec 14, 2023	Autonomous DrivingDepth Estimation	CodeCode Available	2	5
MonoCD: Monocular 3D Object Detection with Complementary Depths	Apr 4, 2024	3D Object DetectionDepth Estimation	CodeCode Available	2	5
Monocular 3D Object Detection with Depth from Motion	Jul 26, 2022	3D Object DetectionDepth Estimation	CodeCode Available	2	5
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors	Oct 25, 2024	3D Object DetectionDepth Estimation	CodeCode Available	2	5
A Unified Image-Dense Annotation Generation Model for Underwater Scenes	Mar 27, 2025	Depth EstimationPrediction	CodeCode Available	2	5
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation	Nov 23, 2022	Depth EstimationMonocular Depth Estimation	CodeCode Available	2	5
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo	Sep 21, 2022	3D Object DetectionDepth Estimation	CodeCode Available	2	5
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV	Mar 3, 2024	Depth EstimationMonocular Depth Estimation	CodeCode Available	2	5
Learning to Recover 3D Scene Shape from a Single Image	Dec 17, 2020	3D Scene ReconstructionDepth Estimation	CodeCode Available	2	5
Map-free Visual Relocalization: Metric Pose Relative to a Single Image	Oct 11, 2022	Depth EstimationDepth Prediction	CodeCode Available	2	5
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation	Jul 19, 2024	Data AugmentationDepth Estimation	CodeCode Available	2	5
On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR	Nov 1, 2024	3D Semantic SegmentationAutonomous Driving	CodeCode Available	2	5
HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Apr 30, 2025	Depth EstimationScene Generation	CodeCode Available	2	5
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis	Dec 4, 2023	2kDepth Estimation	CodeCode Available	2	5
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors	Jul 26, 2024	Depth EstimationGPU	CodeCode Available	2	5
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Jun 18, 2024	BenchmarkingDepth Estimation	CodeCode Available	2	5
GeoMVSNet: Learning Multi-View Stereo With Geometry Perception	Jan 1, 2023	3D ReconstructionDepth Estimation	CodeCode Available	2	5
A Survey on RGB-D Datasets	Jan 15, 2022	Depth EstimationMonocular Depth Estimation	CodeCode Available	2	5
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation	Apr 3, 2023	3D Object DetectionAutonomous Driving	CodeCode Available	2	5
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Jul 15, 2024	DenoisingDepth Estimation	CodeCode Available	2	5
Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning	Apr 4, 2024	3D Scene ReconstructionDepth Estimation	CodeCode Available	2	5
Enforcing geometric constraints of virtual normal for depth prediction	Jul 29, 2019	Depth EstimationDepth Prediction	CodeCode Available	2	5
Atlas: End-to-End 3D Scene Reconstruction from Posed Images	Mar 23, 2020	3D Reconstruction3D Scene Reconstruction	CodeCode Available	2	5

Show:10 25 50

← PrevPage 2 of 50Next →

All datasets Stanford2D3D Panoramic NYU-Depth V2 DCM eBDtheque ScanNetV2 Cityscapes test DIODE KITTI 2015 Mars DTM Estimation ScanNet 4D Light Field Dataset KITTI Eigen split

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	OmniDepth	RMSE	0.62	—	Unverified
2	SphereDepth	RMSE	0.45	—	Unverified
3	Jin et al.	RMSE	0.42	—	Unverified
4	BiFuse with fusion	RMSE	0.41	—	Unverified
5	HoHoNet (ResNet-101)	RMSE	0.38	—	Unverified
6	PanoDepth	RMSE	0.37	—	Unverified
7	BiFuse++	RMSE	0.37	—	Unverified
8	UniFuse with fusion	RMSE	0.37	—	Unverified
9	DisConv	RMSE	0.37	—	Unverified
10	SliceNet	RMSE	0.37	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	A2J	mAP	8.61	—	Unverified
2	PAD-Net	RMS	0.79	—	Unverified
3	MS-CRF	RMS	0.59	—	Unverified
4	DORN	RMS	0.51	—	Unverified
5	Freeform	RMS	0.43	—	Unverified
6	Optimized, freeform	RMS	0.43	—	Unverified
7	VNL	RMS	0.42	—	Unverified
8	BTS	RMS	0.41	—	Unverified
9	TransDepth (AGD+ ViT)	RMS	0.37	—	Unverified
10	AdaBins	RMS	0.36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T2Net	Abs Rel	0.35	—	Unverified
2	MIDAS	Abs Rel	0.31	—	Unverified
3	Bhattacharjee et al.	Abs Rel	0.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T2Net	Abs Rel	0.49	—	Unverified
2	MIDAS	Abs Rel	0.42	—	Unverified
3	Bhattacharjee et al.	Abs Rel	0.38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeReS	absolute relative error	0.1	—	Unverified
2	DELTAS	absolute relative error	0.09	—	Unverified
3	Distill Any Depth	absolute relative error	0.04	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SDC-Depth	RMSE	6.92	—	Unverified
2	SwinMTL	RMSE	6.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AIP-Brown	Delta < 1.25	0.36	—	Unverified
2	LeRes	Delta < 1.25	0.23	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	H-Net (Ours)	Absolute relative error (AbsRel)	0.09	—	Unverified
2	H-Net (Ours) Full Eigen	Absolute relative error (AbsRel)	0.08	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLPDepth	Delta < 1.25	0.43	—	Unverified
2	SRDINET (Model A)	Delta < 1.25	0.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Atlas (finetuned)	RMSE	0.17	—	Unverified
2	Atlas (plain)	RMSE	0.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LFattNet	BadPix(0.01)	17.23	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LightDepth	Number of parameters (M)	42.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniFuse	Abs Rel	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-TC (Cross-Task Consistency)	L1 error	1.63	—	Unverified