Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 2454 papers

Title	Date	Tasks	Status	Hype
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video	Mar 27, 2025	Camera Pose EstimationDepth Estimation	CodeCode Available	3
RoadBEV: Road Surface Reconstruction in Bird's Eye View	Apr 9, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	3
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation	Mar 27, 2024	Depth EstimationDepth Prediction	CodeCode Available	3
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image	Jun 6, 2024	3D Scene ReconstructionDepth Estimation	CodeCode Available	3
DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting	Nov 26, 2024	Camera CalibrationDepth Estimation	CodeCode Available	3
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Jan 9, 2025	Depth EstimationMonocular Depth Estimation	CodeCode Available	3
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos	May 3, 2024	Depth EstimationDepth Prediction	CodeCode Available	3
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion	Feb 23, 2023	3D geometry3D Semantic Scene Completion	CodeCode Available	3
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?	Mar 10, 2024	Depth EstimationImage Matting	CodeCode Available	3
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation	Dec 4, 2023	Depth Estimation	CodeCode Available	3
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo	Jan 22, 2024	3D ReconstructionDepth Estimation	CodeCode Available	3
PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Oct 29, 2024	3DGS3D Reconstruction	CodeCode Available	3
Denoising Vision Transformers	Jan 5, 2024	DenoisingDepth Estimation	CodeCode Available	3
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Jan 16, 2025	Depth EstimationDisparity Estimation	CodeCode Available	3
LiftFeat: 3D Geometry-Aware Local Feature Matching	May 6, 2025	3D geometryDepth Estimation	CodeCode Available	3
MUSt3R: Multi-view Network for Stereo 3D Reconstruction	Mar 3, 2025	3D ReconstructionArticles	CodeCode Available	3
SimpleRecon: 3D Reconstruction Without 3D Convolutions	Aug 31, 2022	3D ReconstructionDepth Estimation	CodeCode Available	3
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Oct 31, 2024	3D Object DetectionDepth Estimation	CodeCode Available	2
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation	Apr 3, 2022	DecoderDepth Estimation	CodeCode Available	2
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists	Sep 30, 2023	Depth EstimationImage Generation	CodeCode Available	2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors	Jul 26, 2024	Depth EstimationGPU	CodeCode Available	2
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation	Jul 15, 2024	DenoisingDepth Estimation	CodeCode Available	2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth Estimation	Apr 3, 2023	3D Object DetectionAutonomous Driving	CodeCode Available	2
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo	Sep 21, 2022	3D Object DetectionDepth Estimation	CodeCode Available	2
GeoMVSNet: Learning Multi-View Stereo With Geometry Perception	Jan 1, 2023	3D ReconstructionDepth Estimation	CodeCode Available	2

Show:10 25 50

← PrevPage 3 of 99Next →

All datasets Stanford2D3D Panoramic NYU-Depth V2 DCM eBDtheque ScanNetV2 Cityscapes test DIODE KITTI 2015 Mars DTM Estimation ScanNet 4D Light Field Dataset KITTI Eigen split

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	OmniDepth	RMSE	0.62	—	Unverified
2	SphereDepth	RMSE	0.45	—	Unverified
3	Jin et al.	RMSE	0.42	—	Unverified
4	BiFuse with fusion	RMSE	0.41	—	Unverified
5	HoHoNet (ResNet-101)	RMSE	0.38	—	Unverified
6	PanoDepth	RMSE	0.37	—	Unverified
7	BiFuse++	RMSE	0.37	—	Unverified
8	UniFuse with fusion	RMSE	0.37	—	Unverified
9	DisConv	RMSE	0.37	—	Unverified
10	SliceNet	RMSE	0.37	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	A2J	mAP	8.61	—	Unverified
2	PAD-Net	RMS	0.79	—	Unverified
3	MS-CRF	RMS	0.59	—	Unverified
4	DORN	RMS	0.51	—	Unverified
5	Freeform	RMS	0.43	—	Unverified
6	Optimized, freeform	RMS	0.43	—	Unverified
7	VNL	RMS	0.42	—	Unverified
8	BTS	RMS	0.41	—	Unverified
9	TransDepth (AGD+ ViT)	RMS	0.37	—	Unverified
10	AdaBins	RMS	0.36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T2Net	Abs Rel	0.35	—	Unverified
2	MIDAS	Abs Rel	0.31	—	Unverified
3	Bhattacharjee et al.	Abs Rel	0.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T2Net	Abs Rel	0.49	—	Unverified
2	MIDAS	Abs Rel	0.42	—	Unverified
3	Bhattacharjee et al.	Abs Rel	0.38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeReS	absolute relative error	0.1	—	Unverified
2	DELTAS	absolute relative error	0.09	—	Unverified
3	Distill Any Depth	absolute relative error	0.04	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SDC-Depth	RMSE	6.92	—	Unverified
2	SwinMTL	RMSE	6.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AIP-Brown	Delta < 1.25	0.36	—	Unverified
2	LeRes	Delta < 1.25	0.23	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	H-Net (Ours)	Absolute relative error (AbsRel)	0.09	—	Unverified
2	H-Net (Ours) Full Eigen	Absolute relative error (AbsRel)	0.08	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLPDepth	Delta < 1.25	0.43	—	Unverified
2	SRDINET (Model A)	Delta < 1.25	0.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Atlas (finetuned)	RMSE	0.17	—	Unverified
2	Atlas (plain)	RMSE	0.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LFattNet	BadPix(0.01)	17.23	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LightDepth	Number of parameters (M)	42.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniFuse	Abs Rel	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-TC (Cross-Task Consistency)	L1 error	1.63	—	Unverified