Depth Estimation

Depth estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single-view) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can estimate depth directly by minimizing a regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated using a root-mean-square (RMS) error metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset
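
As a rough illustration of the multi-view geometry mentioned above, the sketch below converts a disparity map from a rectified stereo pair into metric depth with the standard pinhole relation depth = focal length × baseline / disparity. The function name `disparity_to_depth` and the camera parameters are hypothetical, chosen only for this example; real pipelines also handle calibration, rectification, and invalid matches more carefully.

```python
def disparity_to_depth(disparity_px, focal_length_px, baseline_m):
    """Convert a disparity map (pixels) to depth (meters).

    Assumes a rectified stereo pair and a pinhole camera model.
    Pixels with non-positive disparity are mapped to infinity,
    since no finite depth can be triangulated for them.
    """
    depth = []
    for row in disparity_px:
        depth.append([
            focal_length_px * baseline_m / d if d > 0 else float("inf")
            for d in row
        ])
    return depth


# Hypothetical values: 700 px focal length, 0.5 m baseline.
disparity = [[50.0, 25.0],
             [0.0, 10.0]]
depth_map = disparity_to_depth(disparity, focal_length_px=700.0, baseline_m=0.5)
print(depth_map)
```

Larger disparities correspond to closer points, which is why depth precision from stereo typically degrades with distance.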

Papers

Showing 251–300 of 2454 papers

Title	Date	Tasks	Status	Hype
RSGaussian:3D Gaussian Splatting with LiDAR for Aerial Remote Sensing Novel View Synthesis	Dec 24, 2024	Depth Estimation, Novel View Synthesis	Unverified	0
LiRCDepth: Lightweight Radar-Camera Depth Estimation via Knowledge Distillation and Uncertainty Guidance	Dec 20, 2024	Computational Efficiency, Depth Estimation	Code Available	1
Scaling 4D Representations	Dec 19, 2024	Action Classification, Camera Pose Estimation	Unverified	0
Flowing from Words to Pixels: A Framework for Cross-Modality Evolution	Dec 19, 2024	Depth Estimation, Image Captioning	Unverified	0
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion	Dec 18, 2024	Denoising, Depth Completion	Unverified	0
Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Dec 18, 2024	Depth Estimation, Monocular Depth Estimation	Unverified	0
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation	Dec 18, 2024	3D Reconstruction, 4k	Code Available	5
PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Dec 17, 2024	3D Object Detection, Depth Estimation	Unverified	0
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations	Dec 16, 2024	3D Object Detection, Depth Estimation	Unverified	0
Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video	Dec 16, 2024	Depth Estimation	Unverified	0
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction	Dec 15, 2024	Autonomous Driving, Depth Estimation	Code Available	1
MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance	Dec 14, 2024	Decoder, Depth Estimation	Unverified	0
Cross-View Completion Models are Zero-shot Correspondence Estimators	Dec 12, 2024	Decoder, Depth Estimation	Unverified	0
Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos	Dec 12, 2024	Camera Pose Estimation, Depth Estimation	Unverified	0
T-SVG: Text-Driven Stereoscopic Video Generation	Dec 12, 2024	Depth Estimation, Text-to-Video Generation	Unverified	0
BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation	Dec 11, 2024	3D Pose Estimation, Depth Estimation	Unverified	0
Utilizing Multi-step Loss for Single Image Reflection Removal	Dec 11, 2024	Depth Estimation, Image Segmentation	Code Available	0
Dense Depth from Event Focal Stack	Dec 11, 2024	Depth Estimation	Unverified	0
Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Dec 10, 2024	Decoder, Depth-aware Video Panoptic Segmentation	Unverified	0
SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Dec 9, 2024	Depth Estimation, Semantic Segmentation	Unverified	0
On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events	Dec 9, 2024	Benchmarking, Computational Efficiency	Unverified	0
Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving	Dec 9, 2024	4D reconstruction, Autonomous Driving	Code Available	2
Event fields: Capturing light fields at high speed, resolution, and dynamic range	Dec 9, 2024	Depth Estimation, Scene Understanding	Unverified	0
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction	Dec 9, 2024	Autonomous Driving, Depth Estimation	Unverified	0
GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion	Dec 8, 2024	Autonomous Driving, Autonomous Vehicles	Unverified	0
TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action	Dec 7, 2024	Depth Estimation, Mathematical Reasoning	Code Available	2
PanoDreamer: Optimization-Based Single Image to 360 3D Scene With Diffusion	Dec 6, 2024	3D Scene Reconstruction, Depth Estimation	Code Available	2
SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images	Dec 6, 2024	Contrastive Learning, Depth Estimation	Code Available	0
MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos	Dec 5, 2024	Depth Estimation	Code Available	5
DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction	Dec 5, 2024	3D Pose Estimation, 3D Reconstruction	Unverified	0
MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction	Dec 5, 2024	3D Reconstruction, Depth Estimation	Unverified	0
LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation	Dec 5, 2024	Depth Estimation, Monocular Depth Estimation	Unverified	0
MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction	Dec 4, 2024	Depth Estimation	Unverified	0
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos	Dec 4, 2024	Depth Estimation, Monocular Depth Estimation	Unverified	0
Dense Scene Reconstruction from Light-Field Images Affected by Rolling Shutter	Dec 4, 2024	Depth Estimation	Code Available	0
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models	Dec 4, 2024	Depth Estimation, object-detection	Unverified	0
GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos	Dec 3, 2024	Depth Estimation, Object Reconstruction	Unverified	0
Amodal Depth Anything: Amodal Depth Estimation in the Wild	Dec 3, 2024	Depth Estimation, Depth Prediction	Unverified	0
Dual Exposure Stereo for Extended Dynamic Range 3D Imaging	Dec 3, 2024	Depth Estimation, Stereo Depth Estimation	Unverified	0
Single-Shot Metric Depth from Focused Plenoptic Cameras	Dec 3, 2024	Depth Estimation, Navigate	Unverified	0
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving	Dec 2, 2024	Autonomous Driving, Depth Estimation	Unverified	0
AVS-Net: Audio-Visual Scale Net for Self-supervised Monocular Metric Depth Estimation	Dec 2, 2024	Depth Estimation, Depth Prediction	Unverified	0
STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation	Dec 2, 2024	Autonomous Driving, Depth Estimation	Unverified	0
Mutli-View 3D Reconstruction using Knowledge Distillation	Dec 2, 2024	3D Reconstruction, Depth Estimation	Code Available	0
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation	Dec 1, 2024	3D Scene Reconstruction, Autonomous Navigation	Unverified	0
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection	Nov 29, 2024	3D Multi-Object Tracking, 3D Object Detection	Code Available	0
Gaussian Splashing: Direct Volumetric Rendering Underwater	Nov 29, 2024	3DGS, 3D Reconstruction	Unverified	0
MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications	Nov 29, 2024	Depth Estimation, Depth Prediction	Unverified	0
Video Depth without Video Models	Nov 28, 2024	Depth Estimation	Unverified	0
360Recon: An Accurate Reconstruction Method Based on Depth Fusion from 360 Images	Nov 28, 2024	3D Reconstruction, Depth Estimation	Unverified	0

Datasets: Stanford2D3D Panoramic, NYU-Depth V2, DCM, eBDtheque, ScanNetV2, Cityscapes test, DIODE, KITTI 2015, Mars DTM Estimation, ScanNet, 4D Light Field Dataset, KITTI Eigen split

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	OmniDepth	RMSE	0.62	—	Unverified
2	SphereDepth	RMSE	0.45	—	Unverified
3	Jin et al.	RMSE	0.42	—	Unverified
4	BiFuse with fusion	RMSE	0.41	—	Unverified
5	HoHoNet (ResNet-101)	RMSE	0.38	—	Unverified
6	PanoDepth	RMSE	0.37	—	Unverified
7	BiFuse++	RMSE	0.37	—	Unverified
8	UniFuse with fusion	RMSE	0.37	—	Unverified
9	DisConv	RMSE	0.37	—	Unverified
10	SliceNet	RMSE	0.37	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	A2J	mAP	8.61	—	Unverified
2	PAD-Net	RMS	0.79	—	Unverified
3	MS-CRF	RMS	0.59	—	Unverified
4	DORN	RMS	0.51	—	Unverified
5	Freeform	RMS	0.43	—	Unverified
6	Optimized, freeform	RMS	0.43	—	Unverified
7	VNL	RMS	0.42	—	Unverified
8	BTS	RMS	0.41	—	Unverified
9	TransDepth (AGD+ ViT)	RMS	0.37	—	Unverified
10	AdaBins	RMS	0.36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T2Net	Abs Rel	0.35	—	Unverified
2	MIDAS	Abs Rel	0.31	—	Unverified
3	Bhattacharjee et al.	Abs Rel	0.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T2Net	Abs Rel	0.49	—	Unverified
2	MIDAS	Abs Rel	0.42	—	Unverified
3	Bhattacharjee et al.	Abs Rel	0.38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeReS	absolute relative error	0.1	—	Unverified
2	DELTAS	absolute relative error	0.09	—	Unverified
3	Distill Any Depth	absolute relative error	0.04	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SDC-Depth	RMSE	6.92	—	Unverified
2	SwinMTL	RMSE	6.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AIP-Brown	Delta < 1.25	0.36	—	Unverified
2	LeRes	Delta < 1.25	0.23	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	H-Net (Ours)	Absolute relative error (AbsRel)	0.09	—	Unverified
2	H-Net (Ours) Full Eigen	Absolute relative error (AbsRel)	0.08	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLPDepth	Delta < 1.25	0.43	—	Unverified
2	SRDINET (Model A)	Delta < 1.25	0.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Atlas (finetuned)	RMSE	0.17	—	Unverified
2	Atlas (plain)	RMSE	0.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LFattNet	BadPix(0.01)	17.23	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LightDepth	Number of parameters (M)	42.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniFuse	Abs Rel	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-TC (Cross-Task Consistency)	L1 error	1.63	—	Unverified
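
For readers comparing the numbers above, here is a minimal sketch of how three of the reported metrics (RMSE, Abs Rel, and Delta < 1.25) are commonly computed from predicted and ground-truth depths. It uses only the Python standard library. The function name `depth_metrics` and the sample values are hypothetical, and individual benchmarks may apply their own depth caps, crops, or validity masks that this sketch does not reproduce.

```python
import math


def depth_metrics(pred, target):
    """Compute RMSE, Abs Rel, and Delta < 1.25 over paired depth values.

    pred, target: flat sequences of depths in the same unit.
    Pairs with a non-positive prediction or ground truth are skipped
    (an assumption of this sketch, made to avoid division by zero).
    """
    pairs = [(p, t) for p, t in zip(pred, target) if p > 0 and t > 0]
    n = len(pairs)
    if n == 0:
        raise ValueError("no valid depth pairs")

    # Root-mean-square error: lower is better.
    rmse = math.sqrt(sum((p - t) ** 2 for p, t in pairs) / n)
    # Mean absolute error relative to ground truth: lower is better.
    abs_rel = sum(abs(p - t) / t for p, t in pairs) / n
    # Fraction of pixels whose ratio to ground truth is within 1.25: higher is better.
    delta1 = sum(1 for p, t in pairs if max(p / t, t / p) < 1.25) / n

    return {"rmse": rmse, "abs_rel": abs_rel, "delta1": delta1}


# Hypothetical predictions and ground truth, in meters.
pred = [1.0, 2.0, 4.0, 8.0]
target = [1.0, 2.5, 4.0, 4.0]
metrics = depth_metrics(pred, target)
print(metrics)
```

RMSE is expressed in the depth unit and is dominated by large errors, while Abs Rel and Delta < 1.25 are unitless, so values from different metrics are not directly comparable.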