Depth Estimation
Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.
Papers
Showing 1–10 of 2454 papers
All datasetsStanford2D3D PanoramicNYU-Depth V2DCMeBDthequeScanNetV2Cityscapes testDIODEKITTI 2015Mars DTM EstimationScanNet4D Light Field DatasetKITTI Eigen split
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | A2J | mAP | 8.61 | — | Unverified |
| 2 | PAD-Net | RMS | 0.79 | — | Unverified |
| 3 | MS-CRF | RMS | 0.59 | — | Unverified |
| 4 | DORN | RMS | 0.51 | — | Unverified |
| 5 | Freeform | RMS | 0.43 | — | Unverified |
| 6 | Optimized, freeform | RMS | 0.43 | — | Unverified |
| 7 | VNL | RMS | 0.42 | — | Unverified |
| 8 | BTS | RMS | 0.41 | — | Unverified |
| 9 | TransDepth (AGD+ ViT) | RMS | 0.37 | — | Unverified |
| 10 | AdaBins | RMS | 0.36 | — | Unverified |