SOTAVerified

3D Reconstruction

3D Reconstruction is the task of creating a 3D model or representation of an object or scene from 2D images or other data sources. The goal of 3D reconstruction is to create a virtual representation of an object or scene that can be used for a variety of purposes, such as visualization, animation, simulation, and analysis. It can be used in fields such as computer vision, robotics, and virtual reality.

Image: Gwak et al

Papers

Showing 150 of 2326 papers

TitleStatusHype
TripoSR: Fast 3D Object Reconstruction from a Single ImageCode9
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction PriorsCode7
Grounding Image Matching in 3D with MASt3RCode7
Instant Neural Graphics Primitives with a Multiresolution Hash EncodingCode6
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse AttentionCode5
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow ModelsCode5
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward PassCode5
Prompting Depth Anything for 4K Resolution Accurate Metric Depth EstimationCode5
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB VideosCode5
Neural Fields in Robotics: A SurveyCode5
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View SynthesisCode5
3D Reconstruction with Spatial MemoryCode5
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth EstimationCode5
InstantSplat: Sparse-view SfM-free Gaussian Splatting in SecondsCode5
MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View ImagesCode5
Real3D-Portrait: One-shot Realistic 3D Talking Portrait SynthesisCode5
DUSt3R: Geometric 3D Vision Made EasyCode5
Structure-Aware Sparse-View X-ray 3D ReconstructionCode5
Infinite Photorealistic Worlds using Procedural GenerationCode5
RealFusion: 360° Reconstruction of Any Object from a Single ImageCode5
SpatialTrackerV2: 3D Point Tracking Made EasyCode4
UniK3D: Universal Camera Monocular 3D EstimationCode4
AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using SmartphonesCode4
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed ImagesCode4
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense PredictionCode4
On Scaling Up 3D Gaussian Splatting TrainingCode4
S^3Gaussian: Self-Supervised Street Gaussians for Autonomous DrivingCode4
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and MeshingCode4
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and GenerationCode4
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single ImageCode4
Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like SpeedCode4
Cameras as Rays: Pose Estimation via Ray DiffusionCode4
GIM: Learning Generalizable Image Matcher From Internet VideosCode4
Gaussian Splatting SLAMCode4
LightGlue: Local Feature Matching at Light SpeedCode4
Real-time volumetric rendering of dynamic humansCode4
Zero-1-to-3: Zero-shot One Image to 3D ObjectCode4
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image PriorsCode4
Highly Accurate Dichotomous Image SegmentationCode4
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D ReconstructionCode3
Geo4D: Leveraging Video Generators for Geometric 4D Scene ReconstructionCode3
PE3R: Perception-Efficient 3D ReconstructionCode3
MUSt3R: Multi-view Network for Stereo 3D ReconstructionCode3
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset GenerationCode3
JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video EditingCode3
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian SplattingCode3
Differentiable Voxel-based X-ray Rendering Improves Sparse-View 3D CBCT ReconstructionCode3
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse ViewsCode3
PF3plat: Pose-Free Feed-Forward 3D Gaussian SplattingCode3
Large Spatial Model: End-to-end Unposed Images to Semantic 3DCode3
Show:102550
← PrevPage 1 of 47Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
13D-R2N2Overall0.63Unverified
2GipumaOverall0.58Unverified
3COLMAPOverall0.53Unverified
4MVSNetOverall0.46Unverified
5Vis-MVSNetOverall0.37Unverified
6AA-RMVSNetOverall0.36Unverified
7Cas-MVSNetOverall0.36Unverified
8EPP-MVSNetOverall0.36Unverified
9PatchmatchNetOverall0.35Unverified
10CVP-MVSNetOverall0.35Unverified
#ModelMetricClaimedVerifiedStatus
1MD-GONIoU92.8Unverified
2POCOIoU92.6Unverified
3FS-SDFIoU91.2Unverified
4DP-ConvONetIoU89.5Unverified
5ConvONetIoU88.4Unverified
6ONetIoU76.1Unverified
7EVolTIoU73.8Unverified
8ZubicLioIoU65.43Unverified
#ModelMetricClaimedVerifiedStatus
1AttSets3DIoU0.64Unverified
2PSGN3DIoU0.64Unverified
3OGN3DIoU0.6Unverified
43D-R2N23DIoU0.56Unverified
#ModelMetricClaimedVerifiedStatus
1Scan2CADAverage Accuracy31.68Unverified
23DMatchAverage Accuracy10.29Unverified
#ModelMetricClaimedVerifiedStatus
1SVCPChamfer10Unverified
#ModelMetricClaimedVerifiedStatus
1EVLAccuracy18.2Unverified
#ModelMetricClaimedVerifiedStatus
1EVLAccuracy5.7Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)3DIoU89.4Unverified