SOTAVerified

Visual Localization

Visual Localization is the problem of estimating the camera pose of a given image relative to a visual representation of a known scene.

Source: Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization

Papers

Showing 151200 of 402 papers

TitleStatusHype
Semantic and Feature Guided Uncertainty Quantification of Visual Localization for Autonomous Vehicles0
Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints0
Robust Visual Localization via Semantic-Guided Multi-Scale Transformer0
Deep Learning Reforms Image Matching: A Survey and Outlook0
To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building ModelsCode0
SafeNav: Safe Path Navigation using Landmark Based Localization in a GPS-denied Environment0
GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting0
Visual Re-Ranking with Non-Visual Side InformationCode0
LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds0
Scene-agnostic Pose Regression for Visual Localization0
Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning0
A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenariosCode0
Multi-Platform Teach-and-Repeat Navigation by Visual Place Recognition Based on Deep-Learned Local Features0
NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features0
Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes0
3D Gaussian Splatting aided Localization for Large and Complex Indoor-Environments0
Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning0
Visual Localization via Semantic Structures in Autonomous Photovoltaic Power Plant Inspection0
FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis0
Training Medical Large Vision-Language Models with Abnormal-Aware Feedback0
Gaussian Splatting Feature Fields for (Privacy-Preserving) Visual Localization0
CroCoDL: Cross-device Collaborative Dataset for Localization0
GPVK-VL: Geometry-Preserving Virtual Keyframes for Visual Localization under Large Viewpoint Changes0
Mutli-View 3D Reconstruction using Knowledge DistillationCode0
Unleashing the Power of Data Synthesis in Visual Localization0
PEnG: Pose-Enhanced Geo-LocalisationCode0
YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras0
Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization0
LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images0
RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps0
Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension0
SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality0
Combining Absolute and Semi-Generalized Relative Poses for Visual Localization0
Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood InformationCode0
HGSLoc: 3DGS-based Heuristic Camera Pose Refinement0
Reprojection Errors as Prompts for Efficient Scene Coordinate Regression0
Unveiling Visual Biases in Audio-Visual Localization Benchmarks0
Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations0
MambaLoc: Efficient Camera Localisation via State Space Model0
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos0
Re-localization acceleration with Medoid Silhouette Clustering0
From 2D to 3D: AISG-SLA Visual Localization Challenge0
Pose Estimation from Camera Images for Underwater Inspection0
RADA: Robust and Accurate Feature Learning with Domain Adaptation0
An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robotsCode0
Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization0
Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement0
Monocular Localization with Semantics Map for Autonomous Vehicles0
MeshVPR: Citywide Visual Place Recognition Using 3D Meshes0
TP3M: Transformer-based Pseudo 3D Image Matching with Reference Image0
Show:102550
← PrevPage 4 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MapNetMean Translation Error48.21Unverified
2PointNetVLADMean Translation Error28.48Unverified
3DCPMean Translation Error18.45Unverified
4AD-MapNetMean Translation Error18.43Unverified
5AtLoc+Mean Translation Error17.92Unverified
6PosePNMean Translation Error16.32Unverified
7VMLocMean Translation Error15.11Unverified
8MS-TransformerMean Translation Error11.69Unverified
9PoseMinkLocMean Translation Error11.2Unverified
10PosePN++Mean Translation Error10.64Unverified
#ModelMetricClaimedVerifiedStatus
1GIM-LoFTRAcc@0.25m, 2°79.1Unverified
2LoFTRAcc@0.25m, 2°78.5Unverified
3GIM-SuperGlueAcc@0.25m, 2°78Unverified
4SuperGlueAcc@0.25m, 2°77Unverified
5GIM-DKMAcc@0.25m, 2°77Unverified
6SCFeatAcc@0.25m, 2°74.3Unverified
7DKMAcc@0.25m, 2°70.2Unverified
#ModelMetricClaimedVerifiedStatus
1AtLocMean Translation Error29.6Unverified
2MapNet++Mean Translation Error29.5Unverified
3GNNMapNetMean Translation Error17.35Unverified
4CoordiNetMean Translation Error14.96Unverified
5AtLoc+Mean Translation Error13.7Unverified
6RobustLocMean Translation Error9.37Unverified
#ModelMetricClaimedVerifiedStatus
1Patch-NetVLADAcc @ .25m, 2°0.12Unverified
#ModelMetricClaimedVerifiedStatus
1Patch-NetVLADAcc @ .25m, 2°0.1Unverified