SOTAVerified

Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation

2025-05-02Unverified0· sign in to hype

Dimitrios Dagdilelis, Panagiotis Grigoriadis, Roberto Galeazzi

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We propose a cross attention transformer based method for multimodal sensor fusion to build a birds eye view of a vessels surroundings supporting safer autonomous marine navigation. The model deeply fuses multiview RGB and long wave infrared images with sparse LiDAR point clouds. Training also integrates X band radar and electronic chart data to inform predictions. The resulting view provides a detailed reliable scene representation improving navigational accuracy and robustness. Real world sea trials confirm the methods effectiveness even in adverse weather and complex maritime settings.

Tasks

Reproductions