SOTAVerified

MMFusion: Combining Image Forensic Filters for Visual Manipulation Detection and Localization

2023-12-04Code Available1· sign in to hype

Kostas Triaridis, Konstantinos Tsigos, Vasileios Mezaris

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Recent image manipulation localization and detection techniques typically leverage forensic artifacts and traces that are produced by a noise-sensitive filter, such as SRM or Bayar convolution. In this paper, we showcase that different filters commonly used in such approaches excel at unveiling different types of manipulations and provide complementary forensic traces. Thus, we explore ways of combining the outputs of such filters to leverage the complementary nature of the produced artifacts for performing image manipulation localization and detection (IMLD). We assess two distinct combination methods: one that produces independent features from each forensic filter and then fuses them (this is referred to as late fusion) and one that performs early mixing of different modal outputs and produces combined features (this is referred to as early fusion). We use the latter as a feature encoding mechanism, accompanied by a new decoding mechanism that encompasses feature re-weighting, for formulating the proposed MMFusion architecture. We demonstrate that MMFusion achieves competitive performance for both image manipulation localization and detection, outperforming state-of-the-art models across several image and video datasets. We also investigate further the contribution of each forensic filter within MMFusion for addressing different types of manipulations, building on recent AI explainability measures.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Casia V1+Late FusionBalanced Accuracy0.86Unverified
Casia V1+Early FusionBalanced Accuracy0.85Unverified
CocoGlideEarly FusionBalanced Accuracy0.66Unverified
CocoGlideLate FusionBalanced Accuracy0.68Unverified
ColumbiaEarly FusionBalanced Accuracy0.96Unverified
ColumbiaLate FusionBalanced Accuracy0.82Unverified
COVERAGELate FusionBalanced Accuracy0.72Unverified
COVERAGEEarly FusionBalanced Accuracy0.77Unverified
DSO-1Early FusionBalanced Accuracy0.94Unverified
DSO-1Late FusionBalanced Accuracy0.83Unverified

Reproductions