MemeBLIP2: A novel lightweight multimodal system to detect harmful memes

2025-04-29

Jiaqi Liu, Ran Tong, Aowei Shen, Shuzheng Li, Changlin Yang, Lisha Xu

Abstract

Memes often merge visuals with brief text to share humor or opinions, yet some memes contain harmful messages such as hate speech. In this paper, we introduce MemeBLIP2, a lightweight multimodal system that detects harmful memes by combining image and text features effectively. We build on previous studies by adding modules that align image and text representations into a shared space and fuse them for better classification. Using BLIP-2 as the core vision-language model, our system is evaluated on the PrideMM dataset. The results show that MemeBLIP2 can capture subtle cues in both modalities, even in cases with ironic or culturally specific content, thereby improving the detection of harmful material.
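
The align-then-fuse idea described in the abstract can be illustrated with a toy sketch. This is not the authors' architecture: the abstract does not specify the module designs, so the linear projections, the concatenation-based fusion, the feature dimensions, and the names `ToyMemeClassifier` and `predict_proba` are all assumptions made for illustration. The input vectors stand in for image and text features that a vision-language model such as BLIP-2 would produce, and the weights are random and untrained.

```python
import math
import random


def make_linear(in_dim, out_dim, rng):
    """Return a randomly initialised weight matrix of shape (out_dim, in_dim)."""
    scale = 1.0 / math.sqrt(in_dim)
    return [[rng.uniform(-scale, scale) for _ in range(in_dim)]
            for _ in range(out_dim)]


def apply_linear(weights, vec):
    """Multiply a weight matrix by a vector (no bias term)."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class ToyMemeClassifier:
    """Hypothetical align-and-fuse classifier (illustrative, untrained)."""

    def __init__(self, image_dim, text_dim, shared_dim, num_classes=2, seed=0):
        rng = random.Random(seed)
        # Alignment: one projection per modality into a shared space.
        self.image_proj = make_linear(image_dim, shared_dim, rng)
        self.text_proj = make_linear(text_dim, shared_dim, rng)
        # Classification head over the fused representation.
        self.head = make_linear(2 * shared_dim, num_classes, rng)

    def predict_proba(self, image_feat, text_feat):
        img = l2_normalize(apply_linear(self.image_proj, image_feat))
        txt = l2_normalize(apply_linear(self.text_proj, text_feat))
        # Fusion by concatenation (an assumption; other schemes are possible).
        fused = img + txt
        return softmax(apply_linear(self.head, fused))


# Placeholder features; real ones would come from a vision-language encoder.
model = ToyMemeClassifier(image_dim=8, text_dim=6, shared_dim=4)
probs = model.predict_proba([0.1] * 8, [0.2] * 6)
```

Here `probs` holds one probability per class (for example, harmless versus harmful). In a trained system the projections and the head would be learned jointly so that matching image and text content lands close together in the shared space.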

Tasks

Language Modeling
