Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models

2025-02-25Code Available0· sign in to hype

Cao Yuxuan, Wu Jiayang, Alistair Cheong Liang Chuen, Bryan Shan Guanrong, Theodore Lee Chong Jen, Sherman Chann Zhi Shen

arXiv PDF

Code Available — Be the first to reproduce this paper.

Reproduce

Code

github.com/aliencaocao/vlm-for-memes-aisg
OfficialIn paperpytorch★ 7

Abstract

Traditional online content moderation systems struggle to classify modern multimodal means of communication, such as memes, a highly nuanced and information-dense medium. This task is especially hard in a culturally diverse society like Singapore, where low-resource languages are used and extensive knowledge on local context is needed to interpret online content. We curate a large collection of 112K memes labeled by GPT-4V for fine-tuning a VLM to classify offensive memes in Singapore context. We show the effectiveness of fine-tuned VLMs on our dataset, and propose a pipeline containing OCR, translation and a 7-billion parameter-class VLM. Our solutions reach 80.62% accuracy and 0.8192 AUROC on a held-out test set, and can greatly aid human in moderating online contents. The dataset, code, and model weights will be open-sourced at https://github.com/aliencaocao/vlm-for-memes-aisg.

Tasks

Optical Character Recognition (OCR)

Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models

Code

Abstract

Tasks

Reproductions