Multimodal image-text memes are prevalent on the inter- net, serving as a unique form of communication that com- bines visual and textual elements to convey humor, ideas, or emotions. However, some memes take a malicious turn, promoting hateful content and perpetuating discrimination. Detecting hateful memes within this multimodal context is a challenging task that requires understanding the inter- twined meaning of text and images. In this work, we address this issue by proposing a novel approach named ISSUES for multimodal hateful meme classification. ISSUES lever- ages a pre-trained CLIP vision-language model and the textual inversion technique to effectively capture the mul- timodal semantic content of the memes. The experiments show that our method achieves state-of-the-art results on the Hateful Memes Challenge and HarMeme datasets. The code and the pre-trained models are publicly avail

Mapping Memes to Words for Multimodal Hateful Meme Classification / Burbi, Giovanni; Baldrati, Alberto; Agnolucci, Lorenzo; Bertini, Marco; Del Bimbo, Alberto. - ELETTRONICO. - (2023), pp. 2824-2828. ( IEEE/CVF International Conference on Computer Vision (ICCV) Workshops) [10.1109/iccvw60793.2023.00303].

Mapping Memes to Words for Multimodal Hateful Meme Classification

Baldrati, Alberto;Agnolucci, Lorenzo;Bertini, Marco;Del Bimbo, Alberto
2023

Abstract

Multimodal image-text memes are prevalent on the inter- net, serving as a unique form of communication that com- bines visual and textual elements to convey humor, ideas, or emotions. However, some memes take a malicious turn, promoting hateful content and perpetuating discrimination. Detecting hateful memes within this multimodal context is a challenging task that requires understanding the inter- twined meaning of text and images. In this work, we address this issue by proposing a novel approach named ISSUES for multimodal hateful meme classification. ISSUES lever- ages a pre-trained CLIP vision-language model and the textual inversion technique to effectively capture the mul- timodal semantic content of the memes. The experiments show that our method achieves state-of-the-art results on the Hateful Memes Challenge and HarMeme datasets. The code and the pre-trained models are publicly avail
2023
Proc. of IEEE/CVF International Conference on Computer Vision (ICCV) Workshops
IEEE/CVF International Conference on Computer Vision (ICCV) Workshops
Burbi, Giovanni; Baldrati, Alberto; Agnolucci, Lorenzo; Bertini, Marco; Del Bimbo, Alberto
File in questo prodotto:
File Dimensione Formato  
Burbi_Mapping_Memes_to_Words_for_Multimodal_Hateful_Meme_Classification_ICCVW_2023_paper.pdf

accesso aperto

Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 725.85 kB
Formato Adobe PDF
725.85 kB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1452893
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 24
  • ???jsp.display-item.citation.isi??? 14
social impact