SISMA: Semantic Face Image Synthesis with Mamba

Botti, Filippo; Ergasti, Alex; Fontanini, Tomaso; Ferrari, Claudio; Bertozzi, Massimo; Prati, Andrea

doi:10.1007/978-3-032-11317-7_49

Diffusion Models have become very popular for Semantic Image Synthesis (SIS) of human faces. Nevertheless, their training and inference is computationally expensive and their computational requirements are high due to the quadratic complexity of attention layers. In this paper, we propose a novel architecture called SISMA, based on the recently proposed Mamba. SISMA generates high quality samples by controlling their shape using a semantic mask at a reduced computational demand. We validated our approach through comprehensive experiments with CelebAMask-HQ, revealing that our architecture not only achieves a better FID score yet also operates at three times the speed of state-of-the-art architectures. This indicates that the proposed design is a viable, lightweight substitute to transformer-based models.

SISMA: Semantic Face Image Synthesis with Mamba / Botti, F., Ergasti, A., Fontanini, T., Ferrari, C., Bertozzi, M., Prati, A.. - ELETTRONICO. - 16169:(2026), pp. 609-619. (Workshops and competitions hosted by the 23rd International Conference on Image Analysis and Processing, ICIAP 2025 ita 2025) [10.1007/978-3-032-11317-7_49].

SISMA: Semantic Face Image Synthesis with Mamba

Botti, Filippo;Ergasti, Alex;Fontanini, Tomaso;Ferrari, Claudio;Bertozzi, Massimo;Prati, Andrea

2026

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2026
			
	Titolo del volume degli atti
	
				Lecture Notes in Computer Science
			
	Collana/Serie
	
				LECTURE NOTES IN COMPUTER SCIENCE
			
	Titolo del congresso
	
				Workshops and competitions hosted by the 23rd International Conference on Image Analysis and Processing, ICIAP 2025
			
	Luogo del congresso
	
				ita
			
	Data del congresso
	
				2025
			
	Tutti gli autori
	
						Botti, Filippo; Ergasti, Alex; Fontanini, Tomaso; Ferrari, Claudio; Bertozzi, Massimo; Prati, Andrea
					
	Appare nelle tipologie:
	
				4a - Articolo in atti di congresso

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1453031

SISMA: Semantic Face Image Synthesis with Mamba

Botti, Filippo;Ergasti, Alex;Fontanini, Tomaso;Ferrari, Claudio;Bertozzi, Massimo;Prati, Andrea

2026

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

SISMA: Semantic Face Image Synthesis with Mamba

Botti, Filippo;Ergasti, Alex;Fontanini, Tomaso;Ferrari, Claudio;Bertozzi, Massimo;Prati, Andrea

2026

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)