
Menardi M.; Falcon A.; Mohamed S.S.; Seidenari L.; Serra G.; Del Bimbo A.; Tasso C.: Text-to-Image Synthesis Based on Machine Generated Captions. Electronic, vol. 1177, pp. 62-74 (2020). Paper presented at the 16th Italian Research Conference on Digital Libraries, IRCDL 2020, held in Italy in 2020 [10.1007/978-3-030-39905-4_7].

Text-to-Image Synthesis Based on Machine Generated Captions

Seidenari L.: Conceptualization

Abstract

Text-to-Image Synthesis refers to the automatic generation of a photo-realistic image from a given text and is revolutionizing many real-world applications. Performing this process requires datasets of captioned images, in which each image is associated with one or more captions describing it. Despite the abundance of uncaptioned image datasets, the number of captioned datasets is limited. To address this issue, in this paper we propose an approach capable of generating images from a given text using a conditional generative adversarial network (GAN) trained on a dataset of uncaptioned images. In particular, the uncaptioned images are fed to an Image Captioning Module that generates their descriptions. The GAN Module is then trained on both the input images and the "machine-generated" captions. To evaluate the results, the performance of our solution is compared with that obtained by an unconditional GAN. For the experiments, we chose the uncaptioned LSUN-bedroom dataset. The results obtained in our study are preliminary but promising.
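The abstract outlines a two-stage pipeline: an off-the-shelf captioner first produces descriptions for the uncaptioned images, and a conditional GAN is then trained on the resulting (image, machine-generated caption) pairs. The PyTorch sketch below is a minimal illustration of that data flow under our own assumptions, not the authors' implementation: the caption encoder, the DCGAN-style generator and discriminator, and all names (CaptionEncoder, Generator, Discriminator, train_step) are hypothetical placeholders.

# A minimal, self-contained sketch (assumed PyTorch) of training a conditional
# GAN on machine-generated captions. Hypothetical code, not the authors' method.
import torch
import torch.nn as nn

class CaptionEncoder(nn.Module):
    """Embed a tokenized machine-generated caption into a fixed-size vector."""
    def __init__(self, vocab_size=10_000, embed_dim=128, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.rnn = nn.GRU(embed_dim, hidden_dim, batch_first=True)

    def forward(self, tokens):               # tokens: (B, T) int64
        _, h = self.rnn(self.embed(tokens))
        return h.squeeze(0)                  # (B, hidden_dim)

class Generator(nn.Module):
    """Map (noise, caption embedding) to a 64x64 RGB image, DCGAN-style."""
    def __init__(self, z_dim=100, cond_dim=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(z_dim + cond_dim, 256, 4, 1, 0), nn.ReLU(True),
            nn.ConvTranspose2d(256, 128, 4, 2, 1), nn.ReLU(True),
            nn.ConvTranspose2d(128, 64, 4, 2, 1), nn.ReLU(True),
            nn.ConvTranspose2d(64, 32, 4, 2, 1), nn.ReLU(True),
            nn.ConvTranspose2d(32, 3, 4, 2, 1), nn.Tanh(),
        )

    def forward(self, z, cond):
        x = torch.cat([z, cond], dim=1)[:, :, None, None]  # (B, z+cond, 1, 1)
        return self.net(x)                                 # (B, 3, 64, 64)

class Discriminator(nn.Module):
    """Score an image as real/fake, conditioned on the caption embedding."""
    def __init__(self, cond_dim=256):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 64, 4, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 4, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv2d(128, 256, 4, 2, 1), nn.LeakyReLU(0.2),
            nn.Conv2d(256, 256, 4, 2, 1), nn.LeakyReLU(0.2),
        )
        self.head = nn.Linear(256 * 4 * 4 + cond_dim, 1)

    def forward(self, img, cond):
        feat = self.conv(img).flatten(1)
        return self.head(torch.cat([feat, cond], dim=1))   # logits

def train_step(G, D, enc, opt_g, opt_d, bce, images, tokens, z_dim=100):
    """One adversarial update on a batch of (image, generated-caption) pairs.

    The caption encoder is kept frozen here; the captions themselves were
    produced beforehand by a pretrained Image Captioning Module.
    """
    cond = enc(tokens).detach()
    real_y = torch.ones(images.size(0), 1)
    fake_y = torch.zeros(images.size(0), 1)

    # Discriminator: push real (image, caption) pairs up, generated ones down.
    z = torch.randn(images.size(0), z_dim)
    fake = G(z, cond).detach()
    loss_d = bce(D(images, cond), real_y) + bce(D(fake, cond), fake_y)
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # Generator: fool the discriminator under the same caption condition.
    z = torch.randn(images.size(0), z_dim)
    loss_g = bce(D(G(z, cond), cond), real_y)
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()
    return loss_d.item(), loss_g.item()

Conditioning by concatenating a text embedding to the noise vector and to the discriminator features follows the standard conditional-GAN recipe; the paper's actual architecture and captioner may differ, and this sketch only fixes the data flow the abstract describes.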
Year: 2020
Series: Communications in Computer and Information Science, vol. 1177
Conference: 16th Italian Research Conference on Digital Libraries, IRCDL 2020, Italy, 2020
Authors: Menardi M.; Falcon A.; Mohamed S.S.; Seidenari L.; Serra G.; Del Bimbo A.; Tasso C.
Files in this product:
text_to_imageGAN.pdf (closed access)
Type: Publisher's PDF (version of record)
License: All rights reserved
Size: 3.68 MB, Adobe PDF

Documents in FLORE are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this resource: https://hdl.handle.net/2158/1195866
Citations
  • PMC: ND
  • Scopus: 0
  • Web of Science (ISI): 0