The inexorable growth of online shopping and e-commerce demands scalable and robust machine learning-based solutions to accommodate customer requirements. In the context of automatic tagging classification and multimodal retrieval, prior works either defined low-generalizable supervised learning approaches or more reusable CLIP-based techniques that, however, were trained on closed-source data. In this work, we propose OpenFashionCLIP, a vision-and-language contrastive learning method that only adopts open-source fashion data stemming from diverse domains and characterized by varying degrees of specificity. Our approach is extensively validated across several tasks and benchmarks, and experimental results highlight a significant out-of-domain generalization capability and consistent improvements over state-of-the-art methods in terms of both accuracy and recall. Source code and trained models are publicly available at: https://github.com/aimagelab/open-fashion-clip.
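The vision-and-language contrastive learning mentioned in the abstract follows the general CLIP recipe: matched image-text pairs are pulled together in a shared embedding space while mismatched pairs are pushed apart via a symmetric cross-entropy (InfoNCE) loss. The sketch below illustrates that generic loss in NumPy; it is not the authors' implementation, and the function name and temperature value are illustrative assumptions.

```python
import numpy as np

def clip_contrastive_loss(image_emb: np.ndarray, text_emb: np.ndarray,
                          temperature: float = 0.07) -> float:
    """Symmetric InfoNCE loss over a batch of paired embeddings.

    image_emb, text_emb: (batch, dim) arrays where row i of each
    forms a matched image-text pair. A generic CLIP-style sketch,
    not the OpenFashionCLIP code.
    """
    # L2-normalize so dot products become cosine similarities
    image_emb = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    text_emb = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)

    # Pairwise similarity logits, sharpened by the temperature
    logits = image_emb @ text_emb.T / temperature

    def cross_entropy(l: np.ndarray) -> float:
        # Target for row i is column i (the matched pair)
        l = l - l.max(axis=1, keepdims=True)  # numerical stability
        log_probs = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_probs))

    # Average the image-to-text and text-to-image directions
    return 0.5 * (cross_entropy(logits) + cross_entropy(logits.T))
```

With perfectly matched embeddings the loss approaches zero, while shuffled (mismatched) pairs yield a much larger value, which is the signal that drives the two encoders toward a shared space.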

Cartella, Giuseppe; Baldrati, Alberto; Morelli, Davide; Cornia, Marcella; Bertini, Marco; Cucchiara, Rita. OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data. In: Proceedings of the 22nd International Conference on Image Analysis and Processing (ICIAP 2023), vol. 14233, pp. 245-256, 2023. DOI: 10.1007/978-3-031-43148-7_21.

OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data

Baldrati, Alberto; Bertini, Marco; Cucchiara, Rita
2023

Series: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Conference: Proceedings of the 22nd International Conference on Image Analysis and Processing, ICIAP 2023 (Italy, 2023)
Goal 3: Good health and well-being
Cartella, Giuseppe; Baldrati, Alberto; Morelli, Davide; Cornia, Marcella; Bertini, Marco; Cucchiara, Rita
Files in this item:

2023-iciap-fashion.pdf
  Access: open access
  Type: Publisher's PDF (Version of Record)
  License: Open Access
  Size: 1.9 MB
  Format: Adobe PDF
Documents in FLORE are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this item: https://hdl.handle.net/2158/1452878
Citations
  • PMC: n/a
  • Scopus: 8
  • Web of Science: 3