NoisyArt is a dataset design to support research on webly-supervised recognition of artworks, consisting of more than 90,000 images and in more than 3000 webly-supervised classes, with a subset of 200 verified test images. Candidate artworks are identified using publicly available metadata repositories, and images are automatically acquired using search engines. Textual description and other information are provided for each artwork and artist, enabling experimentation on multi-modal techniques like zero-shot classification of artworks and captioning. Several techniques were used to mitigate label noise and domain shift in webly-supervised training settings, with the end of improve instance recognition performance. Further zero-shot learning experiments demonstrate the benefits and limitations of this kind of approaches in the challenging setting of data scarcity and noisy labels for the set of seen classes. This chapter combines and extends our ongoing work on NoisyArt dataset.

NoisyArt: Exploiting the NoisyWeb for Zero-shot Classification and Artwork Instance Recognition / Del Chiaro R.; Bagdanov A.D.; Del Bimbo A.. - STAMPA. - (2021), pp. 1-24. [10.1007/978-3-030-66777-1_1]

NoisyArt: Exploiting the NoisyWeb for Zero-shot Classification and Artwork Instance Recognition

Del Chiaro R.;Bagdanov A. D.;Del Bimbo A.
2021

Abstract

NoisyArt is a dataset design to support research on webly-supervised recognition of artworks, consisting of more than 90,000 images and in more than 3000 webly-supervised classes, with a subset of 200 verified test images. Candidate artworks are identified using publicly available metadata repositories, and images are automatically acquired using search engines. Textual description and other information are provided for each artwork and artist, enabling experimentation on multi-modal techniques like zero-shot classification of artworks and captioning. Several techniques were used to mitigate label noise and domain shift in webly-supervised training settings, with the end of improve instance recognition performance. Further zero-shot learning experiments demonstrate the benefits and limitations of this kind of approaches in the challenging setting of data scarcity and noisy labels for the set of seen classes. This chapter combines and extends our ongoing work on NoisyArt dataset.
2021
9783030667764
9783030667771
Data Analytics for Cultural Heritage: Current Trends and Concepts
1
24
Del Chiaro R.; Bagdanov A.D.; Del Bimbo A.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1450996
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact