We propose an approach for efficient word retrieval from printed documents belonging to Digital Libraries. The approach combines word image clustering (based on Self-Organizing Maps, SOM) with Principal Component Analysis (PCA). The combination of these methods allows us to efficiently retrieve matching words from large document collections without directly comparing the query word with each indexed word.

Efficient Word Retrieval by means of SOM clustering and PCA / S. Marinai; S. Faini; E. Marino; G. Soda. - PRINT. - LNCS 3872:(2006), pp. 336-347. (Paper presented at the DAS 2006 conference held in Nelson (New Zealand) in September 2006) [10.1007/11669487_30].

Efficient Word Retrieval by means of SOM clustering and PCA

MARINAI, SIMONE;SODA, GIOVANNI
2006

Document Analysis Systems VII
DAS 2006
Nelson (New Zealand)
September 2006
S. Marinai; S. Faini; E. Marino; G. Soda

Documents in FLORE are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this resource: https://hdl.handle.net/2158/260785
Citations
  • PMC: ND
  • Scopus: 14
  • ISI: 12