Abstract - We analyze a system for the retrieval of document images on the basis of layout similarity. Layout objects are extracted and represented with the XY tree. Page similarity is computed with a tree-edit distance algorithm. The peculiarity of the approach is the use of tree grammars to model the variations in the tree which are due to segmentation algorithms or to structural differences between documents with similar layout. A few class-independent grammatical rules are used to modify each tree and obtain a reduced tree that is supposed to preserve the most relevant features of the page.
Layout based document image retrieval by means of XY tree reduction / S. Marinai; E. Marino E; G. Soda. - STAMPA. - IEEE Press:(2005), pp. 432-436. (Intervento presentato al convegno ICDAR 2005 tenutosi a SEOUL (KOREA) nel September 2005) [10.1109/ICDAR.2005.150].
Layout based document image retrieval by means of XY tree reduction
MARINAI, SIMONE;SODA, GIOVANNI
2005
Abstract
Abstract - We analyze a system for the retrieval of document images on the basis of layout similarity. Layout objects are extracted and represented with the XY tree. Page similarity is computed with a tree-edit distance algorithm. The peculiarity of the approach is the use of tree grammars to model the variations in the tree which are due to segmentation algorithms or to structural differences between documents with similar layout. A few class-independent grammatical rules are used to modify each tree and obtain a reduced tree that is supposed to preserve the most relevant features of the page.File | Dimensione | Formato | |
---|---|---|---|
ICDAR05.pdf
Accesso chiuso
Tipologia:
Versione finale referata (Postprint, Accepted manuscript)
Licenza:
Tutti i diritti riservati
Dimensione
230.15 kB
Formato
Adobe PDF
|
230.15 kB | Adobe PDF | Richiedi una copia |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.