In this paper we present a multipage administrative document image retrieval system based on textual and visual representations of document pages. Individual pages are represented by textual or visual information using a bag-of-words framework. Different fusion strategies are evaluated which allow the system to perform multipage document retrieval on the basis of a single page retrieval system. Results are reported on a large dataset of document images sampled from a banking workflow.
Multipage document retrieval by textual and visual representations / Rusinol, Marcal; Karatzas, Dimosthenis; Bagdanov, Andrew D.; Llados, Josep. - ELETTRONICO. - (2012), pp. 521-524. (Intervento presentato al convegno 21st International Conference on Pattern Recognition, ICPR 2012 tenutosi a Tsukuba, jpn nel 2012).
Multipage document retrieval by textual and visual representations
BAGDANOV, ANDREW DAVID;
2012
Abstract
In this paper we present a multipage administrative document image retrieval system based on textual and visual representations of document pages. Individual pages are represented by textual or visual information using a bag-of-words framework. Different fusion strategies are evaluated which allow the system to perform multipage document retrieval on the basis of a single page retrieval system. Results are reported on a large dataset of document images sampled from a banking workflow.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.