In the last years the interest in e-book readers is significantly growing. Two main document formats are supported by most devices: PDF and ePub. The PDF format is widely used to share documents allowing a cross-platform readability. However, it is not ideal for a comfortable reading on small screens. On the opposite, the ePub format is re-flowable and it is well suited for e-book readers. In this paper we describe a system for the conversion of PDF books to the ePub format aiming at inverting the text formatting made during the pagination. To this purpose, layout analysis techniques are performed to identify the book's table of contents and the main functional regions such as chapters, paragraphs, and notes.
Conversion of PDF Books in ePub Format / Simone Marinai;Emanuele Marino;Giovanni Soda. - STAMPA. - (2011), pp. 478-482. (Intervento presentato al convegno International Conference on Document Analysis and Recognition tenutosi a Beijing (China) nel Sept. 2011) [10.1109/ICDAR.2011.102].
Conversion of PDF Books in ePub Format
MARINAI, SIMONE;MARINO, EMANUELE;SODA, GIOVANNI
2011
Abstract
In the last years the interest in e-book readers is significantly growing. Two main document formats are supported by most devices: PDF and ePub. The PDF format is widely used to share documents allowing a cross-platform readability. However, it is not ideal for a comfortable reading on small screens. On the opposite, the ePub format is re-flowable and it is well suited for e-book readers. In this paper we describe a system for the conversion of PDF books to the ePub format aiming at inverting the text formatting made during the pagination. To this purpose, layout analysis techniques are performed to identify the book's table of contents and the main functional regions such as chapters, paragraphs, and notes.I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.