Deepdocclassifier: Document classification with deep Convolutional Neural Network / Afzal, Muhammad Zeshan; Capobianco, Samuele; Malik, Muhammad Imran; Marinai, Simone; Breuel, Thomas M.; Dengel, Andreas; Liwicki, Marcus. - ELECTRONIC. - (2015), pp. 1111-1115. (Paper presented at the International Conference on Document Analysis and Recognition) [10.1109/ICDAR.2015.7333933].

Deepdocclassifier: Document classification with deep Convolutional Neural Network

CAPOBIANCO, SAMUELE; MARINAI, SIMONE
2015

Abstract

This paper presents a deep Convolutional Neural Network (CNN) based approach for document image classification. One of the main requirements of deep CNN architectures is that they need a huge number of samples for training. To overcome this problem, we adopt a deep CNN that is trained on a large image dataset containing millions of samples, i.e., ImageNet. The proposed approach outperforms both the traditional structure similarity methods and the CNN based approaches proposed earlier. With merely 20 images per class, the proposed approach outperforms the state of the art, achieving a classification accuracy of 68.25%. The best results on the Tobacco-3482 dataset show that the proposed method outperforms the state-of-the-art method by a significant margin, achieving a median accuracy of 77.6% with 100 samples per class used for training and validation.
2015
Proceedings of the 13th International Conference on Document Analysis and Recognition
International Conference on Document Analysis and Recognition
Afzal, Muhammad Zeshan; Capobianco, Samuele; Malik, Muhammad Imran; Marinai, Simone; Breuel, Thomas M.; Dengel, Andreas; Liwicki, Marcus
Use this identifier to cite or link to this resource: https://hdl.handle.net/2158/1032890