Emerging applications such as Deep Learning are often data-driven, thus traditional approaches based on auto-tuners are not performance effective across the wide range of inputs used in practice. In the present paper, we start an investigation of predictive models based on machine learning techniques in order to optimize Convolution Neural Networks (CNNs). As a use-case, we focus on the ARM Compute Library which provides three different implementations of the convolution operator at different numeric precision. Starting from a collation of benchmarks, we build and validate models learned by Decision Tree and naive Bayesian classifier. Preliminary experiments on Midgard-based ARM Mali GPU show that our predictive model outperforms all the convolution operators manually selected by the library.
Towards a Learning-Based Performance Modeling for Accelerating Deep Neural Networks / Perri, D.; Sylos Labini, P.; Gervasi, O.; Tasso, S.; Vella, F.. - ELETTRONICO. - 11619 LNCS:(2019), pp. 665-676. (Intervento presentato al convegno International Conference on Computational Science and Its Applications tenutosi a San Pietroburgo, Russia nel 01/07/2019 - 04/07/2019) [10.1007/978-3-030-24289-3_49].
Towards a Learning-Based Performance Modeling for Accelerating Deep Neural Networks
Perri, D.;
2019
Abstract
Emerging applications such as Deep Learning are often data-driven, thus traditional approaches based on auto-tuners are not performance effective across the wide range of inputs used in practice. In the present paper, we start an investigation of predictive models based on machine learning techniques in order to optimize Convolution Neural Networks (CNNs). As a use-case, we focus on the ARM Compute Library which provides three different implementations of the convolution operator at different numeric precision. Starting from a collation of benchmarks, we build and validate models learned by Decision Tree and naive Bayesian classifier. Preliminary experiments on Midgard-based ARM Mali GPU show that our predictive model outperforms all the convolution operators manually selected by the library.File | Dimensione | Formato | |
---|---|---|---|
2212.05031.pdf
accesso aperto
Tipologia:
Versione finale referata (Postprint, Accepted manuscript)
Licenza:
Creative commons
Dimensione
652.9 kB
Formato
Adobe PDF
|
652.9 kB | Adobe PDF |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.