Extended Pipeline for Content-Based Feature Engineering in Music Genre Recognition / Tina Raissi, Alessandro Tibo, Paolo Bientinesi. - ELECTRONIC. - (2018), pp. 2661-2665. (Paper presented at the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)).
Extended Pipeline for Content-Based Feature Engineering in Music Genre Recognition
Seyedeh Tina Raissi; Alessandro Tibo
2018
Abstract
We present a feature engineering pipeline for constructing musical signal characteristics, to be used in the design of a supervised model for music genre identification. The key idea is to extend the traditional two-step process of extraction and classification with additional stand-alone phases that are no longer organized in a waterfall scheme. Instead, the system is realized by traversing backtracking arrows and cycles between the various stages. To obtain a compact and effective representation of the features, the standard early temporal integration is combined with further selection and extraction phases: on the one hand, the selection of the most meaningful characteristics based on information gain, and on the other hand, the inclusion of the nonlinear correlation between the features in this subset, determined by an autoencoder. Experiments conducted on the GTZAN dataset show a noticeable contribution of this methodology to the model's performance in the classification task.
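Two of the stages named in the abstract, early temporal integration and information-gain-based selection, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the mean/variance summary, the equal-width binning with two bins, and the toy data are all assumptions made for illustration, and the autoencoder stage is omitted.

```python
import math
from collections import Counter
from statistics import mean, pvariance


def integrate(frames):
    """Early temporal integration (assumed form): collapse a track's
    frame-level feature vectors into per-dimension means, then variances."""
    columns = list(zip(*frames))
    return [mean(c) for c in columns] + [pvariance(c) for c in columns]


def entropy(labels):
    """Shannon entropy, in bits, of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(values, labels, bins=2):
    """Information gain of one feature after equal-width binning."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # constant feature: put everything in bin 0
    binned = [min(int((v - lo) / width), bins - 1) for v in values]
    gain = entropy(labels)
    for b in set(binned):
        subset = [y for x, y in zip(binned, labels) if x == b]
        gain -= len(subset) / len(labels) * entropy(subset)
    return gain


def select_top_k(vectors, labels, k):
    """Indices of the k features with the highest information gain."""
    gains = [information_gain(col, labels) for col in zip(*vectors)]
    return sorted(range(len(gains)), key=lambda i: gains[i], reverse=True)[:k]


# Hypothetical toy data: (frame-level features, genre) per track.
# Only the first frame-level feature separates the two genres.
tracks = [
    ([[0.9, 0.5], [0.7, 0.1]], "rock"),
    ([[0.8, 0.1], [0.8, 0.5]], "rock"),
    ([[0.1, 0.5], [0.3, 0.1]], "jazz"),
    ([[0.2, 0.1], [0.2, 0.5]], "jazz"),
]
vectors = [integrate(frames) for frames, _ in tracks]
labels = [genre for _, genre in tracks]
selected = select_top_k(vectors, labels, k=1)
print(selected)
```

In a fuller pipeline, the selected subset would then be passed to an autoencoder to capture nonlinear correlations between the retained features, as the abstract describes.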