Motivation: The identification of cell cycle-regulated genes through the cyclicity of messenger RNAs in genome-wide studies is a difficult task due to the presence of internal and external noise in microarray data. Moreover, the analysis is also complicated by the loss of syn- chrony occurring in cell cycle experiments, which often results in add- itional background noise. Results: To overcome these problems, here we propose the LEON (LEarning and OptimizatioN) algorithm, able to characterize the ‘cycli- city degree’ of a gene expression time profile using a two-step cas- cade procedure. The first step identifies a potentially cyclic behavior by means of a Support Vector Machine trained with a reliable set of positive and negative examples. The second step selects those genes having peak timing consistency along two cell cycles by means of a non-linear optimization technique using radial basis functions. To prove the effectiveness of our combined approach, we use recently published human fibroblasts cell cycle data and, performing in vivo experiments, we demonstrate that our computational strategy is able not only to confirm well-known cell cycle-regulated genes, but also to predict not yet identified ones.

Combining optimization and machine learning techniques for genome-wide prediction of human cell cycle-regulated genes / DE SANTIS, MARIANNA; F. Rinaldi; E. Falcone; LUCIDI, Stefano; G. Piaggio; A. Gurtner; FARINA, Lorenzo. - In: BIOINFORMATICS. - ISSN 1367-4803. - STAMPA. - 30:(2014), pp. 228-233. [10.1093/bioinformatics/btt671]

Combining optimization and machine learning techniques for genome-wide prediction of human cell cycle-regulated genes

DE SANTIS, MARIANNA;LUCIDI, Stefano;
2014

Abstract

Motivation: The identification of cell cycle-regulated genes through the cyclicity of messenger RNAs in genome-wide studies is a difficult task due to the presence of internal and external noise in microarray data. Moreover, the analysis is also complicated by the loss of syn- chrony occurring in cell cycle experiments, which often results in add- itional background noise. Results: To overcome these problems, here we propose the LEON (LEarning and OptimizatioN) algorithm, able to characterize the ‘cycli- city degree’ of a gene expression time profile using a two-step cas- cade procedure. The first step identifies a potentially cyclic behavior by means of a Support Vector Machine trained with a reliable set of positive and negative examples. The second step selects those genes having peak timing consistency along two cell cycles by means of a non-linear optimization technique using radial basis functions. To prove the effectiveness of our combined approach, we use recently published human fibroblasts cell cycle data and, performing in vivo experiments, we demonstrate that our computational strategy is able not only to confirm well-known cell cycle-regulated genes, but also to predict not yet identified ones.
2014
30
228
233
DE SANTIS, MARIANNA; F. Rinaldi; E. Falcone; LUCIDI, Stefano; G. Piaggio; A. Gurtner; FARINA, Lorenzo
File in questo prodotto:
File Dimensione Formato  
DeSantis_Combining_2014.pdf

Accesso chiuso

Licenza: Tutti i diritti riservati
Dimensione 391.71 kB
Formato Adobe PDF
391.71 kB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1350107
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 5
social impact