Read count approach for DNA copy number variants detection / A. Magi; L. Tattini; T. Pippucci; F. Torricelli; M. Benelli. - In: BIOINFORMATICS. - ISSN 1367-4803. - PRINT. - 28:(2012), pp. 470-478. [10.1093/bioinformatics/btr707]

Read count approach for DNA copy number variants detection.

MAGI, ALBERTO;TORRICELLI, FRANCESCA;BENELLI, MATTEO
2012

Abstract

The advent of high-throughput sequencing technologies is revolutionizing our ability to discover and genotype DNA copy number variants (CNVs). Read count-based approaches can detect CNV regions with unprecedented resolution. Although this computational strategy was introduced in the literature only recently, much work has already been done on the preparation, normalization and analysis of this kind of data. Here we address the many aspects of CNV detection using the read count approach. We first study the characteristics and systematic biases of read count distributions, focusing on the normalization methods designed to remove these biases. Subsequently, we compare the algorithms designed to detect the boundaries of CNVs and investigate the ability of read count data to predict the exact DNA copy number. Finally, we review the publicly available tools for analysing read count data. To better understand the state of the art of read count approaches, we compare the performance of the three most widely used sequencing technologies (Illumina Genome Analyzer, Roche 454 and Life Technologies SOLiD) in all the analyses that we perform.

Contact: albertomagi@gmail.com

Supplementary information: Supplementary data are available at Bioinformatics online.
Year: 2012
Volume: 28
Pages: 470-478
Authors: A. Magi; L. Tattini; T. Pippucci; F. Torricelli; M. Benelli
Files in this record:
Magi_Bioinformatics_ReadCount_2012.pdf

Closed access

Description: Main article
Type: Publisher's PDF (Version of record)
License: All rights reserved
Size: 623.75 kB
Format: Adobe PDF

Documents in FLORE are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this resource: https://hdl.handle.net/2158/606819
Citations
  • PMC 29
  • Scopus 66
  • ISI 63