Runs of homozygosity (ROH) can be defined as sizable chromo- somal stretches of homozygous genotypes, ranging in length from tens of kilobases to megabases. ROHs can be relevant for popu- lation and medical genetics, playing a role in predisposition to both rare and common disorders. ROHs are commonly detected by SNP microarrays, but attempts have been made to use Whole Exome Sequencing (WES) data. Currently available methods devel- oped for the analysis of uniformly spaced SNP-array maps do not fit easily the sparse and non uniform distribution of the WES tar- get design. To meet the need of an approach specifically tailored to WES data we developed H3M2, an original algorithm based on Het- erogeneous Hidden Markov Model that incorporates inter-marker distances to detect ROH from Whole Exome Sequencing (WES) data. We evaluated the performance of H3M2 to correctly identify ROHs on synthetic chromosomes and examined its accuracy in detecting ROHs of different length (short, medium and long) from real 1000 genomes project data. H3M2 turned out to be more accurate than GERMLINE and PLINK, two state-of-the-art algorithms, especially in the detection of short and medium ROHs. H3M2 is freely available at https://sourceforge.net/projects/h3m2/.

H3M2: Detection of runs of homozygosity from whole-exome sequencing data / Magi, Alberto. - In: JOURNAL OF BIOTECHNOLOGY. - ISSN 0168-1656. - ELETTRONICO. - 185:(2014), pp. S15-0. [10.1016/j.jbiotec.2014.07.053]

H3M2: Detection of runs of homozygosity from whole-exome sequencing data

MAGI, ALBERTO
2014

Abstract

Runs of homozygosity (ROH) can be defined as sizable chromo- somal stretches of homozygous genotypes, ranging in length from tens of kilobases to megabases. ROHs can be relevant for popu- lation and medical genetics, playing a role in predisposition to both rare and common disorders. ROHs are commonly detected by SNP microarrays, but attempts have been made to use Whole Exome Sequencing (WES) data. Currently available methods devel- oped for the analysis of uniformly spaced SNP-array maps do not fit easily the sparse and non uniform distribution of the WES tar- get design. To meet the need of an approach specifically tailored to WES data we developed H3M2, an original algorithm based on Het- erogeneous Hidden Markov Model that incorporates inter-marker distances to detect ROH from Whole Exome Sequencing (WES) data. We evaluated the performance of H3M2 to correctly identify ROHs on synthetic chromosomes and examined its accuracy in detecting ROHs of different length (short, medium and long) from real 1000 genomes project data. H3M2 turned out to be more accurate than GERMLINE and PLINK, two state-of-the-art algorithms, especially in the detection of short and medium ROHs. H3M2 is freely available at https://sourceforge.net/projects/h3m2/.
2014
Magi, Alberto
File in questo prodotto:
File Dimensione Formato  
Magi_JournalBiotech_H3M2_2014.pdf

accesso aperto

Descrizione: Articolo principale
Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 54.76 kB
Formato Adobe PDF
54.76 kB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1064545
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 81
  • ???jsp.display-item.citation.isi??? 0
social impact