A computational pipeline to discover highly phylogenetically informative genes in
sequenced genomes: application to S. cerevisiae natural strains.

Ramazzotti, Matteo; Berná, L.; Stefanini, Irene; Cavalieri, Duccio

doi:10.1093/nar/gks005

The quest for genes representing genetic relationships of strains or individuals within populations and their evolutionary history is acquiring a novel dimension of complexity with the advancement of next-generation sequencing (NGS) technologies. In fact, sequencing an entire genome uncovers genetic variation in coding and non-coding regions and offers the possibility of studying Saccharomyces cerevisiae populations at the strain level. Nevertheless, the disadvantageous cost-benefit ratio (the amount of details disclosed by NGS against the time-expensive and expertise-demanding data assembly process) still precludes the application of these techniques to the routinely assignment of yeast strains, making the selection of the most reliable molecular markers greatly desirable. In this work we propose an original computational approach to discover genes that can be used as a descriptor of the population structure. We found 13 genes whose variability can be used to recapitulate the phylogeny obtained from genome-wide sequences. The same approach that we prove to be successful in yeasts can be generalized to any other population of individuals given the availability of high-quality genomic sequences and of a clear population structure to be targeted.

A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to S. cerevisiae natural strains / M. Ramazzotti; L. Berná; I. Stefanini; D. Cavalieri. - In: NUCLEIC ACIDS RESEARCH. - ISSN 0305-1048. - STAMPA. - 40:(2012), pp. 3834-3848. [10.1093/nar/gks005]

A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to S. cerevisiae natural strains.

RAMAZZOTTI, MATTEO;L. Berná;STEFANINI, IRENE;CAVALIERI, DUCCIO

2012

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
			2012
		
	Rivista
	
			NUCLEIC ACIDS RESEARCH
		
	Volume
	
			40
		
	Pagina iniziale
	
			3834
		
	Pagina finale
	
			3848
		
	Tutti gli autori
	
			M. Ramazzotti; L. Berná; I. Stefanini; D. Cavalieri
		
	Appare nelle tipologie:
	
			1a - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Ramazzotti M Cavalieri D NAR 2012 - A computation pipeline for.pdf accesso aperto Tipologia: Versione finale referata (Postprint, Accepted manuscript) Licenza: Tutti i diritti riservati Dimensione 3.96 MB Formato Adobe PDF	3.96 MB	Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/781628

A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to S. cerevisiae natural strains.

RAMAZZOTTI, MATTEO;L. Berná;STEFANINI, IRENE;CAVALIERI, DUCCIO

2012

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

A computational pipeline to discover highly phylogenetically informative genes in sequenced genomes: application to S. cerevisiae natural strains.

RAMAZZOTTI, MATTEO;L. Berná;STEFANINI, IRENE;CAVALIERI, DUCCIO

2012

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)