The aim of this study was to compare the common method of exploiting infrared spectral data in animal breeding; that is, estimating the breeding values for the traits predicted by infrared spectroscopy, and an alternative approach based on the direct use of spectral information (direct prediction, DP) to predict the estimated breeding values (EBV). Traits were pH, milk coagulation properties, contents of the main casein and whey protein fractions, cheese yield measured by micro-cheese making, lactoferrin, Ca, and fat composition. For the DP method, the number of spectral variables was reduced by principal components analysis to 8 latent traits that explained 99% of the original spectral variation. Restricted maximum likelihood was used to estimate variance components of the latent traits. (Co)variance components of the original spectral traits were obtained by back-transformation and EBV of all derived milk traits were then predicted as traits correlated with the genetic information of the spectra. The rank correlation between the EBV obtained for the infrared-predicted traits and those obtained from the DP method was variable across traits. Rank correlations ranged from 0.07 (for the content of saturated fatty acids expressed as g/100 g of fat) to 0.96 (for dry matter cheese yield, %) and, for most traits, was <0.5. This result can be explained by the nature of the principal components analysis: it does not take into account the covariance between the spectral variables and the reference traits but produces latent traits that maximize the spectral variance explained. Thus, the direct approach is more likely to be effective for traits more related to the main sources of spectral variation (i.e., protein and fat). More research is required to study spectral genetic variation and to determine the best way to choose spectral regions and the type and number of considered latent traits for potential applications.

Comparison of Bayesian regression models and partial least squares regression for the development of infrared prediction equations / Bonfatti Valentina; Tiezzi Francesco; Miglior Filippo; Carnier Paolo. - In: JOURNAL OF DAIRY SCIENCE. - ISSN 1525-3198. - ELETTRONICO. - 100:(2017), pp. 7306-7319. [10.3168/jds.2016-12203]

Comparison of Bayesian regression models and partial least squares regression for the development of infrared prediction equations

Tiezzi Francesco;
2017

Abstract

The aim of this study was to compare the common method of exploiting infrared spectral data in animal breeding; that is, estimating the breeding values for the traits predicted by infrared spectroscopy, and an alternative approach based on the direct use of spectral information (direct prediction, DP) to predict the estimated breeding values (EBV). Traits were pH, milk coagulation properties, contents of the main casein and whey protein fractions, cheese yield measured by micro-cheese making, lactoferrin, Ca, and fat composition. For the DP method, the number of spectral variables was reduced by principal components analysis to 8 latent traits that explained 99% of the original spectral variation. Restricted maximum likelihood was used to estimate variance components of the latent traits. (Co)variance components of the original spectral traits were obtained by back-transformation and EBV of all derived milk traits were then predicted as traits correlated with the genetic information of the spectra. The rank correlation between the EBV obtained for the infrared-predicted traits and those obtained from the DP method was variable across traits. Rank correlations ranged from 0.07 (for the content of saturated fatty acids expressed as g/100 g of fat) to 0.96 (for dry matter cheese yield, %) and, for most traits, was <0.5. This result can be explained by the nature of the principal components analysis: it does not take into account the covariance between the spectral variables and the reference traits but produces latent traits that maximize the spectral variance explained. Thus, the direct approach is more likely to be effective for traits more related to the main sources of spectral variation (i.e., protein and fat). More research is required to study spectral genetic variation and to determine the best way to choose spectral regions and the type and number of considered latent traits for potential applications.
2017
100
7306
7319
Bonfatti Valentina; Tiezzi Francesco; Miglior Filippo; Carnier Paolo
File in questo prodotto:
File Dimensione Formato  
bonfatti2017JDS.pdf

accesso aperto

Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 708.71 kB
Formato Adobe PDF
708.71 kB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1258624
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 29
social impact