Artificial neural networks are machine learning models that have been applied to various genomic problems, with the ability to learn non-linear relationships and model high-dimensional data. These advanced modeling capabilities make them promising candidates for genomic prediction by potentially capturing the intricate relationships between genetic variants and phenotypes. Despite these theoretical advantages, neural networks have shown inconsistent performance across previous genomic prediction research, and limited studies have evaluated their performance and feasibility specifically for pig genomic predictions using large-scale data. We evaluated the predictive performance of feed-forward neural network (FFNN) models implemented in TensorFlow with architectures ranging from single-layer (no hidden layers) to four-layer structures (three hidden layers). These FFNN models were compared with five linear methods, including GBLUP, LDAK-BOLT, BayesR, SLEMM-WW, and scikit-learn’s ridge regression. The evaluation utilized data from six quantitative traits: off-test body weight (WT), off-test back fat thickness (BF), off-test loin muscle depth (MS), number of piglets born alive (NBA), number of piglets born dead (NBD), and number of piglets weaned (NW). We also assessed the computational efficiency of FFNN models on both CPU and GPU. The benchmarking employed repeated random subsampling validation with sample sizes ranging from 3,290 individuals for reproductive traits to over 26,000 individuals for production traits, using data from a total of 27,481 genotyped pigs. Hyperband tuning was used to optimize the hyper- parameters and select the best model for each structure. Results showed that FFNN models consistently underperformed compared to linear methods across all architectures tested. The one-layer structure yielded the best predictive accuracy among the FFNN approaches. Of the five linear methods, SLEMM- WW demonstrated the best balance of computational efficiency and predictive ability. GPUs offered significant computational efficiency gains for multi-layer FFNN models compared to CPUs, though FFNN models remained more computationally demanding than most linear methods. In conclusion, FFNN models with up to four layers did not improve genomic predictions compared to routine linear methods for pig quantitative traits.

Benchmarking of feed-forward neural network models for genomic prediction of quantitative traits in pigs / Wang J.; Tiezzi Francesco; Huang Y.; Maltecca C.; Jiang J.. - In: FRONTIERS IN GENETICS. - ISSN 1664-8021. - ELETTRONICO. - (2025), pp. 1-12. [10.3389/fgene.2025.1618891]

Benchmarking of feed-forward neural network models for genomic prediction of quantitative traits in pigs

Tiezzi Francesco;Maltecca C.;
2025

Abstract

Artificial neural networks are machine learning models that have been applied to various genomic problems, with the ability to learn non-linear relationships and model high-dimensional data. These advanced modeling capabilities make them promising candidates for genomic prediction by potentially capturing the intricate relationships between genetic variants and phenotypes. Despite these theoretical advantages, neural networks have shown inconsistent performance across previous genomic prediction research, and limited studies have evaluated their performance and feasibility specifically for pig genomic predictions using large-scale data. We evaluated the predictive performance of feed-forward neural network (FFNN) models implemented in TensorFlow with architectures ranging from single-layer (no hidden layers) to four-layer structures (three hidden layers). These FFNN models were compared with five linear methods, including GBLUP, LDAK-BOLT, BayesR, SLEMM-WW, and scikit-learn’s ridge regression. The evaluation utilized data from six quantitative traits: off-test body weight (WT), off-test back fat thickness (BF), off-test loin muscle depth (MS), number of piglets born alive (NBA), number of piglets born dead (NBD), and number of piglets weaned (NW). We also assessed the computational efficiency of FFNN models on both CPU and GPU. The benchmarking employed repeated random subsampling validation with sample sizes ranging from 3,290 individuals for reproductive traits to over 26,000 individuals for production traits, using data from a total of 27,481 genotyped pigs. Hyperband tuning was used to optimize the hyper- parameters and select the best model for each structure. Results showed that FFNN models consistently underperformed compared to linear methods across all architectures tested. The one-layer structure yielded the best predictive accuracy among the FFNN approaches. Of the five linear methods, SLEMM- WW demonstrated the best balance of computational efficiency and predictive ability. GPUs offered significant computational efficiency gains for multi-layer FFNN models compared to CPUs, though FFNN models remained more computationally demanding than most linear methods. In conclusion, FFNN models with up to four layers did not improve genomic predictions compared to routine linear methods for pig quantitative traits.
2025
1
12
Wang J.; Tiezzi Francesco; Huang Y.; Maltecca C.; Jiang J.
File in questo prodotto:
File Dimensione Formato  
wang2025fig.pdf

accesso aperto

Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 1.73 MB
Formato Adobe PDF
1.73 MB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1437921
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact