Motivation In recent years, high-throughput sequencing technologies have made available the genome sequences of a huge variety of organisms. However, the functional annotation of the encoded proteins often still relies on low-throughput and costly experimental studies. Bioinformatics approaches offer a promising alternative to accelerate this process. In this work, we focus on the binding of zinc(II) ions, which is needed for 5%-10% of any organism's proteins to achieve their physiologically relevant form.Results To implement a predictor of zinc(II)-binding sites in the 3D structures of proteins, we used a neural network, followed by a filter of the network output against the local structure of all known sites. The latter was implemented as a function comparing the distance matrices of the C alpha and C beta atoms of the sites. We called the resulting tool Master of Metals (MOM). The structural models for the entire proteome of an organism generated by AlphaFold can be used as input to our tool in order to achieve annotation at the whole organism level within a few hours. To demonstrate this, we applied MOM to the yeast proteome, obtaining a precision of about 76%, based on data for homologous proteins.Availability and implementation Master of Metals has been implemented in Python and is available at https://github.com/cerm-cirmmp/Master-of-metals.Graphical Abstract

Hunting down zinc(II)-binding sites in proteins with distance matrices / Laveglia, Vincenzo; Bazayeva, Milana; Andreini, Claudia; Rosato, Antonio. - In: BIOINFORMATICS. - ISSN 1367-4811. - ELETTRONICO. - 39:(2023), pp. btad653.0-btad653.0. [10.1093/bioinformatics/btad653]

Hunting down zinc(II)-binding sites in proteins with distance matrices

Laveglia, Vincenzo;Bazayeva, Milana;Andreini, Claudia;Rosato, Antonio
2023

Abstract

Motivation In recent years, high-throughput sequencing technologies have made available the genome sequences of a huge variety of organisms. However, the functional annotation of the encoded proteins often still relies on low-throughput and costly experimental studies. Bioinformatics approaches offer a promising alternative to accelerate this process. In this work, we focus on the binding of zinc(II) ions, which is needed for 5%-10% of any organism's proteins to achieve their physiologically relevant form.Results To implement a predictor of zinc(II)-binding sites in the 3D structures of proteins, we used a neural network, followed by a filter of the network output against the local structure of all known sites. The latter was implemented as a function comparing the distance matrices of the C alpha and C beta atoms of the sites. We called the resulting tool Master of Metals (MOM). The structural models for the entire proteome of an organism generated by AlphaFold can be used as input to our tool in order to achieve annotation at the whole organism level within a few hours. To demonstrate this, we applied MOM to the yeast proteome, obtaining a precision of about 76%, based on data for homologous proteins.Availability and implementation Master of Metals has been implemented in Python and is available at https://github.com/cerm-cirmmp/Master-of-metals.Graphical Abstract
2023
39
0
0
Goal 3: Good health and well-being
Laveglia, Vincenzo; Bazayeva, Milana; Andreini, Claudia; Rosato, Antonio
File in questo prodotto:
File Dimensione Formato  
Bioinformatics2023_MoM.pdf

accesso aperto

Tipologia: Pdf editoriale (Version of record)
Licenza: Open Access
Dimensione 3.2 MB
Formato Adobe PDF
3.2 MB Adobe PDF

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/1352495
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact