Motivation In recent years, high-throughput sequencing technologies have made available the genome sequences of a huge variety of organisms. However, the functional annotation of the encoded proteins often still relies on low-throughput and costly experimental studies. Bioinformatics approaches offer a promising alternative to accelerate this process. In this work, we focus on the binding of zinc(II) ions, which is needed for 5%-10% of any organism's proteins to achieve their physiologically relevant form.Results To implement a predictor of zinc(II)-binding sites in the 3D structures of proteins, we used a neural network, followed by a filter of the network output against the local structure of all known sites. The latter was implemented as a function comparing the distance matrices of the C alpha and C beta atoms of the sites. We called the resulting tool Master of Metals (MOM). The structural models for the entire proteome of an organism generated by AlphaFold can be used as input to our tool in order to achieve annotation at the whole organism level within a few hours. To demonstrate this, we applied MOM to the yeast proteome, obtaining a precision of about 76%, based on data for homologous proteins.Availability and implementation Master of Metals has been implemented in Python and is available at https://github.com/cerm-cirmmp/Master-of-metals.Graphical Abstract
Hunting down zinc(II)-binding sites in proteins with distance matrices / Laveglia, Vincenzo; Bazayeva, Milana; Andreini, Claudia; Rosato, Antonio. - In: BIOINFORMATICS. - ISSN 1367-4811. - ELETTRONICO. - 39:(2023), pp. btad653.0-btad653.0. [10.1093/bioinformatics/btad653]
Hunting down zinc(II)-binding sites in proteins with distance matrices
Laveglia, Vincenzo;Bazayeva, Milana;Andreini, Claudia;Rosato, Antonio
2023
Abstract
Motivation In recent years, high-throughput sequencing technologies have made available the genome sequences of a huge variety of organisms. However, the functional annotation of the encoded proteins often still relies on low-throughput and costly experimental studies. Bioinformatics approaches offer a promising alternative to accelerate this process. In this work, we focus on the binding of zinc(II) ions, which is needed for 5%-10% of any organism's proteins to achieve their physiologically relevant form.Results To implement a predictor of zinc(II)-binding sites in the 3D structures of proteins, we used a neural network, followed by a filter of the network output against the local structure of all known sites. The latter was implemented as a function comparing the distance matrices of the C alpha and C beta atoms of the sites. We called the resulting tool Master of Metals (MOM). The structural models for the entire proteome of an organism generated by AlphaFold can be used as input to our tool in order to achieve annotation at the whole organism level within a few hours. To demonstrate this, we applied MOM to the yeast proteome, obtaining a precision of about 76%, based on data for homologous proteins.Availability and implementation Master of Metals has been implemented in Python and is available at https://github.com/cerm-cirmmp/Master-of-metals.Graphical AbstractFile | Dimensione | Formato | |
---|---|---|---|
Bioinformatics2023_MoM.pdf
accesso aperto
Tipologia:
Pdf editoriale (Version of record)
Licenza:
Open Access
Dimensione
3.2 MB
Formato
Adobe PDF
|
3.2 MB | Adobe PDF |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.