Auteurs : Cowen, Lenore (Auteur de la Conférence)
CIRM (Editeur )
Résumé :
Computational methods to predict protein-protein interaction (PPI) typically segregate into sequence-based 'bottom-up' methods that infer properties from the characteristics of the individual protein sequences, or global 'top-down' methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate topdown insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-ofthe-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. This is joint work with Rohit Singh, Kapil Devkota, and Bonnie Berger.
Keywords : protein-protein interaction; protein-protein interaction network; PPI network; pret-trained language model; protein sequence
Codes MSC :
92B99
- None of the above but in this section
92C05
- Biophysics
92D10
- Genetics
|
Informations sur la Rencontre
Nom de la rencontre : Interplay between AI and mathematical modelling in the post-structural genomics era / Interaction entre l'IA et la modélisation mathématique à l'ère post-génomique structurale Organisateurs de la rencontre : Fidelis, Krzysztof ; Grudinin, Sergei ; Laine, Elodie Dates : 20/03/2023 - 24/03/2023
Année de la rencontre : 2023
URL Congrès : https://conferences.cirm-math.fr/2767.html
DOI : 10.24350/CIRM.V.20020003
Citer cette vidéo:
Cowen, Lenore (2023). Topsy-Turvy: integrating a global view into sequence-based protein-protein interaction prediction. CIRM. Audiovisual resource. doi:10.24350/CIRM.V.20020003
URI : http://dx.doi.org/10.24350/CIRM.V.20020003
|
Voir aussi
Bibliographie