Modelling protein complexes with crosslinking mass spectrometry and deep learning

Affiliations
  • 1Technische Universität Berlin, Chair of Bioanalytics, Berlin, Germany.
  • 2Georg-August-Universität Göttingen, Department of General Microbiology, Institute for Microbiology & Genetics, GZMB, Göttingen, Germany.
  • 3Technische Universität Berlin, Robotics and Biology Laboratory, Berlin, Germany.
  • 4Science of Intelligence, Research Cluster of Excellence, Berlin, Germany.
  • 5Georg-August-Universität Göttingen, Department of General Microbiology, Institute for Microbiology & Genetics, GZMB, Göttingen, Germany. jstuelk@gwdg.de.
  • 6Technische Universität Berlin, Chair of Bioanalytics, Berlin, Germany. juri.rappsilber@tu-berlin.de.
  • 7Si-M/”Der Simulierte Mensch”, a Science Framework of Technische Universität Berlin and Charité – Universitätsmedizin Berlin, Berlin, Germany. juri.rappsilber@tu-berlin.de.
  • 8Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK. juri.rappsilber@tu-berlin.de.

|

Abstract

Scarcity of structural and evolutionary information on protein complexes poses a challenge to deep learning-based structure modelling. We integrate experimental distance restraints obtained by crosslinking mass spectrometry (MS) into AlphaFold-Multimer, by extending AlphaLink to protein complexes. Integrating crosslinking MS data substantially improves modelling performance on challenging targets, by helping to identify interfaces, focusing sampling, and improving model selection. This extends to single crosslinks from whole-cell crosslinking MS, opening the possibility of whole-cell structural investigations driven by experimental data. We demonstrate this by revealing the molecular basis of iron homoeostasis in Bacillus subtilis.

Related Concept Videos

JoVE Research Video for Protein Networks 02:26

3.9K

An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,…

JoVE Research Video for Peptide Identification Using Tandem Mass Spectrometry 01:33

5.4K

Tandem mass spectrometry, also known as MS/MS or MS2, is an analytical technique that employs two mass analyzers. Essentially it is a series of mass spectrometers that helps isolate a particular biomolecule and then helps study its chemical properties.
This technique helps gather information regarding the protein from which the peptide was obtained and to study the peptides’ amino acid sequence. Identifying peptides from a complex mixture is an important component of the growing field of…

JoVE Research Video for Mass Spectrometry: Complex Analysis 01:21

529

Mass spectrometry is an important technique for the identification of pure compounds. However, it has some limitations for the analysis of complex mixtures, often due to excessive fragmentation making the spectrum too complicated to decipher. Mass spectrometry can be combined with suitable separation methods in sequence, forming hyphenated methods, which are useful in the analysis of complex mixtures.
GC–MS is a powerful hyphenated method commonly used in forensics and environmental…