Team:TU Darmstadt/Modeling GNM
From 2012.igem.org
Homology Modeling | | Gaussian Networks | | Molecular Dynamics | | Information Theory | | Docking Simulation |
---|
Contents |
Gaussian network model
Theory
Nearly all biologically important processes such as enzyme catalysis, ligand binding and allosteric regulation occur on a large time-scale (micro- to millisecond). A Gaussian network model (GNM) is a coarse-grained representation of a protein as a network consisting of balls and springs. In our approach, the protein chain is reduced to the coordinates of the Cα atom of each residue, represented by a ball. The springs simulate the connection between the amino acids. While Molecular Dynamics (MD) simulations are computational expensive, a GNM calculation only needs a few seconds.
Computation
The dynamics of the structure in the GNM is described by the topology of contacts within the Kirchhoff matrix G. Thus in this network of N interacting sites, the elements of G are computed as:
where Rij is the distance between point i and j. We used Gamma as the intra Cα-contact matrix. The inverse of it describes correlations between fluctuations within the proteins native state. The diagonal of the matrix is replaced by the sum of contacts of one Cα-atom within the whole protein. After a singular value decomposition (SVD) we have calculated the normal modes of the protein. Slow modes describe functionally relevant residues within a biomolecule[2]. The opposite, Fast modes, represent an uncorrelated motion without significant changes in the structure.
A recent examination of the X-ray crystallographic B-factors of over 100 proteins showed that the GNM closely reproduces the experimental data [3].
Application to our Proteins
We computed the GNM in R [4] by using the BioPhysConnectoR [5] library.
- PnB-Esterase 13
- AroY
- TpHA1
- TpHA2
- TpHA3
Results
For the visualisation of the slow modes we colour coated the molcule structures. That actually means we replaced the atomic b-factors in our pdb files and substituted them with the slow mode values. Hence we are able to plot a heat-map on our protein structure. High slow mode values are red colored and low blue.
AroY
Here you can see AroY in ribbon representation (left) and the slow modes as function of the residue number (right).
Here we can easily conclude that the C-terminal Region of AroY seems to be highly flexible in contrast of the rest of the protein. This could be an mechanical important feature or otherwise a membrane bounded anchor. Hence it will be not so clever to tag it with GFP or fuse it to s different protein (C-terminal). Thus AroY is an excellent example of how GNM can help your experimental design.
TpHA2
We illustrate TphA2 in ribbon representation and the slow modes as a function of residue number. The C-terminal region of the TphA2 seems to be rigid. The residues TRP 107 , SER 108, TYR 107 , HIS 105 and the amino acids around peaking in a correlated movement. Note that the model used for this GNM was one of our homology models with a poor scoring. Although this residues may be important for the movement they are located in a bad scoring loop region. All in all this model needs further work and optimization.
TpHB
[[File: |700px|center]]
- PnB-Esterase 13
References
[1] A. R. Atilgan, S. R. Durell, R. L. Jernigan, M. C. Demirel, O. Keskin, and I. Bahar, “Anisotropy of fluctuation dynamics of proteins with an elastic network model.,” Biophys J, vol. 80, no. 1, pp. 505–515, Jan. 2001.
[2] C. Chennubhotla, A. J. Rader, L.-W. Yang, and I. Bahar, “Elastic network models for understanding biomolecular machinery: from enzymes to supramolecular assemblies.,” Physical Biology, vol. 2, no. 4, pp. S173–S180, 2005.
[3] I. Bahar and A. J. Rader, “Coarse-grained normal mode analysis in structural biology.,” Current Opinion in Structural Biology, vol. 15, no. 5, pp. 586–592, 2005.
[4] R. D. C. Team, “R: A Language and Environment for Statistical Computing.” Vienna, Austria, 2008.
[5] F. Hoffgaard, P. Weil, and K. Hamacher, “BioPhysConnectoR: Connecting sequence information and biophysical models.,” BMC Bioinformatics, vol. 11, p. 199, 2010.