<< Chapter < Page Chapter >> Page >

A prm-based approach

The research group of Nancy Amato has been working on roadmap-based methods to study the process of protein folding . They start with the known native structure of a protein, and incrementally find conformations more and more different from the native state, and build a roadmap using these conformations. The goal is to find a large ensemble of pathways between the native structure and unfolded structures, and to study these pathways and their properties. In their work, the degrees of freedom of the protein are assumed to be the φ and ψ backbone dihedral angles of each amino acid residue. The side chains are assumed to be rigidly attached to the backbone. In their initial work, they generated new conformations for their roadmaps by adjusting the backbone dihedral angles in the folded conformation randomly with various standard deviations. This approach worked well for very small proteins, but did not scale well.

A later sampling approach was based on counting native contacts . For the purposes of their method, a native contact was defined as any two alpha-carbons within 7 Å of each other in the folded state of the protein. A new conformation is generated at each step of the sampling phase of the roadmap construction by randomly perturbing some existing sample. The resulting structure was accepted with a probability as follows:

The acceptance criterion for newly sampled conformations.
The energy, E, used in this research includes a term favoring known secondary structure contacts, and a Lennard-Jones 12-6 term as presented in the previous section. The parameters of the Lennard-Jones term are selected to favor interactions between H and O atoms, and thus hydrogen bonds. The energy thresholds for acceptance are decided by experiment. If accepted, a structure is placed in a bin corresponding to the number of native contacts present, with one bin for each possible number of native contacts from 1 to n. The procedure begins by sampling around structures in the 100% native contacts bin (initially, just the native structure). Once the next bin, with one fewer contacts, is full, samples are made by perturbing structures in that bin. Thus, the sampling generally proceeds from structures with all native contacts to structures with very few native contacts.

Once the samples are generated, an attempt is made to connect the k nearest neighbors of each node to the node itself. A series of conformations on the line connecting the two nodes are tested, and if their energy is below a threshold, the edge is included in the roadmap. The weight of the edge depends on the sequence of energies of the conformations computed along the connecting line. The probability of moving from the (i-1)th structure to the ith along the line is given by:

The weight of an edge is the sum of the logarithms of the probabilities, and is intended to represent the energetic feasibility of making the transition from the conformation represented by one node to the next. This assumes that moving from one node in the roadmap to an adjacent one consists of a series of move along the edge, each associated with a probability. Note that in reality, the path taken by a protein transitioning between two structures need not be a straight line in conformation space.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Geometric methods in structural computational biology. OpenStax CNX. Jun 11, 2007 Download for free at http://cnx.org/content/col10344/1.6
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Geometric methods in structural computational biology' conversation and receive update notifications?

Ask