Regulation of Pak2 activity involves at least two mechanisms: (i) phosphorylation of the conserved Thr402 in the activation loop and (ii) interaction of the autoinhibitory domain (AID) with the catalytic domain. We collected 482 human protein kinase sequences from the kinome database and globally mapped the evolutionary interactions of the residues in the catalytic domain with Thr402 by sequence-based statistical coupling analysis (SCA). Perturbation of Thr402 (34.6%) suggests a communication pathway between Thr402 in the activation loop, and Phe387 (ΔΔE387F,402T = 2.80) in the magnesium positioning loop, Trp427 (ΔΔE427W,402T = 3.12) in the F-helix, and Val404 (ΔΔE404V,402T = 4.43) and Gly405 (ΔΔE405G,402T = 2.95) in the peptide positioning loop. When compared to the cAMP-dependent protein kinase (PKA) and Src, the perturbation pattern of threonine phosphorylation in the activation loop of Pak2 is similar to that of PKA, and different from the tyrosine phosphorylation pattern of Src. Reciprocal coupling analysis by SCA showed the residues perturbed by Thr402 and the reciprocal coupling pairs formed a network centered at Trp427 in the F-helix. Nine pairs of reciprocal coupling residues crucial for enzymatic activity and structural stabilization were identified. Pak2, PKA and Src share four pairs. Reciprocal coupling residues exposed to the solvent line up as an activation groove. This is the inhibitor (PKI) binding region in PKA and the activation groove for Pak2. This indicates these evolutionary conserved residues are crucial for the catalytic activity of PKA and Pak2.
Citation: Hsu Y-H, Traugh JA (2010) Reciprocally Coupled Residues Crucial for Protein Kinase Pak2 Activity Calculated by Statistical Coupling Analysis. PLoS ONE 5(3): e9455. doi:10.1371/journal.pone.0009455
Editor: Andreas Hofmann, Griffith University, Australia
Received: September 29, 2009; Accepted: February 9, 2010; Published: March 1, 2010
Copyright: © 2010 Hsu, Traugh. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This project was supported by National Institutes of Health (NIH) grant GM26738 to Jolinda A.Traugh. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Statistical coupling analysis (SCA) regards evolution as a natural mutagenesis process and utilizes the known protein sequences to economically examine comprehensive correlations between amino acid residues. This analysis retrieves two co-evolved residues in a protein for either structural or functional reasons. The statistical coupling through sequence-based analysis was introduced as a reporter of thermodynamic coupling in proteins with PDZ domains . Other research applied the matrix clustering analysis to systematically analyze large amounts of SCA data to identify the key residues for the structures of the G protein-coupled receptors , G proteins , and RXR heterodimers . Cross-correlation analysis of sequence-based SCA and structure-based molecular dynamics predicted energetic coupling residues essential for HhaI (Haemophilus haemolyticus I) methyltransferase catalysis .
The SCA results are presented in a two-dimensional array (matrix), representing the correlation values between any of the two residues. Suel et al.  used matrix clustering analysis to find the correlation patterns from the two-dimensional SCA array. The correlation pattern showed the residues coupled reciprocally to structural determinants of the G protein-coupled receptors. We incorporated this concept and retrieved all reciprocal coupling residues of Pak2. The distilled results of the statistical coupling analysis eliminated all perturbations from a single direction. These reciprocally coupled pairs of residues are the structurally or functionally coupled residues occurring through evolution.
Pak2 belongs to the serine/threonine protein kinase family containing the conventional Pak 1–3 and Pak 4–6 . Pak2 is primarily inactive during growth, and is transiently activated by Cdc42 in response to moderate stress, and constitutively activated during apoptosis by caspase cleavage , . Pak2 has a basal autophosphorylation activity wherein five serines in the regulatory domain of Pak2 can be autophosphorylated . Activation of the protein kinase requires autophosphorylation at seven serine sites in the regulatory domain and one conserved threonine in the catalytic domain , . Cleavage by caspase 3 or binding of Cdc42(GTP) facilitates autophosphorylation at Ser141, Ser165 and Thr402 , . Thr402 and Ser141 are critical for Pak2 regulation and activation , .
Inactive Pak1 exists in solution as a homodimer, with the autoinhibitory domain (AID) of each monomer interacting with its partner to inhibit activity , . As Pak1 and Pak2 are homologous, it is generally assumed that Pak2 exists as a trans-autoinhibited homodimer. The bilobal structure of the catalytic domain is the conserved signature in protein kinases. Movement of the two lobes to an open or closed conformation regulates the phosphorylation efficiency in the cAMP-dependent protein kinase (PKA) and the insulin receptor , . Pak2 catalytic activity is inhibited by binding of two α-helices in the AID (residues 111–131) to the G-helix, and the kinase inhibitory segment (KI, residues 136–149) to the active site cleft –. Autophosphorylation of Thr402 is through intermolecular phosphorylation, as opposed to the intra-molecular phosphorylation of the other 7 serine sites .
Post-analysis of the SCA data requires a good structural model. Although there is no Pak2 structure published, X-ray crystal structures of Pak1 are good working models for Pak2. Pak2 has 93% homology in the catalytic domain. Both the inactive (1F3M.PDB) and active conformations (1YHV.PDB) of Pak1 were utilized for analysis of Pak2 , . The inactive conformation (1F3M.PDB) illustrates a catalytic domain with a disordered activation loop, hydrogen bonding between the Lys138 (Lys141 for Pak1) to the catalytic residue Asp368 (Asp389 for Pak1), and hydrophobic interactions between the AID and the catalytic domain . The active conformation of Pak1, containing only the catalytic domain, shows a conformational change in the activation loop as compared to the inactive conformation .
Human protein kinases from 8 hierarchical groups with the conserved bilobal structure in their kinase domain are the perfect candidates for SCA . We applied the SCA to the sequence of Pak2 (STE hierarchical group) to examine the evolutionary structural and functional determinants in the catalytic domain. Because Pak2 autophosphorylation on Thr402 in the activation loop is necessary for Pak2 activity, analysis of the correlative sites in the catalytic domain will define residues that co-evolved with Thr402. Two other protein kinases, PKA and Src (in the AGC and TK protein kinase hierarchical groups, respectively) were examined and compared to Pak2. PKA is a tetramer of two regulatory and catalytic subunits. The catalytic subunit has a similar structure to the catalytic domain of Pak2 and was used to analyze the solvent accessibility of Pak2 by H/D exchange . PKA has the conserved threonine phosphorylation site in the activation loop, and the crystal structures of PKA provide good working models to study Pak2 , . The protein tyrosine kinase Src is a monomer or a dimer, and has many published crystal structures –. Tyrosine (Tyr416) in the activation loop replaces the conserved Thr402 in Pak2. The differences between threonine and tyrosine phosphorylation in the activation loop will provide a good contrast to study the effects of phosphorylation on the activation loop of Pak2.
In this study, the reciprocal coupling residues from the SCA calculation were mapped onto the Pak1 structure to illustrate their relative positions, and to identify possible interactions and functional effects for these residues. Perturbation of Thr402 on Pak2 showed local effects at Val404 and Gly405 in the activation loop and remote effects through a network centered at Trp427. Perturbation of Thr402 on the activation loop of Pak2 was similar to that of PKA, while there were significant differences in perturbation with Src Tyr416(Src). Nine reciprocal coupling pairs in Pak2 were identified. The reciprocal coupling residues on the Pak2 surface line up as an activation groove extended from the active site cleft to the large lobe.
Multiple Sequence Alignments (MSA)
The sequences of all 637 human protein kinases were collected from the human kinome database . After eliminating sequences for 106 nonfunctional copies of kinase genes (pseudogenes), 40 atypical kinases, 6 secondary inactive kinase domains, and 3 partial sequences containing less than 200 residues, 482 final sequences were utilized for alignment. We included 47 human kinase domains (pseudo kinases) that lacked some conserved catalytic residues and were predicted to be enzymatically inactive , because they have the features of the conserved bilobal structure and increased the diversity of the pool. The collected sequences were aligned using the multiple sequence alignment program, MUSCLE, which processed all sequences at the same time to prevent inconsistencies . The initial alignment from MUSCLE included unnecessary gaps in the target kinase Pak2, expanding the length of sequence about 3-fold. We truncated the gaps to shorten the alignment to 252 residues for the purpose of analysis. The final multiple sequence alignments for the calculation of Pak2 (Figure S1), PKA (Figure S2) and Src (Figure S3) are provided.
Calculation of Statistical Parameters in MSA
The SCA was calculated as described by Lockless and Ranganathan . The evolutionary conservation parameter (ΔE), based on the 20 amino acid distribution of the human protein kinase alignment, was compared to that of eukaryotic ExPASy database. In MSA, the evolutionary conservation value at each position i was quantitated as a vector of 20 amino acid frequencies and defined as Equation 1.(1)
ΔE values have the arbitrary energy unit, kT*. The unit was omitted to prevent confusion with the real energy unit. X was any of the 20 amino acids. Pi and PMSA were the binomial probability at site i and MSA respectively. The amino acid frequencies in eukaryotic proteins were used as the frequency reference for the probability calculation. The randomness of the amino acid distribution determined the value of evolutionary conservation at each point.
The statistical coupling analysis quantitatively measured the change of the amino acid distribution at site I, caused by conservation of site j in the MSA. The subset, consisting of the sequences with the same amino acid as Pak2 at site j, was collected for coupling analysis. Equation 2 was used to calculate the difference in the amino acid distribution between the subset alignment and MSA (ΔΔEi,j)(2)
Reciprocal Coupling Analysis
Reciprocal coupling analysis was adopted to identify the reciprocal coupling pairs in the results of the SCA, using a 2-dimensional array (282×282). The reciprocal coupling pairs must meet the requirement that when site i was coupled to j, j must couple to i. The top 10 coupling positions for each perturbation site were selected according to their ΔΔEj,i and ΔΔEi,j value. When site i was coupled to j with the highest ΔΔE j,i value, and site j was coupled to i with the highest ΔΔE i,j value, the pairs were identified as the major reciprocal coupling pairs. The statistical coupling data were imported to TIGR Multi-experiment Viewer to generate a matrix of 282×282 . The matrix represented all perturbations in columns to residues in rows. All of the statistical coupling analysis and reciprocal coupling analysis were calculated by Microsoft Excel and Visual Basic for Applications. The structural overlay and graphs were generated by the Swiss-Pdb Viewer . The protein structures used to calculate the distances between residues included 1YHV.PDB (Active Pak1) , 1F3M.PDB (Inactive Pak1) , 1ATP.PDB (PKA)  and 2SRC.PDB (Src) .
Evolution Value and Conservation of the Residues in Pak2
Human protein kinases from the kinome were selected for SCA, as indicated in Methods, to calculate the values of the evolutionary conservation (ΔE) for Pak2 (Figure 1a). The ΔE ranged from the least conserved residue Ser398 (ΔE = 0.08) to the most conserved Trp427 (ΔE = 14.97) in the αF-helix. The percent level of normalized ΔE values were superimposed on the Pak1 structure (Figure 1b). The conserved residues were abundant in the glycine-rich loop, αC-helix, β3-sheet, the activation loop, the magnesium positioning loop and αF-helix. This indicates the universal importance of the residues related to catalytic activity, ATP binding and magnesium ion binding. The αF-helix shown as a highly conserved region may play an important role in the catalytic activity and maintaining the structure of the large lobe.
(a) The Pak2 evolutionary conservation values (ΔE) for amino acids in the catalytic domain (residues 249 to 500). Residue numbers are those for Pak2. (b) The evolutionary conservation values were normalized to percentage and mapped on the tertiary structure of Pak1 (1YHV.PDB), which is highly homologous to Pak2. The levels of ΔE are shown in the color index. The secondary structures, α-helices A-J and β-sheets 1–9, are identified in (b).
The detailed sequence alignment, secondary structures and domains of Pak2, PKA and Src are shown in Figure 2. The glycine-rich loop, catalytic loop, magnesium positioning loop, APE motif and F-helix are highly conserved in the alignment. The secondary structures and the highly conserved regions became important checkpoints for the multiple sequence alignment. The residues perturbed by the phosphorylation site in the activation loop and the reciprocal coupling residues of Pak2, PKA and Src are also identified in the alignment.
Secondary structure assignments are shown as rectangles (α-helices) and arrows (β-strands) above the aligned sequences. The assigned functional domains are shown by the empty box in the alignment. The residues highlighted in green are the ones appeared in Table 1; the residues highlighted in yellow are the ones appeared in Table 2; the residues highlighted in pink are the ones appeared in both Table 1 and 2.
Evolutionary Coupling Network of Thr402 for Pak2
The evolutionary coupling values (ΔΔE) of the residues to the autophosphorylation site Pak2 Thr402 was quantified by perturbation to the conservation value ΔE . We calculated the changes of the evolutionary coupling values (ΔΔE) of each individual Pak2 residue caused by perturbation at moderately conserved Thr402 (34.6%). The results of perturbation at Thr402 (Figure 3a, left panel and Table 1) were positioned on the crystal structure of the homologous Pak1 to visualize the perturbation effect in terms of the color index (Figure 3a, right panel). The average of ΔΔE for all residues was 0.83. Local effects caused by Thr402 perturbation identified Val404 (ΔΔE404V,402T = 4.43) and Gly405 (ΔΔE405G,402T = 2.95) in the peptide positioning loop. The second strongest effect was observed distally at Trp427 (ΔΔE427W,402T = 3.12) in the F-helix (residues 423–438). Additionally, Phe387 (ΔΔE387F,402T = 2.80) in the magnesium positioning loop was affected by perturbation of Thr402. This suggests a communication pathway between Thr402 in the activation loop, the magnesium positioning loop and the F-helix.
(a) The histogram represents the statistical coupling energy (ΔΔE) of the Pak2 catalytic domain (residues 249 to 500) perturbed by Thr402 in the activation loop. The stick representation of the perturbation effect of Thr402 (blue) displays Pak2 residues with ΔΔE>3.0 (red) and between 3.0 and 2.6 (orange). (b) The histogram represents the statistical coupling energy (ΔΔE) of the catalytic subunit domain of PKA (residues 43 to 297) perturbed by Thr197 in the activation loop. The results are superimposed on the x-ray crystal structure of PKA (1ATP.PDB). The color chart is the same as Figure 2. The phosphorylation site Thr197 (blue), the major coupled residues Cys199 and Trp222 (ΔΔE>3.0, red), and the secondary coupled residues (ΔΔE between 3.0 and 2.6, orange) are identified. (c) The histogram represents the statistical coupling energy (ΔΔE) of the Src catalytic domain (residues 267 to 515) perturbed by Tyr416 in the activation loop. The statistical coupling residues of Src are shown on the tertiary structure (2SRC.PDB). The phosphorylation site Tyr416 (blue), the center of the coupled residues Trp428 (ΔΔE = 12.06, red), and the secondary coupled residues (ΔΔE between 12.0 and 7.6, green) are identified.
To compare the perturbation of threonine in the activation loop of Pak2, two other protein kinases, cAMP-dependent protein kinase (PKA) and Src (in the AGC and TK protein kinase hierarchical groups, respectively), were also examined by SCA. Phosphorylation of Thr197(PKA) in PKA and Tyr416(Src) in Src in the activation loop is essential for their protein kinase activity , , , , . SCA was applied to perturbation of Thr197(PKA) in PKA (34.8%) and Tyr416(Src) in Src (20.5%). The average of ΔΔEi,197T for all residues of PKA was 0.83, and the average of ΔΔEi,416Y for Src was 2.31. The results with Thr197(PKA) in PKA showed a similar pattern of affected residues as Pak2, as visualized in the structure of PKA (1ATP.PDB) (Figure 3b). Cys199(PKA) (ΔΔE199C,197T = 4.45) of PKA was two residues away from the phosphorylation site Thr197, which is the same as Val404 and Thr402 in Pak2. Tyr416 (Src) in the activation loop of Src resulted in similar coupling residues in Pak2. These residues were superimposed to the structure of Src (2SRC.PDB) (Figure 3c). Trp428(Src) (Trp409 in Pak2) in the peptide positioning loop was the most impacted residue by Tyr416(Src) perturbation (ΔΔE428W,416Y = 12.06), different from Pak2 and PKA.
Reciprocal Coupling Analysis of Pak2
To analyze the evolutionary coupling effect on one residue, we mutated the particular residue in silico and measured the evolutionary coupling values (ΔΔE) of each individual Pak2 residue. Two examples of mutations at Trp427 and Gly439 are shown in Figure 4a. The ΔΔE values indicate the extent of perturbation caused by the mutation. In these two examples, Gly439 (ΔΔE439G,427W = 1.5) was the strongest coupling residue to Trp427 (Figure 4a, upper panel); Trp427 was the strongest coupling residue to Gly439 in (Figure 4a, lower panel). Thus, Trp427 and Gly439 are reciprocally coupled through evolution. The complete calculations for each position are shown in the two-dimensional matrix (Figure 4b).
(a) Perturbations of Trp427 (upper panel) and Gly439 (bottom panel) were generated to obtain statistical coupling energies to all of the Pak2 residues. The most impacted residue of Trp427 perturbation is Gly439 and visa versa, and is defined as the major reciprocal coupling pair. (b) The results of the coupling energy (ΔΔE) calculations were imported into TIGR MeV program to show the 252 (Perturbation)×252 (Coupling) matrix. The intensity of ΔΔE was expressed in the heat map color scheme shown on top of the graph. (c) Plot of reciprocal coupling residues. The cross marks show the major reciprocal coupling residues (the highest ΔΔEi,j and ΔΔEj,i values in each perturbation) and the green dots show the minor reciprocal coupling residues (top 2–10 ΔΔEi,j and ΔΔEj,i values in each perturbation). The nine reciprocal coupling pairs are identified in Table 1. The dashed line is a symmetrical line for the coupling pairs and measures the length between residues.
Reciprocal coupling analysis of the entire catalytic domain of Pak2 was carried out to retrieve the reciprocal coupling pairs. The top 10 evolutionary coupling values (ΔΔE) were extracted from Figure 4b for reciprocal coupling analysis and the data are shown as green dots (Figure 4c). A total of nine reciprocal coupling pairs with the strongest correlation between two residues were identified, as indicated by pair number (Figure 4c, Table 2). The generated map of the reciprocal coupling pairs was symmetrical along the diagonal.
The nine reciprocal coupling pairs of Pak2 were positioned on the structure for Pak1 (1YHV.PDB) (Figure 5a,b). Among the nine pairs, three had residues that interacted with each other or were within the contact distance. Asn373- Asp386 (Pair 2), Glu413- Arg488 (Pair 4) and Glu324- Lys383 (Pair 8) were within 3.6 Å (Table 2). Pair 4 and Pair 8 were identified as ion pairs. From the crystal structures (Figure 5a,b), Glu413- Arg488 (Pair 4) stabilized the distance between the peptide positioning loop and the H-helix in the large lobe. Pair 8 is an ion pair in the hinge region of the bilobal structure. Pair 2 (residues Asn373 and Asp386) were located in the active site cleft.
(a) Four common reciprocal coupling pairs for Pak2 were superimposed on the Pak1 structure (1YHV.PDB) (see Table 1). (b) Five specific reciprocal coupling pairs were superimposed on the same structure. (c) Four specific reciprocal coupling pairs for PKA were identified on the PKA structure (1ATP.PDB). (d) Two specific reciprocal coupling pairs in Src were shown on 2SRC.PDB. The same color was used when two residues were reciprocally coupled. In (c) and (d), the common reciprocal couplings between the three structures are in gray.
On the other hand, the rest of the residues were not within a contactable distance, but were evolutionarily conserved. The correlation between these residues may be caused by structural or functional structures. Glu294- Arg367 (Pair 1) was solvent exposed in the active site cleft and both residues were important for stabilization of phosphate on ATP and the activation loop , , , , . The hydrophobic F-helix, which stabilizes the large lobe, is part of the hydrophobic pocket critical for PKA activity . Evolutionary coupling of Trp427 and Gly439 (Pair 3) in the F-helix of Pak2 (same in PKA) could also provide stabilization of the large lobe. However, the function of four of the reciprocal coupling pairs was not clear in the initial analysis. His360- Lys370 (Pair 6), 11.8 Å apart and located between the D-helix and the catalytic loop, could be stabilizing the catalytic loop region. Trp409-Cys480 (Pair 9) in the peptide positioning loop and H-helix, and Leu481- His497 (Pair 5) between the H- and I-helices, were 8.8 and 10.1 Å, respectively. Together, they could stabilize the region containing the peptide positioning loop, the H-helix and the I-helix. The two residues, Gly380- Pro461 (Pair 7) were at a remote distance (25.3 Å), and associated with two regions, the C-terminus and the G-H loop.
Comparison of Reciprocal Coupling Pairs between Pak2, PKA and Src
The reciprocal coupling pairs of Pak2, PKA and Src were compared (Table 2) to seek the required and universal coupling pairs in protein kinases. The reciprocal pairs were separated in two groups, common coupling pairs and specific coupling pairs. There were four common reciprocal coupling pairs for all three of the protein kinases. The specific coupling pairs appeared only in one or two proteins. Pak2 had five specific reciprocal coupling pairs, PKA had four and Src had two. Three specific coupling pairs from PKA and two from Src were present on Pak2.
The distances between the specific evolutionary coupling pairs in Pak2 were from 2.6 Å to 14.8 Å and with the average of 7.7 Å. There were minor differences in the average distances of the common coupling pairs among Pak2 (7.7 Å), PKA (7.7 Å) and Src (7.5 Å). Because these four common coupling pairs are conserved in Pak2, PKA and Src, it they could be responsible for the general and critical functions of protein kinases, such as the catalytic events in ATP binding, magnesium binding or protein substrate binding. Interestingly, these residues are localized in a two-dimensional surface extending from the active site to the F-helix (Figure 5a). The core structure of the large lobe is composed of six helices and several loops between the helices (Figure 1b). The D, E, G, H and I helices were surrounding the hydrophobic F-helix. The catalytic sites, the magnesium positioning loop and the activation loop resided on top of the core. Most coupling residues were on the interface between the helix-core (D, E, F, H, I helices) and the activity region, including the magnesium positioning loop, the activation loop, and the peptide positioning loop. The G-helix on the other side is in charge of the enzymatic activity (Figure 1b). The residues of the G-helix have low ΔE values, although the helix is conserved in the known kinase structures. The activation loop and G-helix are important regulatory domains , , . The peptide positioning loop is anchored by the coupling of Glu413-Arg488 to the H-helix.
The specific coupling pairs for Pak2, PKA and Src are shown in the crystal structures (1YHV.PDB, 1ATP.PDB and 2SRC.PDB respectively) (Figure 5b, c, d). The specific reciprocal coupling pairs included both distant pairs and proximal pairs. The distances between the specific evolutionary coupling pairs in Pak2 were from 3.2 Å to 25.3 Å and with the average of 11.8 Å. The average distances of the specific coupling pairs are 13.1 Å for PKA and 5.6 Å for Src (Table 2).
The Pak2 catalytic domain contained all of the evolutionary coupling pairs of PKA and Src, except for Tyr229- Phe238(PKA) in PKA (Pair 10), which was unique. Tyr229- Phe238(PKA) (Pair 10), existing only in PKA, appeared in two approximately parallel (9°) ring-structures in the F-helix and the F-G loop. These two residues were 7.35 Å apart (7.48 Å when PKI was bound) , . Interestingly, when the regulatory subunit bound to the G-helix of the catalytic subunit, the two ring structures were twisted (16°) and moved closer to each other (7.17 Å) . Thus, an interaction between Tyr229(PKA) and Phe238(PKA) might be involved in regulation of G-helix of PKA, which would be different from that of Pak2 and Src. Pairs 5, 6 and 7 in Pak also existed in PKA, but not in Src. With Src, Glu339- Lys401(Src) (Pair 8) and Trp428- Cys498(Src) (Pair 9), were the two specific coupling pairs and were present in Pak2, but not PKA. Glu324- Lys383 (Pair 8) was ion paired in the hinge region of Pak2 and Src with the side chains contacting each other. Activation of Pak does not destroy the interaction between Glu324 and Lys383, but Lys383 is twisted in the active conformation. Trp428(Src) in Pair 9 was significantly perturbed by Tyr416(Src), indicating this reciprocal coupling pair was related to tyrosine phosphorylation at the activation loop. When we examine further these specific coupling pairs, at least one residue of the coupling pairs 5, 6, 7, and 9 are in contact with the F-helix, indicating the importance of this F-helix.
The protein kinase Pak2 can be activated by either Cdc42 binding or caspase 3 cleavage. Cdc42 binds to the Cdc42/Rac interaction and binding sequence (CRIB) (residues 74–87) of Pak2, which overlaps with the autoinhibitory domain (AID). Binding of Cdc42 to inactive Pak2 disrupts the inhibition . Under apoptotic stress, Pak2 is constitutively activated by caspase 3 cleavage and autophosphorylation . In either event, the protein kinase activity of Pak2 requires ATP binding, autophosphorylation of Thr402 and disruption of the autoinhibition to obtain an active conformation. In our previous study, we used amide hydrogen/deuterium exchange and mass spectrometry to analyze the inactive and active Pak2. The N-terminus, glycine-rich loop, C-helix, magnesium positioning loop, F-helix and the G-helix are affected by the activation of Pak2. In order to analyze further the activation mechanism of Pak2, we have now used statistical methods to systematically analyze the critical residues in the protein kinases through evolution.
Phosphorylation of Thr402 in the activation loop has been regarded as a crucial element for conformational change in the activation loop and is required for Pak2 activity. To examine the Thr402 perturbation effect, we compared the evolutionary coupling analysis with the amide H/D exchange data obtained during activation of Pak2  (Figure 6). Autophosphorylation of Pak2 caused a significant increase in the deuterium exchange in the region 417–429 containing part of the F-helix and the E-F loop, and the region 436–451 containing parts of the F and G-helices and the F-G loop. In the SCA data, we have shown Val404 and Gly405 coupled locally with Thr402, and remotely with Trp427 and Gly439 in the F-helix in response to autophosphorylation of Thr402. Also, Trp427 and Gly439 are reciprocal coupling pairs. Perturbation of the critical autophosphorylation site Thr402 showed the importance of the conserved Trp427 in the F-helix.
(a) The colored bars illustrate the solvent accessibility changes in the primary sequence. The numbers show the mass (M/Z) of the peptic peptides with a measurable MALDI-TOF signal after H/D exchange. An increase of solvent accessibility following autophosphorylation is shown as the solvent accessibility index. Two fragments (light gray bars), m/z 1476 and 1579, containing Thr402 disappeared after phosphorylation. (b) The differences in solvent accessibility are shown on crystal structure of Pak1 (1YHV.PDB). Residues, Trp427 (red) and Gly439 (orange), were statistically coupled to Thr402 (blue). Trp427 and Gly439 were in the fragments m/z 1532 and 1828 that had increased solvent accessibility after autophosphorylation.
Using statistical analysis, Kannan and Neuwald  proposed the F-helix as the most conserved region of protein kinase family. The tryptophan in the F-helix played a key role in proper positioning of the G-helix, coordinating with a buried water molecule and forming a hydrophobic pocket below the F-helix . Kornev et al  showed this hydrophobic pocket involves hydrophobic residues in the F-helix and four conserved leucines in the H-helix. Kannan et al  have shown further that the eukaryotic protein kinases have the regulatory machinery that is defined by the G-Helix and the activation loop, and is missing in the prokaryotic eukaryotic-like kinases. Phosphorylation of the activation loop couples this regulatory machinery to the core catalytic machinery in the C-terminal lobe. In our reciprocal coupling analysis, the F-helix serves as a central helix in contact with most of the reciprocally coupled residues. Particularly, Leu481, Cys480, Arg488, Pro461, Trp409 are within 5 Å of Trp427 (Fig 5a,b). Among these residues, Trp427 in the F-helix, and Leu481 and Cys480 in the H-helix were the main components of the hydrophobic pocket . The hydrophobic F helix has been identified as a mechanism for assembling an active protein kinase and for anchoring the two hydrophobic spines that link both the N-lobe and the C-lobe , and is an independent validation of the importance of the F-Helix. It also explains in part how some of these distal residues may be functionally correlated. For instance, many of the correlated residues such as Phe185(PKA) are part of the spine, and many of the correlated residues (Fig 5c) are directly adjacent to the regulatory spine.
Threonine phosphorylation on the activation loop of Pak2 perturbed the kinase using a similar network as PKA, but a different network than the tyrosine phosphorylation on Src. Perturbation at the threonine autophosphorylation site on the activation loop of Pak2 identified evolutionary coupled Val404 and Gly405 in the peptide-positioning (P+1) loop, Trp427 and Gly439 in the F-helix and Met323 and Glu324 in the hinge region. On the other hand, perturbation of the Src Tyr416(Src) phosphorylation site was coupled with Pro425(Src) and Trp428(Src) in the peptide positioning loop and Met495(Src), Cys498(Src) and Trp499(Src) in the H-helix. The peptide positioning loop binds to the protein substrates for phosphorylation and its sequence is important for the recognition of specific substrates , . With the different residues in this region (Table 1 and Figure 2), the phosphorylation of the tyrosine in the activation loop of Src could perturb different residues in the peptide positioning loop as compared to that of the threonine in Pak2 and PKA. The differences relate directly to the mechanism for recognition of the P-site residue in the protein substrate. In the case of the two serine/threonine kinases, Gly200(PKA) or Gly405(Pak2) is binding directly to the backbone of the P-site residue that is to be phosphorylated. In the case of the tyrosine kinases the backbone is too far away and the peptide is positioned instead by the aromatic ring and stacking with the aromatic ring in the P+1 loop. Thus, the two Ser/Thr kinases, Pak2 and PKA, have a different substrate recognition mechanism than the tyrosine kinase Src. Moreover, the hinge region was shown to be significantly altered between the open and closed conformation of the kinase domain of Pak2 and PKA , , , . Phosphorylation of Pak2 and PKA may affect the hinge region, leading to the movement between the two lobes of the kinase domain. Pak2 and PKA alter activation via the residues next to the phosphorylation site in the activation loop, the hinge region and F-helix while Src is via Pro425(Src) and Trp428(Src) on the peptide positioning loop, the catalytic loop and H-helix (Table 1, Figure 3). Thus, there are two different ways to achieve the kinase catalytic activity.
Recently, several studies have used statistical analysis and structural comparison to identify critical residues in protein kinases –. The critical roles of the F-Helix, Trp222(PKA) (Trp427 in Pak2) in the F-Helix and the hydrophobic pocket between the F and H-helix for the protein kinases were particularly identified in these studies. Two clusters of residues were proposed to be evolutionary coupled . Perturbation of the phosphorylation site Thr197(PKA) (Thr402 in Pak2) affected several proposed coupling residues. Trp222(PKA) in F-helix of PKA (Trp427 in Pak2) is claimed as a center in one of the two clusters of evolutionary coupled residues, while Cys199(PKA), Gly200(PKA), Gly234(PKA), Trp296(PKA) and Tyr229(PKA) in PKA were in the other cluster of residues. These reports support our finding that Trp222 in PKA and Trp427 in Pak2 is a key interaction site with Thr197(PKA) in PKA and Thr402 in Pak2. We show that phosphorylation of this critical threonine regulates critical structural changes that lead to activation of the protein kinases.
The residues exposed to the solvent are most likely to contact with ligand, substrate or other proteins. All of the reciprocal coupling residues having an exposed side chain on the protein surface of the Pak structure are identified in Figure 7a, b. The coupling residues on the surface of Pak2 lined up in a groove extending from the active site cleft to the large lobe. The residues residing in the groove include Glu294 and Arg367 (Pair 1) and Asn373 and Asp386 (Pair 2) in the active site cleft. Lys370 in Pair 6, Trp409 in Pair 9 and Gly439 in Pair 3 are on the large lobe (Figure 7a). Glu324 and Lys383 (Pair 8) are exposed on the Pak2 surface in the hinge region (Figure 7b). Lys370, Trp409 and Gly439 in the large lobe, have their reciprocal coupling partners, His360, Cys480 and Trp427 below the surface of Pak.
(a) Identification of the Pak2 reciprocal coupling residues that are on the Pak1 surface (1YHV.PDB). (b) The structure turned 120 degrees counterclockwise from (a) shows the reciprocal coupled ion pair in the hinge region. (c) The catalytic domain of Pak (green) binding to the AID (yellow) modified from 1F3M.PDB. (d) PKA (green) bound to PKI (yellow) modified from 1ATP.PDB.
Glu294- Arg367 (Pair 1) and Asn373- Asp386 (Pair 2) are the two reciprocal coupling pairs in the active site. These four conserved residues are responsible for the stabilization of the magnesium ions and ATP, the C-helix and the activation loop in PKA , . The side chains of Glu294 and Arg367 in the inactive conformation of Pak1 (1F3M.PDB) interact with the kinase inhibitory region (residues 136–146 for Pak2, 138–147 for Pak1) of the autoinhibitory domain . While in the active conformation (1YHV.PDB), the relative positions of Glu294 and Arg367 are significantly changed due to the twist in the C-helix (Fig 5a) .
In the inactive Pak1 conformation (1F3M.PDB), Asn373 and Asp386 (Pair 2) are solvent exposed. These two residues are coordinated with manganese ions in the PKA structure (1ATP.PDB). In our previous H/D exchange experiments, we found that AID binding to Pak2 does not block coordination of the magnesium ions to Pair 2 . A comparison of Figure 7c to Figure 7a indicates the AID blocks Pair 1, while Pairs 2, 3, 6 and 9 are still solvent exposed. When we overlaid the Pak1 and PKA structures, the groove overlapped with the pseudosubstrate (PKI) binding region on PKA (Figure 7d) . When the contact region between the PKA catalytic domain and the regulatory domain was compared with the Pak2 coupling residues, part of the contact region resided in the PKI binding groove, thus blocking substrate binding. We predict that variation of the amino acid composition of the protein kinases in this groove will alter the substrate specificity.
Note that several residues clustered within the hydrophobic core contacting to the F-helix beneath the surface were also shown to be involved in the substrate binding affinity of PKA , , including Trp222(PKA) (Pair 3), Glu208(PKA) and Arg280(PKA) (Pair 4) and His294(PKA) (Pair 5). These identified residues stretched from the active site of the enzyme to the C-terminal substrate-binding domain. Coupling of the active site to the H and I helices was independently correlated and then experimentally validated in a genetic screen by two-hybrid assays , . Arg280(PKA) between the H and I-helix formed a salt bridge interaction with Glu208(PKA) in the APE motif packing up against Trp222(PKA). Mutation of Arg280(PKA) to lysine decreased the kinase activity in PKA .
On the reverse side of Pak2, Glu324- Lys383 (Pair 8) are ion paired in the hinge region of Pak2 and of Src, with their side chains contacting each other (Figure 7b). This solvent exposed pair connects the linker region of the bi-lobal structure with the magnesium positioning loop , , . The two lobes of the active conformation (1YHV.PDB) are closer than the inactive conformation (1F3M.PDB) of Pak1. The active conformation has a 15 degree rotation of the bilobal structure, compared to the inactive conformation . When we examined Pair 8 in both structures, the hydrogen bonds between Glu324 and Lys383 were twisted in the inactive conformation. In this inactive conformation, the distances between two oxygen atoms of Glu324 and the nitrogen atom of Lys383 are 3.96 and 2.87 Å, while the ion pair shows shorter distances, 3.25 and 2.89 Å in the active conformation.
Multiple sequence alignment of human protein kinases based on Pak2.
(0.13 MB TXT)
Multiple sequence alignment of human protein kinases based on PKA.
(0.13 MB TXT)
Multiple sequence alignment of human protein kinases based on Src.
(0.13 MB TXT)
We thank Dr. David Johnson (UCR) for his generous advice in this project, and Dr. Michael Dunn (UCR) for his comments on the manuscript.
Conceived and designed the experiments: YHH JAT. Performed the experiments: YHH. Analyzed the data: YHH. Contributed reagents/materials/analysis tools: YHH JAT. Wrote the paper: YHH JAT.
- 1. Lockless SW, Ranganathan R (1999) Evolutionarily conserved pathways of energetic connectivity in protein families. Science 286: 295–299.
- 2. Suel GM, Lockless SW, Wall MA, Ranganathan R (2003) Evolutionarily conserved networks of residues mediate allosteric communication in proteins. Nat Struct Biol 10: 59–69.
- 3. Hatley ME, Lockless SW, Gibson SK, Gilman AG, Ranganathan R (2003) Allosteric determinants in guanine nucleotide-binding proteins. Proc Natl Acad Sci U S A 100: 14445–14450.
- 4. Shulman AI, Larson C, Mangelsdorf DJ, Ranganathan R (2004) Structural determinants of allosteric ligand activation in RXR heterodimers. Cell 116: 417–429.
- 5. Estabrook RA, Luo J, Purdy MM, Sharma V, Weakliem P, et al. (2005) Statistical coevolution analysis and molecular dynamics: identification of amino acid pairs essential for catalysis. Proc Natl Acad Sci U S A 102: 994–999.
- 6. Jaffer ZM, Chernoff J (2002) p21-activated kinases: three more join the Pak. Int J Biochem Cell Biol 34: 713–717.
- 7. Roig J, Traugh JA (2001) Cytostatic p21 G protein-activated protein kinase gamma-PAK. Vitam Horm 62: 167–198.
- 8. Walter BN, Huang Z, Jakobi R, Tuazon PT, Alnemri ES, et al. (1998) Cleavage and activation of p21-activated protein kinase gamma-PAK by CPP32 (caspase 3). Effects of autophosphorylation on activity. J Biol Chem 273: 28733–28739.
- 9. Gatti A, Huang Z, Tuazon PT, Traugh JA (1999) Multisite autophosphorylation of p21-activated protein kinase gamma-PAK as a function of activation. J Biol Chem 274: 8022–8028.
- 10. Jakobi R, Huang Z, Walter BN, Tuazon PT, Traugh JA (2000) Substrates enhance autophosphorylation and activation of p21-activated protein kinase gamma-PAK in the absence of activation loop phosphorylation. Eur J Biochem 267: 4414–4421.
- 11. Jung JH, Traugh JA (2005) Regulation of the interaction of Pak2 with Cdc42 via autophosphorylation of serine 141. J Biol Chem 280: 40025–40031.
- 12. Tuazon PT, Chinwah M, Traugh JA (1998) Autophosphorylation and protein kinase activity of p21-activated protein kinase gamma-PAK are differentially affected by magnesium and manganese. Biochemistry 37: 17024–17029.
- 13. Parrini MC, Lei M, Harrison SC, Mayer BJ (2002) Pak1 kinase homodimers are autoinhibited in trans and dissociated upon activation by Cdc42 and Rac1. Mol Cell 9: 73–83.
- 14. Lei M, Lu W, Meng W, Parrini MC, Eck MJ, et al. (2000) Structure of PAK1 in an autoinhibited conformation reveals a multistage activation switch. Cell 102: 387–397.
- 15. Johnson LN, Noble ME, Owen DJ (1996) Active and inactive protein kinases: structural basis for regulation. Cell 85: 149–158.
- 16. Hubbard SR, Wei L, Ellis L, Hendrickson WA (1994) Crystal structure of the tyrosine kinase domain of the human insulin receptor. Nature 372: 746–754.
- 17. Tu H, Wigler M (1999) Genetic evidence for Pak1 autoinhibition and its release by Cdc42. Mol Cell Biol 19: 602–611.
- 18. Zhao ZS, Manser E, Chen XQ, Chong C, Leung T, et al. (1998) A conserved negative regulatory region in alphaPAK: inhibition of PAK kinases reveals their morphological roles downstream of Cdc42 and Rac1. Mol Cell Biol 18: 2153–2163.
- 19. Frost JA, Khokhlatchev A, Stippec S, White MA, Cobb MH (1998) Differential effects of PAK1-activating mutations reveal activity-dependent and -independent effects on cytoskeletal regulation. J Biol Chem 273: 28191–28198.
- 20. Wu H, Zheng Y, Wang ZX (2003) Evaluation of the catalytic mechanism of the p21-activated protein kinase PAK2. Biochemistry 42: 1129–1139.
- 21. Lei M, Robinson MA, Harrison SC (2005) The active conformation of the PAK1 kinase domain. Structure 13: 769–778.
- 22. Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S (2002) The protein kinase complement of the human genome. Science 298: 1912–1934.
- 23. Hsu YH, Johnson DA, Traugh JA (2008) Analysis of conformational changes during activation of protein kinase Pak2 by amide hydrogen/deuterium exchange. J Biol Chem 283: 36397–36405.
- 24. Zheng J, Trafny EA, Knighton DR, Xuong NH, Taylor SS, et al. (1993) 2.2 A refined crystal structure of the catalytic subunit of cAMP-dependent protein kinase complexed with MnATP and a peptide inhibitor. Acta Crystallogr D Biol Crystallogr 49: 362–365.
- 25. Karlsson R, Zheng J, Xuong N, Taylor SS, Sowadski JM (1993) Structure of the mammalian catalytic subunit of cAMP-dependent protein kinase and an inhibitor peptide displays an open conformation. Acta Crystallogr D Biol Crystallogr 49: 381–388.
- 26. Xu W, Doshi A, Lei M, Eck MJ, Harrison SC (1999) Crystal structures of c-Src reveal features of its autoinhibitory mechanism. Mol Cell 3: 629–638.
- 27. Xu W, Harrison SC, Eck MJ (1997) Three-dimensional structure of the tyrosine kinase c-Src. Nature 385: 595–602.
- 28. Williams JC, Weijland A, Gonfloni S, Thompson A, Courtneidge SA, et al. (1997) The 2.35 A crystal structure of the inactivated form of chicken Src: a dynamic molecule with multiple regulatory interactions. J Mol Biol 274: 757–775.
- 29. Sicheri F, Moarefi I, Kuriyan J (1997) Crystal structure of the Src family tyrosine kinase Hck. Nature 385: 602–609.
- 30. Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792–1797.
- 31. Saeed AI, Sharov V, White J, Li J, Liang W, et al. (2003) TM4: a free, open-source system for microarray data management and analysis. Biotechniques 34: 374–378.
- 32. Guex N, Peitsch MC (1997) SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis 18: 2714–2723.
- 33. Johnson DA, Akamine P, Radzio-Andzelm E, Madhusudan M, Taylor SS (2001) Dynamics of cAMP-dependent protein kinase. Chem Rev 101: 2243–2270.
- 34. Nolen B, Taylor S, Ghosh G (2004) Regulation of protein kinases; controlling activity through activation segment conformation. Mol Cell 15: 661–675.
- 35. Akamine P, Madhusudan , Wu J, Xuong NH, Ten Eyck LF, et al. (2003) Dynamic features of cAMP-dependent protein kinase revealed by apoenzyme crystal structure. J Mol Biol 327: 159–171.
- 36. Zheng J, Knighton DR, ten Eyck LF, Karlsson R, Xuong N, et al. (1993) Crystal structure of the catalytic subunit of cAMP-dependent protein kinase complexed with MgATP and peptide inhibitor. Biochemistry 32: 2154–2161.
- 37. Kim C, Xuong NH, Taylor SS (2005) Crystal structure of a complex between the catalytic and regulatory (RIalpha) subunits of PKA. Science 307: 690–696.
- 38. Kannan N, Neuwald AF (2005) Did protein kinase regulatory mechanisms evolve through elaboration of a simple structural component? J Mol Biol 351: 956–972.
- 39. Kornev AP, Taylor SS, Ten Eyck LF (2008) A helix scaffold for the assembly of active protein kinases. Proc Natl Acad Sci U S A 105: 14377–14382.
- 40. Kannan N, Taylor SS, Zhai Y, Venter JC, Manning G (2007) Structural and functional diversity of the microbial kinome. PLoS Biol 5: e17.
- 41. Xu F, Du P, Shen H, Hu H, Wu Q, et al. (2009) Correlated mutation analysis on the catalytic domains of serine/threonine protein kinases. PLoS One 4: e5913.
- 42. Deminoff SJ, Howard SC, Hester A, Warner S, Herman PK (2006) Using substrate-binding variants of the cAMP-dependent protein kinase to identify novel targets and a kinase domain important for substrate interactions in Saccharomyces cerevisiae. Genetics 173: 1909–1917.
- 43. Deminoff SJ, Ramachandran V, Herman PK (2009) Distal recognition sites in substrates are required for efficient phosphorylation by the cAMP-dependent protein kinase. Genetics 182: 529–539.
- 44. Torkamani A, Kannan N, Taylor SS, Schork NJ (2008) Congenital disease SNPs target lineage specific structural elements in protein kinases. Proc Natl Acad Sci U S A 105: 9011–9016.