Figure 1.
Schematic representation of cavities in a homodimeric molecule.
Cavity-lining (CL) atoms are in filled circles, black for those in tertiary structures and blue in interfaces. Non-cavity-non-surface (NCNS) atoms in the tertiary structure are in open, black circles and those in the interface are in open, blue circles. Surface atoms are in red. Cavities within the tertiary structure (CT) and interface (CI) are distinct from the surface pockets (P).
Table 1.
Average values of the total number of cavities and the total cavity volume in protein tertiary structures and interfaces.
Figure 2.
Dependence of the total volume of cavities on the volume (or the total number of atoms) of the protein (or the interface).
Plot of total volume of cavities against (A) the volume of the protein tertiary structure, and (B,C) the total number of atoms in interfaces. The correlation coefficient, r, is given in parentheses. In (A) the equation for the least-squares line is y = 0.016x–311 (R2 = 0.74); if approximated by a power law the corresponding equation is (A) y = 0.000002x1.779 (R2 = 0.70). If one uses the total number of atoms in the tertiary structure (in place of the volume) the distribution looks very similar to (A) and the two equations are y = 0.371x–279 (R2 = 0.75) and y = 0.0009x1.687 (R2 = 0.71).
Table 2.
Equations describing the dependence of the volume on the number of atoms/residues lining individual cavities.
Figure 3.
Distribution of the volume of cavities.
Histogram of the volume of cavities (A) all taken together, and separated into (B) solvated and (C) empty cavities.
Figure 4.
Visualization of different features in some cavities.
(A) Surface representation of the structure with the PDB code, 1bkp, with five interface cavities. The two subunits are rotated about a vertical axis and taken apart to show how cavities are formed between them. (B) The cartoon representation of the secondary structures for the same protein as in (A), along with the maze representation of the cavities (with the enclosed water molecules in red). The individual cavities are labeled and their volume (Å3), number of water molecules and Rvs are as follows. 1: 12.3, 1, 0.99; 2: 13.3, 1, 0.99; 3: 31.7, 1, 0.75; 4: 110.5, 3, 0.84; and 5: 111.6, 3, 0.83. (C) Ter_str cavities within one subunit of the dimeric molecule, 1dpg. One of the cavities (pointed by an arrow, volume: 20.7 Å3 and Rvs: 0.92) with two water molecules is shown in (D) along with the CL atoms, and all the hydrogen bonds are given in (E). Diagrams (C–E) do not have the molecules in the same orientation. In (D) and (E) the water molecules are in grey.
Figure 5.
Histogram of Rvs for (A) all the cavities and (B) cavities with volume>100 Å3.
Figure 6.
Propensities of residues to be associated with cavities.
Propensities of different types of residues in (A) cavity-lining and (B) non-cavity-non-surface regions, and in the cavity-lining region of (C) solvated and (D) empty cavities in protein structures and interfaces.
Figure 7.
Propensities of atoms to be associated with cavities.
Propensities of different types of atoms in (A) cavity-lining and (B) non-cavity-non-surface regions, and in the cavity-lining region of (C) solvated and (D) empty cavities in protein structures and interfaces. The atom types have been defined in Methods.
Figure 8.
Plot of the number of water molecules present in a cavity against its volume (Å3).
For a given value along the y axis the average of all the values along the x axis is shown (the horizontal bars representing the standard deviations). The numbers of points used for averaging are 948, 321, 83, 26, 10, 5 and 3 (six cavities have >7 water molecules). The equation for the least-squares line is y = 0.023x+0.70 (R2 = 0.99).
Figure 9.
Change in the Voronoi volumes of atoms in cavities.
Change in the Voronoi volumes (relative to those in the NCNS region of tertiary structure) of (A) CL atoms, (B) NCNS in interface, and (C) CL atoms in empty and solvated Ter_str cavities. For the solvated cavities in (C) the calculation has been done twice, once including the coordinates of the water molecules and the other excluding.
Figure 10.
Propensity of cavity lining atoms to occur in different secondary structural elements.
Figure 11.
An example of a cavity involving β-sheets.
The second largest cavity (with a volume of 222.4 Å3 and Rvs of 0.75) in the structure of 4htc, located in the interface, involving β-sheets. Of the 32 CL atoms, 4 are from helix, 14 from β-strands and 14 belong to ‘Others’.
Figure 12.
Some examples of cavities with water molecules, cation and ligand.
The largest interface cavity (containing 56 water molecules, with a total volume of 2341 Å3 and Rvs of 0.5), in the homodimeric structure of 1oac is displayed in (A). Cavities with a large water content are also observed in dimeric interfaces: 1mor (46, 2202 Å3 and 0.5), 1chm (58, 1798 Å3, 0.3). PDB codes for cavities containing 10–20 water molecules are Ter_str (1cmb); Inter_H (1ade, 1b8j, 1bis, 1chm, 1mkb (2 cases), 1mor, 1oac, 1sox, 1utg and 5rub); Inter_C (1kz7). (B) shows a Ca2+ ion and a water (red) molecule in a cavity (volume: 17.2 Å3 and Rvs: 0.96) of 2scp. (C) shows α-d-glucose-6-phosphate and nine water molecules in a cavity (565.1 Å3 and 0.69) of 1mor.
Table 3.
Solvated cavities in interface and their hydrogen-bonding pattern.
Table 4.
Number of water molecules mediating interaction between residues with opposite or like charges across the interface.