分析测试百科网

搜索

喜欢作者

微信支付微信支付
×

Size and Shape of Protein Molecules-1

2019.4.23
头像

zhaochenxu

致力于为分析测试行业奉献终身

Size and Shape of Protein Molecules at the Nanometer Level Determined by Sedimentation, Gel Filtration, and Electron Microscopy

An important part of characterizing any protein molecule is to determine its size and shape. Sedimentation and gel filtration are hydrodynamic techniques that can be used for this medium resolution structural analysis. This review collects a number of simple calculations that are useful for thinking about protein structure at the nanometer level. Readers are reminded that the Perrin equation is generally not a valid approach to determine the shape of proteins. Instead, a simple guideline is presented, based on the measured sedimentation coefficient and a calculated maximum S, to estimate if a protein is globular or elongated. It is recalled that a gel filtration column fractionates proteins on the basis of their Stokes radius, not molecular weight. The molecular weight can be determined by combining gradient sedimentation and gel filtration, techniques available in most biochemistry laboratories, as originally proposed by Siegel and Monte. Finally, rotary shadowing and negative stain electron microscopy are powerful techniques for resolving the size and shape of single protein molecules and complexes at the nanometer level. A combination of hydrodynamics and electron microscopy is especially powerful.

Key Words: Protein shape - hydrodynamics - gel filtration - sedimentation - electron microscopy


Introduction

Most proteins fold into globular domains. Protein folding is driven largely by the hydrophobic effect, which seeks to minimize contact of the polypeptide with solvent. Most proteins fold into globular domains, which have a minimal surface area. Peptides from 10 to 30 kDa typically fold into a single domain. Peptides larger than 50 kDa typically form two or more domains that are independently folded. However, some proteins are highly elongated, either as a string of small globular domains or stabilized by specialized structures such as coiled coils or the collagen triple helix. The ultimate structural understanding of a protein comes from an atomic-level structure obtained by X-ray crystallography or nuclear magnetic resonance. However, structural information at the nanometer level is frequently invaluable. Hydrodynamics, in particular sedimentation and gel filtration, can provide this structural information, and it becomes even more powerful when combined with electron microscopy (EM).

One guiding principle enormously simplifies the analysis of protein structure. The interior of protein subunits and domains consists of closely packed atoms (1). There are no substantial holes and almost no water molecules in the protein interior. As a consequence of this, proteins are rigid structures, with a Young’s modulus similar to that of Plexiglas (2). Engineers sometimes categorize biology as the science of “soft wet materials”. This is true of some hydrated gels, but proteins are better thought of as hard dry plastic. This is obviously important for all of biology, to have a rigid material with which to construct the machinery of life. A second consequence of the close packed interior of proteins is that all proteins have approximately the same density, about 1.37 g/cm3. For most of the following, we will use the partial specific volume, v 2, which is the reciprocal of the density. v 2 varies from 0.70 to 0.76 for different proteins, and there is a literature on calculating or determining the value experimentally. For the present discussion, we will ignore these variations and assume the average v 2 = 0.73 cm3/g.


How Big Is a Protein Molecule?

Assuming this partial specific volume (v 2 = 0.73 cm3/g), we can calculate the volume occupied by a protein of mass M in Dalton as follows.

$$V{\left( {{\text{nm}}^{3} } \right)} = \frac{{{\left( {{0.73\;\,{\text{cm}}^{3} } \mathord{\left/ {\vphantom {{0.73\;\,{\text{cm}}^{3} } {\text{g}}}} \right. \kern-\nulldelimiterspace} {\text{g}}} \right)}\; \times \;{\left( {{{\text{10}}^{{{\text{21}}}} {\text{nm}}^{{\text{3}}} } \mathord{\left/ {\vphantom {{{\text{10}}^{{{\text{21}}}} {\text{nm}}^{{\text{3}}} } {{\text{cm}}^{{\text{3}}} }}} \right. \kern-\nulldelimiterspace} {{\text{cm}}^{{\text{3}}} }} \right)}}}{{6.023\; \times \;10^{{23}} {{\text{Da}}} \mathord{\left/ {\vphantom {{{\text{Da}}} {\text{g}}}} \right. \kern-\nulldelimiterspace} {\text{g}}}}\; \times \;M{\left( {{\text{Da}}} \right)} = 1.212\; \times \;10^{{ - 3}} {\left( {{{\text{nm}}^{3} } \mathord{\left/ {\vphantom {{{\text{nm}}^{3} } {{\text{Da}}}}} \right. \kern-\nulldelimiterspace} {{\text{Da}}}} \right)}\; \times \;M{\left( {{\text{Da}}} \right)}.$$

(2.1)

The inverse relationship is also frequently useful: M (Da) = 825 V (nm3).

What we really want is a physically intuitive parameter for the size of the protein. If we assume the protein has the simplest shape, a sphere, we can calculate its radius. We will refer to this as R min, because it is the minimal radius of a sphere that could contain the given mass of protein

$$R_{{\min }} = {\left( {3V/4\pi } \right)}^{{1/3}} = 0.066M^{{1/3}} {\text{ (for }}M{\text{ in Dalton, }}R_{{{\text{min}}}} {\text{ in nanometer)}}{\text{.}}$$

(2.2)

Some useful examples for proteins from 5,000 to 500,000 Da are given in Table  1 .

Table 1  R min for proteins of different mass

Protein M (kDa)

5

10

20

50

100

200

500

R min (nm)

1.1

1.42

1.78

2.4

3.05

3.84

5.21

It is important to emphasize that this is the minimum radius of a smooth sphere that could contain the given mass of protein. Since proteins have an irregular surface, even ones that are approximately spherical will have an average radius larger than the minimum.


How Far Apart Are Molecules in Solution?

It is frequently useful to know the average volume of solution occupied by each molecule, or more directly, the average distance separating molecules in solution. This is a simple calculation based only on the molar concentration.

In a 1-M solution, there are 6 × 1023 molecules/l, = 0.6 molecules/nm3, or inverting, the volume per molecule is V = 1.66 nm3/molecule at 1 M. For a concentration C, the volume per molecule is V = 1.66/C.

We will take the cube root of the volume per molecule as an indication of the average separation.

$$d = V^{{1 \mathord{\left/ {\vphantom {1 3}} \right. \kern-\nulldelimiterspace} 3}} = {1.18} \mathord{\left/ {\vphantom {{1.18} C}} \right. \kern-\nulldelimiterspace} C^{{1 \mathord{\left/ {\vphantom {1 3}} \right. \kern-\nulldelimiterspace} 3}} ,$$

(3.1)

where C is in molar and d is in nanometer. Table  2 gives some typical values.

Table 2 Distance between molecules as function of concentration

Concentration

1 M

1 mM

1 μM

1 nM

Distance between molecules (nm)

1.18

11.8

118

1,180

Two interesting examples are hemoglobin and fibrinogen. Hemoglobin is 330 mg/ml in erythrocytes, making its concentration 0.005 M. The average separation of molecules (center to center) is 6.9 nm. The diameter of a single hemoglobin molecule is about 5 nm. These molecules are very concentrated, near the highest physiological concentration of any protein (the crystallins in lens cells can be at >50% protein by weight).

Fibrinogen is a large rod-shaped molecule that forms a fibrin blood clot when activated. It circulates in plasma at a concentration of around 2.5 g/l, about 9 μM. The fibrinogen molecules are therefore about 60 nm apart, comparable to the 46-nm length of the rod-shaped molecule.


The Sedimentation Coefficient and Frictional Ratio. Is the Protein Globular or Elongated?

Biochemists have long attempted to deduce the shape of a protein molecule from hydrodynamic parameters. There are two major hydrodynamic methods that are used to study protein molecules—sedimentation and diffusion (or gel filtration, which is the equivalent of measuring the diffusion coefficient).

The sedimentation coefficient, S, can be determined in an analytical ultracentrifuge. This was a standard part of the characterization of proteins in the 1940s and 1950s, and values of S 20,w (sedimentation coefficient standardized to 20°C in water) are collected in references such as the Chemical Rubber Co. (CRC) Handbook of Biochemistry (3). Today, S is more frequently determined by zone sedimentation in a sucrose or glycerol gradient, by comparison to standard proteins of known S. Five to twenty percent sucrose gradients have been most frequently used, but we prefer 15–40% glycerol gradients in 0.2 M ammonium bicarbonate, because this is the buffer used for rotary shadowing EM (Section  6 ). The protein of interest is sedimented in one bucket of the swinging bucket rotor, and protein standards of known S (Table  5 ) are sedimented in a separate (or sometimes the same) gradient. Following sedimentation, the gradient is eluted into fractions and each fraction is analyzed by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) to locate the standards and the test protein. Figure 1 shows an example determining the sedimentation coefficient of the structural maintenance of chromosome (SMC) protein from Bacillus subtilis.

MediaObjects/12575_2009_9008_Fig1_HTML.gif

Fig. 1 Glycerol gradient sedimentation analysis of SMC protein from B. subtilis (BsSMC; upper panel) and sedimentation standards catalase and bovine serum albumin (lower panel). A 200-μl sample was layered on a 5.0-ml gradient of 15–40% glycerol in 0.2 M ammonium bicarbonate and centrifuged in a Beckman SW55.1 swinging bucket rotor, 16 h, 38,000 rpm, 20°C. Twelve fractions of 400 μl each were collected from a hole in the bottom of the tube and each fraction was run on SDS-PAGE. Lane SM shows the starting material, and fraction 1 is the bottom of the gradient. The bottom panel shows that the 11.3-S catalase eluted precisely in fraction 4, while the 4.6-S BSA eluted mostly in fraction 8, with some in fraction 9. We estimated the BSA to be centered on fraction 8.2. Experiments with additional standard proteins have demonstrated that the 15–40% glycerol gradients are linear over the range 3–20 S, so a linear interpolation is used to determine S of the unknown protein. BsSMC is in fractions 7 and 8, estimated more precisely at fraction 7.3. Extrapolating from the standards, we determine a sedimentation coefficient of 6.0 S for BsSMC. Other experiments gave an average value of 6.3 S for BsSMC (19).


生物在线
仪器推荐
文章推荐