Beruflich Dokumente
Kultur Dokumente
Reference:
“Protein Structure Prediction A Practical
Approach” edited by M J E Sternberg
IRL Press
Chapter 7 Protein Folding and Unfolding
Introduction
Denaturation studies
Detelerious chemical reactions in proteins
Deamidation of asparagine residues
Isomerization of prolines
Destructive oxidation events
Proteolytic processes
Probing the stabilizing interactions in proteins
Replacement of conserved residues: E. coli
Thioredoxin
Carbohydrate side chains and protein stability
Is there a trade-off between stability and
Activity ?
8.1.3 Enzymes in Organic Media
Introduction
Enzyme behavior in anhydrous organic solvents
Some case studies
Water-
10.1 Introduction
10.2 Physical and chemical properties of soluble
proteins
10.2.1 Aqueous solubility
10.2.2 Hydrodynamic properties in aqueous
solution
10.2.3 Spectral properties
10.2.4 Ionization
10.2.5 Chemical properties
10.3 Proteins in membranes
10.3.1 Association with membrane
10.3.2 Structures of integral membrane proteins
10.3.3 Identifying amino acid sequences likely to
transverse membranes
10.3.4 Dynamic behavior in membranes
10.4 Flexibility of protein structure
GenBank (美国基因、蛋白数据库)
GeneCard(http://bioinformatics.weizmann.ac.il/cards/index.html)
EMBL (欧洲分子生物学实验室数据库)
HUGD(人类基因突变数据库)
DDBJ(日本国家遗传研究所基因数据库)
Swiss-Port(瑞士蛋白数据库)
蛋白三维结构数据库(美国 Brookheaven 实验室)
PIR(美国国家生物医学技术研究基金会蛋白数据库)
Human SNP Database(Whitehead institute/MIT center for genome
research)
OPD (Oligonucleotide probe database)
PROSITE (EMBL,蛋白序列中的特征序列及位点)
HSSP(EMBL, 三维结构已知的蛋白的同源蛋白)
DSSP(EMBL,蛋白二级结构及溶剂信息)
FSSP(EMBL,蛋白折叠方式相似性的结构家族)
SBASE(ABC,ICGEB,蛋白结构域、功能域资料)
TFD(NCBI,各种转录因子及其特性)
TRANSTERM(新西兰 Otago 大学, 翻译终止信号数据库)
Rebase(New england biolabs 公司,限制酶及甲基化酶数据库)
Genome Data Base (GDB)
The PredictProtein server(EMBL in Heidelberg)
BLAST (http://www.ncbi.nlm.nih.gov/BLAST/)
dbEST(http://www.ncbi.nlm.nih.gov/dbEST/)
Entrez (http://www.ncbi.nlm.nih.gov/Entrez/)
MMBD(http://www.ncbi.nlm.nih.gov/Structure/)
基因芯片数据库---?
MPDB(合成寡核苷酸探针数据库)---?
Chapter 2 Physic-chemical Properties of Proteins
Chapter 3 Methods for Characterization and
Purification of Proteins
Summary
1. Protein solubility is not a fixed quantity for a given
protein. Rather, it is a function of many variables. Two
of these are pH and salt concentration. Proteins show a
minimum solubility at their isoelectric point.
Frequently, proteins require the addition of a small
amount of salt to become soluble, but excessive
amounts of salt lead to protein precipitation.
2. There are several methods for determination of
molecular weight. These include sedimentation
analysis and gel-exclusion chromatography.
Sedimentation analysis may be used in two different
ways: (1) by independently determining the
sedimentation and diffusion rates and combining this
information to calculate a molecular weight and (2) by
equilibrium ultracentrifugation. Gel-exclusion
chromatography uses cross-linked polydextrans and
relates molecular weight to the rate of migration
through a column.
3. Electrophoretic methods are used in various ways to
characterize protein mixtures and purified proteins.
The high resolution attainable by electrophoresis
makes it ideal for determining the number of proteins
in a mixture as well as their approximate size
4. Methods of protein purification include differential
centrifugation, differential precipitation with
(NH4)2SO4, gel-exclusion chromatography, different
electrophoretic mobility, and differential affinities for
column matrice containing different functional groups.
Column procedures are particularly versatile because
of the large number of functional groups that can be
used to bind proteins in different ways and because of
the variety of conditions for differential column
elution.
Key-points:
From our presentation you will learn the
following:
1. Certain acidic and basic properties are common to all amino
acids found in proteins except for the amino acid proline.
2. Side chains give amino acids their individuality. These side
chains serve a variety of structural and functional roles.
3. The alpha-carboxyl group of one amino acid can react with
the alpha-amino group of another amino acid to form a
dipeptide.
4. Many amino acids, reacting in a similar way, can become
linked to form a linear polypeptide chain.
5. The Amino acid sequence in a polypeptide can be determined
by a process of partial breakdown into manageable
fragments, followed by stepwise analysis proceeding from
one end of the chain to the other.
6. Polypeptide chains with a prespecified sequence can be
synthesized by well-established chemical methods.
Example:
Figure 5.1 Schematic representation of an experiment to
demonstrate that the information for folding into a biologically
active conformation in contained in the protein’s amino acid
sequence.
Figure 5.2 Basic dimensions of (a) the peptide and (b) the
dipeptide
Figure 5.4 Ramachandran plot
Figure 5.5 Ramachandran plot(2)
1st rule: was that the peptidyl C-N linkage and the four
atoms to which the C and the N atoms are directly linked always
forms a planar structure;(which indicates the only flexibility in
the polypeptide backbone arises from rotation about the carbon
that joins adjacent peptide planar groups)
2nd rule: was that peptidyl carbonyl and amino groups
always form the maximum number of hydrogen bonds.
Taken together, these two rules drastically reduce the number of
possible conformations available to the polypeptide chain.
Outlines:
Protein structure prediction is becoming urgent because of the
increased discrepancy between the number of known protein sequences
and the number of experimentally-determined structure. In this chapter,
we will discuss (1) the principles of protein structure as they related to
the prediction problem; (2) the approaches of protein structure
prediction; and (3) some examples.
6.1 Introduction
3. Atomic packing
The net effect of attractive and repulsive van der Waals interactions
between atoms is to favor close atomic packing. Thus to a first
approximation the protein core resembles the solid state. Surface
residues and most atoms of the chain in the unfolded state are less
ordered and resemble in part the liquid state. Thus for residues that are
in the protein core, folding leads to a liquid→solid transition. This
transition is primarily enthalpic.
4. Conformational entropy
the formation of the folded structure restricts the dihedral
conformational space samples by the main chain and the buried side
chain. This freezing of rotamers is entropically unfavorable.
5.Electrostatic effects---ion pairs and hydrogen bonds
The net effects of hydrophobicity, close packing, and conformational
entropy would probably lead to a compact protein that lacks a specific
architecture. The specificity for the tertiary structure could be
considered as residing in the location of the hydrogen bonding and the
ion pairing groups. Electrostatic effects in the protein/solvent system are
complex.
An individual fully charged (or just apolar atom) extending from the
protein surface into water will be surrounded by a solvation shell of
water molecules. Transfering this charged atom the protein core is
energetically very unfavorable due to removing the solvation shell.
Thus, isolated charges are very rarely observed buried within proteins.
6. Disulfide bridges
The common view is that disulfide cross-links stabilize the folded state
by entropically restricting the degrees of freedom of the unfolded state
compared to the same chain without cross-links. For a single link, the
stability increases with the length of the link but for multiple bridges
there are complex effects. Typically a link will yield a few kcal mol-1 of
stability. There are small proteins whose stability is considered to be
enhanced by the entropic effect of multiple disulfide bridges.
slow fast
US UF N (7.3)
1. Introduction
Protein stability is a very important area of study within
biotechnology, Because:
(1) For enzymes, although enzymes are protein molecules that act as
extremely efficient catalysts, the usefulness of enzymes and
proteins as analytical tools and as industrial catalysts is often
limited by their requirements for “mild” storage and reaction
conditions. This is because many emzymes lose their catalytic
abilities over time, that is, they have poor operational or long-
term stability;
(2) Stability is an issue also in the development and use of protein-
based analytical or sensor devices, and it can be of literally vital
importance in protein pharmaceuticals or therapeutics, where
deterioration of protein preparations over extended storage
periods can be a serious drawback;
(3) Interesting in protein stability will likely grow due to the
increasing use of recombinant therapeutic proteins, the advent of
protein engineering and recent strides in understanding protein
folding.
4. Folding stability
5. Kinetic stability
Kinetic stability is distinct from (and need not correspond with)
thermodynamic stability. It involves measuring the persistence of
catalytic (or other biological) activity with time under adverse
conditions of temperature, pH, solvents, salt concentration and so on (or,
to put it another way, the progressive loss of function). In can be
represented by the scheme
Nkin→I
Where N is the native, functional protein, I is an irreversible inactivated
form and kin is the rate constant for the inactivation process.
To conclude, there are many indices of protein stability. The most
prominent of these are summarized in Table 1.
1. Introduction
The crucial importance of a protein’s tertiary structure, i.e. its
molecular shape, has been remarked upon in previous chapters. Tertiary
structure arises from interactions between the side chains (or R-groups)
of the covalently-linked amino acids making up the polypeptide. It is
the tertiary structure that orientates the critical residues and side chains
into the correct geometrical relationship to permit function. This is not
to state that a protein molecles is completely rigid, however. There is
aboundant evidence that enzymes and proteins undergo slight but
significant changes in shape on binding substrates or modulators. This
part examines denaturation or unfolding of a protein and the nature of
subsequent molecular changes leading to irreversible inaction. It also
surveys exploirations of stabilizing interactionsin proteins and considers
the contribution of the carbohydrate portions of glycoproteins to
stability.
2. Denaturation studies
Armed with the parameters described in previous section for
measurement of protein stability, researchers have been able to study the
loss of protein function or strucutre I a rational fashion. Understanding
the causes of activity loss can help on the formulation of stabilizing
strategies, since one will know what changes must be prevented. Many
enzyme deactivations have been characterized in detail. A first-order
exponential process describes many deactivations, since two-state
transitions are often observed in reality. More complex phenomena do
occur, however, but even some of these have been successfully modeled.
Oligomeric proteins are more likely to undergo complex deactivation.
Many monomeri proteins show two-state unfolding but there are
exception. The single subunit protease zymogen, pepsinogen, undergoes
a transition that is not two-state.
One can observe denaturation of most proteins at high temperatures.
This conformational stability of proteins is due to the (quite small) net
difference between a very large number of weak stabilizing interactions
and the nearly-equally large conformational entropy. This net free
(Gibbs) energy of stabilization (typically 40 kj mol-1) is equivalent to
that of a small number of interactions. Only a very few further
interactions is sufficient to explain the greater stabilities of very stable
or extremophilic proteins: a single interaction may contribute up to 25kj
mol-1. Some proteins, however, will denature at low temperatures and
this has been well described for metmyoglobin.
There are examples where the unfolding temperature of a protein of
interest decreases greatly in the presence of chaotropic agents. Finding
of this sort show that different denaturing influences (where
temperature, pHor chaotropic agents) act in an equivalent fashion by
encouraging unfolding of the three-dimensional protein structure. Where
inactivation takes place, some covalent change or alternation in the
degree of association occurs instead of, together with, or in addition to,
the unfolding phemomenon. Activity will be lost when the unfolding
disrupts the integrity of the molecule’s active or functional site(s). In
summary, loss of a protein’s biological activity can occur by either
conformational or covalent processes.
Air-liquid interfaces can also have important destructive effects on
proteins, especially under conditions of rapid mixing or agitation, as has
been demonstrated for a variety of proteins. Surfactants which
preferentially absorb to the air-water interface can prevent inactivation,
providing further eidence for this interfacial effect.
8. Conclusion:
It can be seen from discussion above, that some conclusions have
emerged from studies of native and mutant proteins under extreme
conditions. Inactivating covalent processes, which follow unfolding of
the polypeptide, fall into a small number of defined reactions.
Electrostatic interactions can be stabilizing in appropriate, rigid regions
of the polypeptide. However, tight packing of the hydrophobic core so
as to minimize cavities is perhaps the most important contributor to
folded protein stability.
1. Introduction
The successful use of non-membrane bound enzymes in biphasic
aqueous-organic systems (and even in anhydrous organic solvents) was
a novel and surprising development. Here the focus will be confined to
the stability aspects of such studies.
4. Combined Strategies
One can stabilize enzymes for use in the presence of organic
solvents by strategies such as immobilization or protein engineering.
Membrane-bound (as distinct from soluble) multi-subunit enzymes may
also be stabilized as protein-lipid complexes by organic solvents. It was
found that Cytochrome oxidase and H+-ATPase from inner
mitochondrial membrane were stabilized by factors of 100 and 9,
respectively when the water content of the toluene bulk phase was
reduced from 13 microliters per milliliter to 3 microliters per milliliter.
The judicious use of hydrophobic, non-polar solvents as alternatives to
water has tremendous potential for the achievement of stabilized
enzymes.
1. Inroduction
(1) Salts
A particular salt exerts stabilizing or destabilizing effects on proteins
depending on its position in the Hofmeister lyotropic series which
relates to the effects of salts on protein solubility:
(more stabilizing )(CH3)4N+>N+H4>K+, Na+>Mg2+>Ca2+>Ba 2+
SO4 2->Cl->Br->NO3->ClO4->SCN-
The stabilizing ions “salt out” hydrophobic residues in the protein,
causing the adoption of more compact structure. This effect may be
attributed to the increased ionic strength of the solution and to the
increase in the number of water clusters around the protein. This helps
prevent the unfolding which is the initial event in any protein
deterioration process.
Most stabilizing ions seems to act via a surface tension effect. Ions
can also stabilize proteins by shielding surface charges and can act as
osmolytes by affecting the bulk properties of water. Note that ammonia
sulfate, which is widely used as a stabilizing ions from the Hofmeister
list above, the NH4+ cation and the SO42+ anion. One can stabilize
protein in solution while avoiding precipitation by adding ammonium
sulfate to final concentration in the range 20~400mM. Besides a
ammonium sulfate, salts containing citrate, sulfate, acetate, phosphate
and quaternary ammonium ions are generally useful. Note, however,
that the nature of the counterion will influence the overall effect of such
compounds on protein stability.
3. Basis of stabilization
Volkin and Klibanov have classified stabilizing additives into three
classes:
(1) Specific: substrates and ligands, where the Native→Unfolded
equilibrium shifts towards the native form;
(2) Non-specific: neutral salts and polyhydroxyl compounds,
which function as explained above;
(3) Competitors: which out compete the enzyme for the
inactivating reagents or remove the catalysts of deteriorative
chemical reactions: examples included added protein,
chelating and reducing agents.
4. Use of Additives
It is important to note that the additives discussed below are
generally applicable as stabilizing agents for proteins but a given
substance may not be effective for a particular protein. Both sucrose and
PEG, for instance, are good stabilizers of invertase but have denaturing
effects on lysozyme; the same additive has contrary effects on the two
enzymes.One should note the stabilizing or destabilizing effects of the
component ions of a salt when choosing additives; ref.6 includes a
useful discussion of this topic.
Osmolytes are a diverse group of substances comprising such
compounds as polyols, mono-and polysaccharides, neutral polymers
(such as PEG) and amino acids and their derivatives. One should use
polyols and sugars at high final concentrations: typical figures range
from 10-40%(w/v). Sugars are reckoned to be the best stabilizers but
reducing sugars can react with protein amino groups, leading to
inactivation. This problem can avoided by using non-reducing sugars or
the corresponding sugar alcohols. Glycerols is a very widely used low
molecular weight polyol. Its advantages include its ease of removal by
dialysis and its noninterference with ion exchange chromatography.
However, glycerol suffers from two significant disadvantages: it is a
good bacterial substrate and it greatly lowers the glass transition
temperature of material to be preserved by lyophilization or drying. The
5-carbon sugar alcohol, xylitol, can often replace glycerol; it can be
recycled from buffers and is not a convenient food source for bacteria.
Amino acids with no net charge, notably glycine and alanine, can act
as stabilizers if used in the range 20~500 mM. Amino acids and
derivatives occur as osmolytes in nature.
5. Substrate and specific ligands
3. Conclusion
A wide range of protein and enzyme properties have successfully been
altered by chemical modification procedures. These include solubility,
catalytic activity, substrate selectivity and stability. Chemical
modification can dramatically increase protein stability, with good
recoveries of activity. Active sites may be protected by specific ligands,
if required, but some degree of inactivation is always a possibility.
Partial loss of biological activity may be due either to decreased activity
of all the enzyme/protein molecules in the system or to total inactivation
of some of the molecules present. It can be difficult to distinguish
between these alternatives. Lessened activity among the entire
population of molecules often leads to altered kinetic parameters or pH
profiles, while a decreased turnover number accompanied by an
unchanged Km suggests the presence of a molecular fraction with
unaltered activities. Complete loss of biological activity following a
given modification reaction is usually interpreted in terms of the
targetted residue(s) being essential for activity.
Despite these reservations, chemical modification may be carried out
and characterized quite quickly. The information gained is ueful for
further rounds of chemical modification or for suggesting target sites for
site-directed mutation of the protein of interest. The ability to create
non-protein amino acid derivatives by chemical modification usually
complements the scope of mutational/protein engineering strategies for
the study or manipulation of proteins. Meanwhile, mutagenesis and
expression techniques continue to improve while chemical techniques
are likewise becoming more sophisticated. Very many reagents satisfy
the main criteria of a useful protein modifier, namely a high specificity
for one type of amino acid residue within a protein molecule and an
ability to react with that residue under mild conditions of pH and solvent
composition. It is likely that these powerful and complementary
strategies will be increasingly combined in the future.
8.2.3 Immobilization
1. Introduction
The laboratory scientists and biotechnologist will often need to
store an isolated or purified protein for varying lengths of time. If the
protein is an object of study, it will take some time to ascertain its
properties. If it is a commercial end product, or finds use as a tool in
some procedure, it will likely be used in small quantities over an
extended period. The protein must retain as much as possible of its
original, post-purification, biological (or functional) activity
throughout this time. The storage period or “shelf life” can range
from a few days to more than one year. The protein’s long-term or
kinetic stability becomes critically important under these conditions.
Shelf life can depend on the nature of the protein and on the storage
conditions. How one can prevent deterioration due to microbial
contamination and proteolysis and that correct use of low temperature
for extended storage are the focus of this section. It also considers
drying and freezes drying as processes for long-term protein
preservation. Of course, one cannot guarantee that any or all of these
necessarily-broad recommendations will “work” for a given protein,
or that a particular stabilization factor will result from a procedure
that seems to “work”.
3. Avoidance of proteolysis
It can be difficult to remove proteases completely during
purification of a target protein. Unless the object protein is
completely pure, even tiny amounts of contaminating proteolytic
enzymes can cause serious losses of activity during extended storage
periods. The molecular diversity of proteases complicates the
situation: there are exopeptidase and endopeptidase. In addition, there
are four types of proteases classified by their molecular reaction
mechanisms: the serine, cysteine (or thiol), acid and metallo-
proteases. Use of EDTA in the concnetration range 2-5 mM should
complex the divalent metal ions essential for metalloprotease action.
Pepstatin A is a potent but reversible inhibitor of acid proteases. It is
used at concentrations around 0.1 micromolar, as are similar protease
inhibitors. The compound phenylmethylsulphonyl fluoride (PMSF)
reacts irreversibly with the essential serine in the active site of serine
proteases, inactivating them. It can also act on some thiol proteases.
It is typically used at a final concentration of 0.5-1.0 mM, following
dissolution in a solvent such as acetone. Before addition, of course,
one must ensure that none of these compounds will adversely affect
the protein of interest.
If the protein of interest is itself a proteolytic enzyme, use of
protease inhibitors is not feasible. One may need to store such a
protein in dried form or as a freeze-dried preparation. Alternatively,
one can place it in a solution with a pH value far removed from the
protease’s optimum pH. Trypsin, for example, is most active at
mildly alkaline pH values. Daily stock solution of trypsin are often
prepared in 1mM HCl, where the very acid pH value renders the
enzyme effectively incapable of catalysis. This helps prevent
autolysis during the course of the experiment. The enzyme molecule
does not inactivate under these conditions and is fully active on
dilution into a suitable assay solution.
7. Freezing drying
Lyophilization, or freeze drying, is a method for the preservation of
labile materials in a dehydrated form. It can particularly suitable for
high value biomolecules such as proteins. The process involves the
removal of bulk water from a frozen protein solution by sublimation
under vacuum with gentle heating (primary drying). This is followed by
controlled heating to more elevated temperatures for removal of the
remaining “bound water” from the protein preparation (secondary
drying). Residual moisture levels are often lower than 1%. If the freeze
drying operation is carried out correctly, the protein will preserve all or
most of its initial biological activity of the protein in question.
8.3 Conclusion
This chapter has tried to show that many different approaches have
contributed to our understanding of, and ability to manipulate, protein
stability. It is likely that such cross-fertilization will continue in the
future. Advances in determining three-dimensional protein structure will
aid molecular modeling and rational design approaches. Mutagenesis
and expression techniques continue to improve while chemical
techniques are likely becoming more sophisticated. It is likely that these
powerful and complementary strategies will be increasingly combined
in the future.
Available stabilization strategies include immobilization, use of
additive, chemical modification and protein engineering. (see Table).
Keypoints:
1. How does the structure of proteins relate to the
function for which they were designed?
2. How cells design protein (from the evolutionary
viewpoint)?
The first oxygen attaches itself with the lowest affinity, and successive
oxygen are bound with a higher affinity. The exact value of n for
hemoglobin is a function of the extent of oxygen binding as well as the
pressure of other factors. In general, a value of n>1 indicates
cooperative binding (or positive cooperativity) between small-molecule
ligands, a value of n<1 indicates anticooperative binding (or negative
cooperativity), and a value of n=1 indicates no cooperativity.
10.1 Introduction
Many of these properties are not evident when proteins are crystallized,
but appear in solution or in membrane where the proteins are more
flexible. Nevertheless, knowing the crystal structure of a protein is
necessary to understand its properties under other conditions.
dr/dt=[(Mw(1-νρ))/NAf]ω2r
3. Gel filtration
4. Rotation
1. Absorbance
Absorbance of UV light by proteins is not very sensitive to thei
conformation or environments, except for that by the aromatic rings of
Phe, Tyr, and Trp residues. The spectral properties of the aromatic
residue reflect their environment. Their absorbance spectra are shifted
somewhat to longer wavelengths in a nonpolar environment such as the
interior of a protein. The absorbance spectra of the aromatic groups
consequently can be used to determine their average exposure to water.
2. Fluorescence
Fluorescence by the aromatic side chains is much more sensitive to their
environment than is absorbance, but it varies in an unpredictable
manner. The quantum yield may be either increased or decreased by
folding, so a folded protein can have either greater or less fluorescence
than the unfolded form. The magnitude of the fluorescence is not very
informative in itself, but it can serve as a sensitive probe of any
perturbations of the folded state.
Fluorescence by a protein is especially complex when there is more than
one aromatic side chain. The close proximity of aromatic groups in a
folded protein usually results in very efficient energy transfer between
them. ……
3. Circular dichroism
The CD and optical rotary dispersion(ORD) spectra of a protein are very
sensitive to its conformation. In the far-UV region (below 250nm), these
spectral characteristics are determined primarily by the polypeptide
backbone conformation, especially its secondary structure. The
spectrum of a protein of known structure is usually close to that
expected from the average of the spectra of α-helice, β-sheets, and
irregular conformations of model polypeptides, weighted by the fraction
of the polypeptide chain in each conformation. Consequently, CD
spectra can be used to estimate the relative proportions of the various
types of secondary structure in a protein. Early methods interpreted he
CD spectrum in terms of the model spectra of α-helice, β-sheets, and
irregular conformations; more recent procedures use spectra of a
number of proteins of known structure to fit the spectrum being
analyzed. As long as the unknown spectrum does not have any unique
features, fitting it with actual protein spectra usually gives the most
meaningful interpretation. However, other chromophores, especially
aromatic rings, can contribute significantly to the far-UV spectrum of a
protein. Recently, Infrared and Raman spectroscopy are being developed
to measure protein secondary structure in solution.
10.2.4 Ionization
The folded conformation of protein have a variety of effects on the
ionization of their polar groups. Many charged groups are brought into
close proximity on the surface of a folded protein, so ionization of
groups that would increase the net charge may be hindered. This general
electrostatic effect influences the ionization of all the groups. Specific
interactions, such as hydrogen bonding or salt bridging, also occur and
primarily affect the ionization of particular groups. The kPa values of
groups can be influenced by many environmental and electrostatic
effects in small molecules. The variety of environments in folded
proteins can produce very unusual ionization properties. The pKa values
of residues of one type can vary widely within a single protein, Often
over a range of 3-4 pH units, because of their different environments.
b. Porin
3. Identifying amino acid sequences likely to transverse membranes
The integral membrane proteins of known structure are not markedly
different in structure or amino acid composition from water-soluble
proteins, except that they are slightly more hydrophobic. They differ
mainly in the nature of the amino acid side chains that are on part of
their surfaces. Those side chains of membrane proteins that are on the
surface in contact with the membrane bilayer are less polar than the
protein interior, whereas the surfaces of water-soluble proteins are much
more polar than the interior. Those observations make it likely that
segments of polypeptide chains that traverse membranes could be
identified from their amino acid sequences alone.
These are a few of the relatively short-term projects that the protein
engineers are currently focusing their efforts on. The long-term goal is
to provide a theoretical framework for addressing the relationship
between the three-dimensional structure and the function of a protein.
c. Mutagenesis principle
Most protein engineering involves recombinant DNA
methodologies, but other methods such as random mutation via DNA
damaging agents or environmental pressures have been used. A gene
encoding the target protein is cloned from the original source or
synthesized and subcloned based on the protein sequence of interest.
Once the gene for the target protein is cloned, one or many amino acids
can be substituted, deleted, or inserted into the gene. Both natural and
unnatural amino acids can be introduced into the gene at this point.
The ready availablity of synthetic DNA provides a protein engineer
with new vistas in the manipulation and selective alteration of cloned
DNA. In particular, it is now feasible to change any cloned nucleotide
sequence to any other desired sequences and to determine the effect of
the change. One of the most powerful techniques for accomplishing
such nucleotide substitutions, insertions, or deletions is through the use
of synthetic DNA oligonucleotides.
2. Oligosynthesis
a. The synthesis of DNA is based on the Merrifield principle of solid-
phase chemistry that has previously developed for the synthesis of
peptides. Activated nucleotided are sequentially added to the 5’
hydroxyl of an oligo chain bound to an insoluble solid support by a 3’
hydroxyl linkage.
One of the most powerful indirect methods for assessing the correct
incorporation of the structural alteration expected from the engineering
is mass spectrometry, specially using electrospray methodology. ……
As noted above, the strongest link between the introduced change and
the observed alterations in structure or function comes from the
demonstration that no other changes have been introduced, at least in the
amino acid sequence. The identity of the peptide map in all regions
other than those affected by the desired change can usually be taken as
good evidence that there have been no other changes. Table 1.4 shows
some of the methods that might be used to investigate such changes or
to confirm that no observable differences exist between the parent and
wild-type protein.
1. What is needed?
Prior to altering the composition of the wild-type protein, the protein
engineer must have a well-defined rational for mutagenesis studies. In
particular, these studies fall into two primary, although not mutually
exclusive, areas: structure-function analysis and reconfiguraton of the
protein to provide useful properties (e.g. increased stability, longer half
life, etc.). Structure-function analysis is often performed to understand
the critical elements of the protein. Many of these studies involve large
screens of mutants, and these mutants are often dramatically different
from the wild-type protein. Another common alteration in the wild-type
protein is cassette substitution or removal of several adjacent amino
acids, with each mutant containing a different set of substitutions. This
type of mutagenesis scan often pinpoints the critical functional domain
of the protein. Of course, each of these mutants may significantly alter
the protein’s conformation, and, therefore, it is critical to determine the
effects of the mutations by analytical techniques. Assuming that the
protein conformation (secondary and tertiary structure) remains
significantly unaltered, the observed differences in function between the
mutant and wild-type molecules provide essential insight into the
importance of the mutated regions in the protein’s overall function.
2. Industrial issues
Protein engineering holds great promise for elucidating the underlying
mechanisms of protein structure, function and folding. Beyond
enhancing the general understanding of proteins, proteins, protein
engineering has the potential to great improve their use in industrial
applications. For enzyme design, protein engineering provide the
opportunity to design more efficient and stable enzymes. These enzymes
can be used to catalyse complex reactions that, by standard chemical
methods, are inefficient or lead to racemic mixtures resulting in impure
products or intermediate. In addition, enzymes for many industrial
chemical reaction should be stabilized against environmental stresses
such as heat or organic solvents.
蛋白质组研究的兴起
在后基因组时代,研究的重点已从揭示遗传信息转移到功能基因组学上来。
但是,由于生物功能主要体现者是蛋白质,而蛋白质有其自身特有的活动规
律。如蛋白质修饰加工、转运定位、结构变化、蛋白质与蛋白质间、蛋白质与其
他生物大分子的相互作用等,均无法在基因组水平上获得。因为基因组学有
样的局限性,促使人们从整体水平上探讨细胞蛋白质的组成及其活动规律。
蛋 白 质 组 和 蛋 白 质 组 学 概 念 的 提 出
蛋白质组学(Proteomics)是以蛋白质组为研究对象的新的研究领域。
它可分为:①表达蛋白质组学(expression proteomic 即把细胞、组织中
的蛋白,建立蛋白定量表达图谱,或扫描 EST 图。该方法依赖 2-D 凝胶图
和图像分析技术,而且在整个蛋白质组水平上提供了研究细胞通路,以及疾
病、药物相互作用和一些生物刺激引起的功能紊乱的可能性。②细胞图谱蛋白
质组学(cell-map proteomics):即确定蛋白质在亚细胞结构中的位置;
通过纯化细胞器或用质谱仪鉴定蛋白复合物组成等,来确定蛋白质 -蛋白质
的相互作用。Humphery-Smith 等总结了基因组结果后提出了"功能蛋白
质组( Functional Proteome)"的新概念。即细胞内与某个功能有关或在
某种条件下的一群蛋白质。鉴于此,我国学者李伯良提出了"功能蛋白质组学
(Functional Proteomics)"的概念。即把"功能蛋白质组"作为主要研究内
容。这一概念的提出,为蛋白质组研究的可能性奠定了理论基础。
蛋白质组研究的理论基础
蛋 白 质 组 分 析 主 要 基 于 3 条 理 由 :
③ 蛋白质组是动态反映生物系统所处的状态。细胞周期的特定时期、分化的不
同阶段、对应的生长和营养状况、温度、应激和病理状态,这些状态所对应的
蛋白质组是有差异的。蛋白质组学的研究可望提供精确、详细的有关细胞或组
织状况的分子描述。因为诸如蛋白质合成、降解、加工、修饰的调控过程只有通
过 蛋 白 质 的 直 接 分 析 才 能 揭 示 。
蛋 白 质 组 学 用 于 医 疗 研 究 的 重 点
蛋白质组方面的研究,将帮助人们寻找到一些用于医疗的可识别蛋白,
这些蛋白可作为诊断标记或作为诊断靶分子提供给从事医药和诊断研究的机
构 。 研 究 主 要 有 以 下 五 个 方 面 :
1.癌症针对研究的肿瘤类型包括:食道、肺、结肠、前列腺、胰腺、乳房以及成
神 经 细 胞 瘤 。
2.神经性疾病研究方向主要包括:脑损伤和感染性蛋白质疾病,如克雅氏
病 ( CJD ) 、 牛 海 绵 状 脑 病 ( BSE ) 、 帕 金 森 氏 病 。
3.器官移植排异蛋白质组研究将寻求一种体外检测的方法,用于人体器官
(心脏、肝、肺或肾)移植后的过敏和慢性排异性反应。
4.心血管疾病列入研究的心血管疾病有心力衰竭、高血压合肥大型心肌炎。
5.糖尿病、肥胖症通过蛋白质组学方法对于肥胖症及糖尿病相关的多肽进行
识 别 , 作 为 潜 在 的 识 别 分 子 和 治 疗 靶 象 。
SWISS-2DPAGE
Two-dimensional polyacrylamide gel electrophoresis database
[Search][Documents][Services][Software][Related
servers][Other databases][Job openings]
Access to SWISS-2DPAGE SWISS-2DPAGE documents
• by description (DE lines) or by ID • User manual
• by accession number (AC lines) • Release notes (December 19, 2000)
• by clicking on a spot: select one of our 2-D • Protocols:
PAGE reference maps, click on a spot and then o Technical information about 2-D
get the corresponding information from the PAGE (IPG's, silver staining,
SWISS-2DPAGE database. protocols, etc)
• by author (RA lines) o High performance 2-D gel comparison
• by spot serial number (2D lines) • 2-D PAGE maps published:
• by full text search o Human CSF, ELC, HEPG2,
• SRS, searching in SWISS-2DPAGE using the HEPG2SP, LIVER, LYMPHOMA,
Sequence Retrieval System PLASMA, PLATELET, RBC, U937,
• retrieve in a table all the protein entries CEC, KIDNEY.
identified on a given reference map
o Dictyostelium discoideum, Escherichia
• compute estimated location on reference maps coli, Saccharomyces cerevisiae.
for a user-entered sequence
Services Software
• Downloading SWISS-2DPAGE by FTP • Melanie 3 - Software package for 2-D PAGE
• SWISS-2DSERVICE - Get your 2-D Gels analysis
performed according to Swiss standards
• 2-D PAGE training - attend a one week course • Make2ddb package - A package preparing the
in Geneva data and the programs necessary to build a
federated 2-DE database on one's own web site.
• 2-D PAGE museum - gels run by trainees
during the 2-D PAGE courses
Gateways to other 2-D PAGE related servers and services
• 2D Hunt - 2-D electrophoresis web site finder
• Yahoo - Science:Biology
Miscellaneous Local links
• Protein Spotlight • Geneva and Swiss local pages
• Links to conferences and events • Swiss Institute of Bioinformatics (SIB)
• Swiss-Quiz • The Health On the Net foundation (HON)