Beruflich Dokumente
Kultur Dokumente
REVIEWARTICLE
ARTICLE
ADVANCES in genetic engineering have made possible the Heterologous production of proteins
production of therapeutics and vaccines for human and
animals in the form of recombinant proteinsl,2. These bio- Protein over-expression refers to the directed synthesis of
technology-derived recombinant proteins form a new large amounts of desired proteins. The heterologous pro-
class of drugs for many ailments like genetic disorders, duction of proteins and enzymes involves two major
cancer, hypertension and AIDS for which we have no steps:
better treatment or cure. Unlike chemical drugs, biologi-
(1) Introduction of foreign DNA into the host cells. This
cals are our own molecules and hence more compatible
step has three major considerations. (a) Identification and
with biological systems. At present there are more than
isolation of the DNA to be introduced; (b) Identification
100 biotechnology-derived therapeutics and vaccines
of the vector and construction of recombinant vector;
approved by US FDA for medical use and over 1000
(c) Identification of the suitable expression system to re-
additional drugs and vaccines are in various phases of
ceive rDNA.
clinical trials. In addition, use of DNA, proteins and
enzymes in diagnostics is increasing exponentially. Indus- (2) Factors affecting the expression of foreign DNA for
trial uses of enzymes in food, textile, leather, detergent, protein synthesis in the chosen expression system.
medicinal chemistry sectors are also increasing rapidly.
Points (1a) and (1b) are topics in themselves and will
The growing need of therapeutic and other applications of
not be dealt with in this review. Briefly, at present a
enzymes and proteins could only be met by heterologous
variety of vectors are available to ferry DNA in and out of
synthesis of recombinant proteinsl,2. Table 1 indicates key
cells: plasmids, lambda phage, cosmids, phagmids, artifi-
therapeutic recombinant biotech products with their recent
cial chromosomes from bacteria, yeast or human origin
market sales and names of key manufacturers. Table 2
(BAC, YAC and HAC3, respectively). The vectors could
shows how rapidly many new therapeutic products have
either be integrating (becomes part of the host’s chromo-
recently entered the market and the packed pipeline of
somes) or extrachromosomal. They could be in copies
varying from one to several hundreds.
In general, expression vectors have the following attri-
†
For correspondence. (e-mail: perd@wilnetonline.net) butes (Figure 1):
CURRENT SCIENCE, VOL. 80, NO. 9, 10 MAY 2001 1121
REVIEW ARTICLE
1998 sales
Product (billion USD) Manufacturer
1994 17 –
1995 33 450
1996 79 700
1998 > 100 1200
vectors to be faithfully replicated and distributed to cytoplasmic proteins often results in the release of endo-
daughter cells during cell division. Use of artificial chro- toxins, which must be removed from the final product.
mosomes in commercial production of heterologous Currently, strategies to secrete the target proteins by trans-
proteins remains unexplored. location into the periplasmic space or to release the target
proteins by linking to existing excretory systems are being
developed8. Additionally, the efficiency of expression will
Available expression system also depend on differences of codon utilization by bacteria9.
At times the original sequence of the heterologous gene
Prokaryotic and eukaryotic systems are the two general
has to be modified to reflect the codon usage by the cho-
categories of expression systems. Prokaryotic systems are
sen expression system. E. coli has toxic cell wall pyrogens
generally easier to handle and are satisfactory for most
and hence products need to be tested more extensively
purposes. However, there are serious limitations in using
before use.
prokaryotic cells for the production of eukaryotic pro-
teins. For example, many of the eukaryotic proteins un-
dergo a variety of post-translational modifications like Yeast
proper folding, glycosylation, phosphorylation, formation
Yeast is the favoured alternative host for expression of
of disulphide bridges, etc. There is no universal expres-
foreign proteins for research, industrial or medical use10.
sion system for heterologous proteins. All expression sys-
As a food organism, it is highly acceptable for the produc-
tems have some advantages as well as some disadvantages
tion of pharmaceutical proteins. In contrast, E. coli has
that should be considered in selecting which one to use.
toxic cell wall pyrogens and mammalian cells may contain
Choosing the best one requires evaluating the options –
oncogenic or viral DNA. Compared to mammalian cells,
from yield to glycosylation, to proper folding, to econo-
yeast can be grown relatively rapidly (doubling time
mics of scale-up.
90 min) on simple media and to high cell density and its
genetics is more advanced than any other eukaryote, so
Bacterial system that it can be manipulated as readily. Added advantages
are the availability of complete genomic sequence, the
E. coli is by far the most widely employed host, provided nuclear stable high copy plasmids and ability to secrete
post-translational modifications of the product are not the target protein11. Saccharomyces strains have high
essential. Its popularity is due to the vast body of know- copy stably-inherited plasmid of 6.3 kb known as 2-
ledge about its genetics, physiology and complete geno- micron plasmid, which codes for 4 genes FLP, REP1,
mic sequence which greatly facilitates gene cloning and REP2 and D. It also contains an open reading frame
cultivation4,5. High growth rates combined with the ability (ORF), STB locus (required in cis for stabilization) and
to express high levels of heterologous proteins, i.e. strains two 599 bp inverted repeat sequences. FLP encodes a
producing up to 30% of their total protein as the ex- site-specific recombinase which promotes flipping about
pressed gene product, result in high volumetric producti- the FLP recombination targets (FRT) within the inverted
vity. Furthermore, E. coli can grow rapidly to high densities repeats, so that cells contain two forms of 2-micron plas-
in simple and inexpensive media. Strains used for recom- mids, A and B. The simple 2-micron shuttle vectors con-
binant production have been genetically manipulated so tain the 2-micron ORI-STB, a yeast selectable marker and
that they are generally regarded as safe for large-scale bacterial plasmid sequence (ori and selection markers)
fermentation. Purification has been greatly simplified by and are used in host strains which supply REP1 and REP2
producing recombinant fusion proteins which can be proteins.
affinity-purified, e.g. glutathione-S-transferase and maltose- These lower eukaryotic systems are able to glycosylate
binding fusion proteins6. However, expression in bacteria the target proteins, but it has been shown that both N- and
does have some serious disadvantages. It poses significant O-linked oligosaccharide structures are however, signifi-
problems in post-translational modifications of proteins. cantly different from their mammalian counterpartsl2,13.
Common bacterial expression systems such as E. coli Hypermannosylation (addition of a large number of man-
have no capacity to glycosylate proteins in either N- or nose residues to the core oligosaccharide) is a common
O-linked conformation. Although other bacterial strains feature in yeast, hindering proper folding and therefore
such as Neisseria meningirulls have recently been shown the activity of the protein. At the moment yeast provides a
to O-glycosylate some of their endogenous proteins, the good compromise between bacteria on one side and
trisaccharide added is different from O-linked sugars mammalian cell lines on the other.
found in eukaryotes. Protein expressed in large amounts
often precipitates into insoluble aggregates called inclu- Insect
sion bodies, from which it can only be recovered in an
active form by solubilization in denaturing agents The baculoviruses have emerged as a popular system for
followed by careful renaturation7. Lysis to recover the overproducing recombinant proteins in eukaryotic
CURRENT SCIENCE, VOL. 80, NO. 9, 10 MAY 2001 1123
REVIEW ARTICLE
cellsl4,15. Several factors have contributed to their popu- system for expressing recombinant proteins19,20. Circular
larity. Being eukaryotes, they use many of the protein plasmids are common in prokaryotes, but only a few
modifications, processing, and transport systems present eukaryotes have been identified and studied for having
in higher eukaryotic cellsl6. They use a helper-indepen- circular nuclear plasmids and Dictyostelium is one of
dent virus that can be propagated to high titers in insect them21,22. The cellular slime molds have families of
cells adapted for growth in suspension cultures, making it similar plasmids which are found in the nuclei of different
possible to obtain large amounts of recombinant proteins species. They are organized differently from the yeast 2-
with relative ease. Expressed proteins are usually ex- micron like plasmids. Dictyostelium plasmids are pack-
pressed in the proper cellular compartment, i.e. membrane aged in a nucleosomal structure similar to the chromatin
proteins are usually localized to the membrane, organization of higher eukaryotes. The best characterized
nuclear proteins to the nucleus and secreted proteins family is that of the Ddp2-like plasmids. These plasmids
secreted into the medium. Majority of the overproduced are small (4.4–5.8 kb), encode ORF and contain an
proteins remain soluble in insect cells. Viral genome is inverted repeat. The product of ORF is required in trans
large (130 kb) and thus can accommodate large fragments for plasmid maintenance, while an approximately 600 bp
of foreign DNA. Baculoviruses are non-infectious to fragment, presumably containing the origin of replication
vertebrates and their promoters have been shown to be is required in cis. The second best characterized plasmid
inactive in mammalian cells, which gives them a possible family is Ddp1, found in wild type isolates like NC4 and
advantage over other systems when expressing oncogenes V12. Ddp1 is a 13.7 kb plasmid which is present at an
or potentially toxic proteins. Also the process develop- estimated copy number of 50–100 per cell and encodes at
ment time is short. Expression using baculoviral vectors least 5 growth-specific and 5 development-specific tran-
also has some limitations. Since baculoviruses infect scripts. Although Ddp1 is larger and highly transcribed
invertebrates, it is possible that the processing of proteins than the plasmids of the Ddp2-like family, none of the
produced by vertebrates is different and this seems to be known transcripts seems to be essential for its replication.
the case for some post-translational modifications, e.g. However, while approximately 1.2 kb region appeared
internal proteolytic cleavages at arginine- or lysine-rich to carry all the elements necessary for extra chromosomal
sequences are highly inefficient. The glycosylation capa- replication, in the absence of selection, this fragment is
bility is generally limited to producing only high mannose quickly lost from the sub-population. In addition, deletion
type and not processed to complex type oligosaccharides of plasmid sequences outside the region results in reduc-
containing fucose, galactose and sialic acid. tion of plasmid copy number. Therefore at least some of
the Ddp1-encoded genes are required for the long-term
maintenance of the plasmid at its normal copy number.
Mammalian cells17,18 The development of reliable transformation systems for
D. discoideum has provided the possibility of expressing
Ideally, proteins requiring mammalian post-translational
heterologous genes in this microbe23,24. Dictyostelium is a
modifications should be expressed in mammalian cells. If
simple eukaryotic micro-organism with a haploid genome
product authenticity is absolutely essential for clinical
of 5 × 107 bp and a lifecycle that alternates between
efficacy, then despite the many shortcomings, a mammalian
single-celled and multicellular stages. They grow to a
host is the only choice, as it offers the greatest degree of
high cell density without the serum factors or special aera-
product fidelity. It should, however, be noted that oligo-
tion needed by animal cell cultures. There is no cell wall
saccharide processing is species- and cell type-dependent
and the high copy number plasmid vectors allow the
among mammalian cells. Differences in glycosylation
expression of protein in cell-associated, membrane-
pattern are reported in rodent cell lines and human tissues.
attached or secreted form under the control of regulatable
Even the use of human cell line is not perfect, since the
promoters25. The cells of Dictyostelium can carry out both
transformation event required in most cases to produce a
O- as well as N-glycosylation26,27. The major advantages
stable cell line may itself result in altered glycosylation
of this system include a very simple and cheap growth
profiles. Also mammalian expression techniques are time
medium and the potential for large-scale production of
consuming and much more difficult to perform on a large
proteins.
scale. Complex nutrient requirement and low product
concentration have meant that the end product must be
highly value-added for this approach to be commercially
Factors affecting intracellular expression
viable.
Having the target DNA in an appropriate vector in the
Dictyostelium discoideum expression system of choice is the first step in optimizing
production of the heterologous proteins. Within a given
Recently, the cellular slime mold Dictyostelium dis- system the transcription and translation processes leading
coideum has been developed as an alternative eukaryotic to the heterologous protein production are a complex set
1124 CURRENT SCIENCE, VOL. 80, NO. 9, 10 MAY 2001
REVIEW ARTICLE
species, unless the original species comprises a large pro- been found to increase solubility in some cases42. This
portion of the total mRNA. Thus, the overall rate of trans- may be due to a decreased translation rate or to the fact
lation of an mRNA is not usually affected by a slower that hydrophobic interactions, such as those occurring in
elongation rate unless ribosomes become limiting, which aggregates, become less favourable. A dramatic increase
would affect all transcripts in the cell. In contrast to the in the yield of active, soluble protein is observed on
normal situation, there is evidence that codon usage may reducing the rate of induction43.
affect both the yield and quality of a protein when a gene
is transcribed to very high levels. With very high levels of
mRNA containing rare codons, aminoacyl-tRNAs may Post-translational processing
become limiting, increasing the probability of mistransla-
tion38, which is the incorporation of an amino acid which Prokaryotic expression systems are generally useful for
does not correspond to the codon being translated, and producing heterologous proteins from cloned eukaryotic
possibly causing ribosomes to drop off. Thus the codon cDNA. In some cases however, eukaryotic proteins that
content of a foreign gene may influence the yield of pro- have been synthesized in bacteria are either unstable or
tein where the mRNA is produced at very high levels. lack biological activity. The inability of prokaryotic
This may more likely occur on growth in minimal organisms to produce authentic versions of proteins is for
medium, when the cell produces a wide variety of biosyn- the most part due to the absence of appropriate mecha-
thetic enzymes encoded by genes containing rare codons. nisms for generating certain post-translational modifica-
The effect on product quality has been difficult to meas- tions. In eukaryotes there are a number of modifications
ure, but requires further attention since it has further that may occur at the post-translational stage, after protein
implications for therapeutic proteins. Proteins containing synthesis is complete.
amino acid mis-incorporation are difficult to separate and
may affect the activity and antigenicity of the product. Amino-terminal modifications of polypeptides: These
Since small genes are now frequently synthesized chemi- are the most common processing events and occur on
cally, they may be easily and perhaps profitably engi- most cytosolic proteins44. Two types of events normally
neered to contain optimal codons for high-level expres- occur – removal of the N-terminal Met residue, catalysed
sion. mRNA secondary structure, in addition to codon by Met aminopeptidase (MAP) and acetylation of the N-
usage may affect translational elongation39. terminal residue, catalysed by N-acetyltransferase (NAT).
Both enzymes are associated with ribosomes and act on
nascent polypeptide. In most cases the structure of the N-
Polypeptide folding terminus should not affect the biological activity of a pro-
tein, but there may be exceptions, e.g. the response of
During or following translation, the polypeptide must fold haemoglobin to physiological modifiers involves the N-
so as to adopt its functionally-active conformation. Since terminus and correct folding of alpha and beta globins
many denatured proteins can be refolded in vitro, it is therefore advantageous. Similarly N-acetylation of
appears that the information for correct folding is con- melanocyte-stimulating hormone is required for full bio-
tained in the primary polypeptide structure40. However, logical activity.
folding comprises rate-limiting steps during which some
molecules may aggregate, particular!y at high rates of Disulphide bond formation: In eukaryotes, formation of
synthesis and at higher temperatures. There is evidence disulphide bond (cys-s-s-cys) occurs in the lumen of RER
that certain heat shock proteins act as molecular chaper- and is mediated by an enzyme called disulphide isom-
ones in preventing the formation and accumulation of erase45. Disulphide bond is confined to secretory proteins
unfolded aggregates, while accelerating the folding reac- and exoplasmic membrane proteins. This is important in
tions. Due to the intrinsic nature of polypeptide folding stabilization of tertiary structure. An improperly folded
and low specificity of chaperones, it is very unlikely that protein is unstable and lacks activity.
foreign cytosolic proteins will accumulate in non-native
conformations, but when fragments of proteins or fusion Proteolytic cleavage of a precursor form: This is required
proteins are expressed, normal folding domains may in some cases. Selected segments of amino acid sequences
be perturbed, resulting in an insoluble product. Insolu- are removed to yield a functional protein46.
ble proteins can often be renatured in vitro, though
the techniques for this can be complex and unpredict- Glycosylation: This is the most extensive of all the post-
able41. In contrast to intracellular proteins, naturally translational modifications and has important function in
secreted proteins encounter an abnormal environment in secretion, antigenicity and clearance of glycoproteins47.
the cytoplasm; disulphide bond formation is not favoured Oligosaccharides can attach to proteins in three ways:
and glycosylation cannot occur. In E. coli, foreign pro- (i) Via an N-glycosidic bond to the R-group of an
teins are frequently insoluble, but low temperature has Asn-residue within the consensus sequence Asn-X-Ser/
1126 CURRENT SCIENCE, VOL. 80, NO. 9, 10 MAY 2001
REVIEW ARTICLE