Genetic Engggggg

JBC Papers in Press. Published on October 20, 2010 as Manuscript M110.
139659
The latest version is at http://www.jbc.org/cgi/doi/10.1074/jbc.M110.139659
PROSPECTING METAGENOMIC ENZYME SUBFAMILY GENES FOR DNA

FAMILY SHUFFLING BY A NOVEL PCR-BASED APPROACH
Qiuyan Wang 1, Huili Wu 1, Anming Wang 1, Pengfei Du 1, Xiaolin Pei 1, Haifeng Li 1, Xiaopu
Yin 1, Lifeng Huang 1, Xiaolong Xiong 2
Center for Biomedicine and Health 1, College of Life Sciences 2,
Hangzhou Normal University; Hangzhou 310012, P. R. China
Running head: TMGS-PCR based DNA family shuffling
Address correspondence to: Qiuyan Wang, Center for Biomedicine and Health, Hangzhou
Normal University, No. 222 Wenyi Road, Hangzhou 310012, Zhejiang, China. Fax:
(86)571-28865630, E-mail: qywanghznu@gmail.com.
DNA family shuffling is a powerful Directed evolution mimics the processes of

method for enzyme engineering, which Darwinian evolution in a test tube, combining
utilizes recombination of naturally occurring random mutagenesis and/or recombination with
functional diversity to accelerate screening or selection for enzyme variants that
laboratory-directed evolution. However, the have the desired properties (1). Directed
Downloaded from www.jbc.org by guest, on October 25, 2010

use of this technique has been hindered by the evolution through random mutagenesis of a
scarcity of family genes with the required single starting sequence can result in significant
level of sequence identity in the genome improvement toward the required function.
database. We describe here a strategy for However, this strategy has the disadvantage of
collecting metagenomic homologous genes for having to screen an extremely high number of
DNA shuffling from environmental samples candidates to find the rare positive mutants that
by truncated metagenomic gene specific PCR are needed as the genetic material for DNA
(TMGS-PCR). Using identified metagenomic shuffling (5). A more potent directed evolution
gene specific primers, twenty-three 921 bp strategy is known as DNA family shuffling; it
truncated lipase gene fragments, which refers to the recombination of equivalent genes
shared 64 to 99% identity with each other and from natural homologous families rather than
formed a distinct subfamily of lipases, were random mutagenesis of a single gene (6). This
retrieved from 60 metagenomic samples. approach takes advantage of the fact that most of
These lipase genes were shuffled, and selected the deleterious mutations have long ago been
active clones were characterized. The removed by natural selection, while diverse
chimeric clones show extensive functional and function has been created by the positive
genetic diversity, as demonstrated by mutation interchange. The reassortment of
functional characterization and sequence proven mutations yields a higher frequency of
analysis. Our results indicate that homologous functional progeny sequences, and because
sequences of genes captured by TMGS-PCR multi-gene shuffling starts with more than one
can be used as suitable genetic material for parental sequence, it accesses a broader range of
DNA family shuffling with broad applications progenitor combinations. These attributes make
in enzyme engineering. the approach more efficient, reducing
loss-of-function mutations dramatically so that
Many excellent examples of the application fewer progeny molecules need to be screened to
of directed evolution to a wide range of enzymes discover superior performers (7-9).
have clearly demonstrated its critical role in Although DNA family shuffling has the
tailoring enzymes for industrial applications(1-4). above-mentioned advantages over random
Copyright 2010 by The American Society for Biochemistry and Molecular Biology, Inc.
mutagenesis, there are only dozens of published environmental DNA (or so called
reports using it, unlike the thousands published ‘metagenomes’) have enabled us to mine
that use random mutagenesis. The modest microbial diversity, allowing us to access their
number of published cases of DNA family genomes (17-20). Two strategies are generally
shuffling is indicative of its limitation: reliance used to screen and identify novel genes from
on natural homologous gene materials. It is metagenomic libraries: function-based and
always frustrating searching for qualified sequence-based screening. In function-based
homologous sequences in the genome database. screening, clones expressing desired traits are
Recently, several techniques for the selected from libraries, and aspects of the
combinatorial generation of protein chimeras in molecular biology and biochemistry of the active
regions of low homology have been developed clones are analyzed. Conversely, sequence-based
that can circumvent homology dependence for a screening is not dependent on the expression of
DNA shuffling process. These techniques cloned genes in heterologous hosts. Generally, it
include non-homologous random recombination is based on the conserved DNA sequences of
(10), sequence homology-independent protein target genes. Hybridizations or PCRs are
recombination (11) and incremental truncation performed based on the deduced DNA consensus

for the creation of hybrid enzymes (ITCHY) (21).
(12-14). However, random non-homologous We report here the development of a
recombination results in the creation of libraries homologous gene collection method termed
containing a large number of out-of-frame or “truncated metagenomic gene specific PCR”
otherwise inactive clones. The theoretical size of (TMGS-PCR). This approach led to success in
such libraries renders them unsuitable for tapping into homologous sequences of enzymes
manual screening because a vast number of from environmental DNA with PCR with a
clones must be interrogated to identify highly designated metagenomic starting gene and
active enzymes (14). However, Castle et al. have experimentally validated truncated gene specific
provided an alternative and ultimately primers (Fig. 1). Sequences analysis and
complementary solution to this problem: functional characterization of the resulting
building synthetic libraries using a subset of shuffled library demonstrated that TMGS-PCR
diversity from an alignment (15). This method can provide homologous genes with great
has been proven to be successful for providing genetic diversity for DNA family shuffling.
sufficient genetic diversity for directed enzyme
evolution, even based on information from EXPERIMENTAL PROCEDURES
alignments with few genes of low homology.
Bacterial strains, plasmids and growth
However, the need to develop more alternatives
conditions - E. coli DH5α and pBluescript SK+
to collecting natural homologous genes
(Stratagene, La Jolla, CA, USA) were used as
efficiently for DNA family shuffling remains
host and vector, respectively, for cloning of the
urgent.
lipase. E. coli strain BL21-CodonPlus
In the past decade, advances in the field of
(DE3)-RIPL (Stratagene, La Jolla, CA, USA)
‘metagenomics’ have dramatically revised our
and pET-24a(+) (Novagen, Inc., Madison, WI)
view of biodiversity (16). Considering the
were used as host and vector, respectively, for
estimation that >99% of microorganisms in most
heterologous expression of the lipase. Plasmid
environments are not amenable to culturing, very
pMD18-T (TaKaRa Ltd., Otsu, Japan) was used
little is known about their genomes. The
for gene cloning and sequencing. E. coli cells
isolation, archiving and analysis of
were routinely grown in Luria–Bertani (LB)
2
medium at 37°C, supplemented with 100 µg/ml Xho I sites of pET-24a(+), creating
ampicillin, 50 µg/ml streptomycin sulfate, 10 pET-24a-LipS05. The recombined plasmid was
µg/ml tetracycline HCl and 12.5 µg/ml transformed into competent cells of E. coli strain
chloramphenicol, if needed. BL21-CodonPlus (DE3)-RIPL using the CaCl2
Library construction- A total of 60 method following standard protocols.
environmental samples were collected from Homologous gene amplification - Sets of
terrestrial microenvironments in China and were primers, which were assigned designations
named as S01-60. DNA extraction from samples comprising the first three letters Lip referring to
was carried out using the PowerSoil® DNA lipase and the sequence length (bp) of the
Isolation Kit (MOBIO, Carlsbad, CA, USA) product, were used to amplify related lipase
according to the manufacturer's instructions. fragments from enviromental DNA samples. The
Electrophoresis for DNA from envireomental product was amplified using a BIO-RAD DNA
samples was performed with a Bio-Rad CHEF Engine Peltier Thermal Cycler. The conditions
DR II system (Bio-Rad, Richmond, CA, USA). were as follows. Thirty cycles of reactions were
A metagenomic library of the S05 sample was performed under the conditions of denaturation
constructed using the approximately 40 kb at 94°C for 1 min, annealing at 58°C for 1 min,

end-repaired DNA fragments and and an extension step at 72°C for 1 min. This
TM
CopyControl Fosmid Library Production Kit sequence was repeated 35 times followed by a
(Epicentre Technologies, Madison, WI, USA) 10-min final extension step at 72°C. All PCR
using protocols provided by the manufacturer. products were cloned into the TA vector and
Screening and subcloning- For detection of confirmed by DNA sequencing.
lipolytic activity, transformed cells were spread Sequences analysis - The Lipase
immediately after electroporation on LB agar Engineering Database
containing 1% (w/v) tributyrin (Merck, (http://www.led.uni-stuttgart.de/) was
Darmstadt, Germany) and incubated overnight at searched using the BLASTP program (23).
37°C. After 3–5 days at room temperature or Multiple sequence alignments were constructed
4°C , colonies were monitored for the presence using the CLUSTAL-X program (24). To
of clear halos (22). Sau3A I partially digested determine the evolutionary relationship of
fosmid DNA from the positive clone was ligated enviromental Lipases with other lipases,
to BamHI-linearized and dephosphorylated phylogenetic analysis was performed with the
pBluescript SK+ vector, and the ligation mixture amino acid sequences of environmental lipases
was used to transform E. coli DH5α. The new and 50 lipases obtained from Lipase Engineering
clones were examined on the same type of Database using neighbor-joining methods with
indicator plates for lipase activity. A 1000 bootstrap replications performed by MEGA
tributyrin-positive clone was selected and version 4.0 (25).
sequenced. The gene encoding LipS05 was DNA shuffling - The chimeric library was
amplified by PCR using Lip24aF and Lip24aR constructed using a modified DNA shuffling
primers, method (26). Twenty-four ~1.3 kb DNA
5′-CGCCATATGATGGAATTTATTCCCCCT chimeric parent genes containing the LipS01-50
G-3′ and 5′- truncated fragments were amplified using
TAGCTCGAGTTTGGCACGCAGCG-3′, Lip24aF/R primers. The reaction mixture (50 µl)
which resulted in Nde I and Xho I sites at the 5′ contained 1 µl of Pfu polymerase (2.5 units/µl)
and 3′ ends, respectively. The digested PCR (Stratagene, La Jolla, CA), 1 µl of 20 µM each
fragment was inserted between the Nde I and primer, 30 ng of each plasmid DNA, 5 µl of
3
dNTP (2.5 mM each) and 5 µl of 10x Pfu buffer. and the incubation temperature was lowered to
The resulting parent DNA fragments were then 20°C for 20 h.
digested with bovine pancreas DNase I (TaKaRa Inclusion bodies were solubilized using the
Ltd., Otsu, Japan) as follows. A 25 µl volume of protein refolding kit (Novagen), according to the
solution containing 0.05 µg of each parent DNA manufacturer's protocol, with slight
was mixed with 6.25 µl of 0.2 M Tris-HCl (pH modifications. E. coli BL21 cells expressing
7.5), 0.25 µl of 10 mg/ml BSA and 0.25 µl of 0.1 lipases were harvested, resuspended in 0.1
M MnCl2. Digestion was initiated with the culture volume of wash buffer (20 mM Tris-HCl
addition of 2.5 µl of DNase I (0.5 unit/ml) into pH 7.5/10 mM EDTA/1% Triton X-100), and
the mixture at 37°C. Following incubation for 30 broken by sonication. After centrifugation, the
min, the digestion was stopped by adding 0.5 M pellet was washed twice with 0.1 culture
EDTA to a final concentration of 50 mM and volumes of wash buffer and then solubilized at
heat inactivating at 95°C for 10 min. The room temperature in solubilization buffer (0.1 M
digested fragments were separated by gel glycine, pH 11/0.3% N-lauroylsarcosine) at a
electrophoresis. The desired 100-200 bp DNA concentration of 5 to 10 mg/ml wet weight of the
fragments were isolated and purified using the

pelleted cell debris, including the inclusion
QIAquick gel extraction kit (Qiagen, Valencia, bodies. For refolding, the denatured protein was
CA). dialyzed twice (for 3 h and 12 h) against 50
Reassembly of the DNase I digested sample volumes of dialysis buffer (20 mM
fragments was conducted in a 50 µl reaction Tris-HCl, pH 8.5) supplemented with 0.1 mM
mixture containing 37 µl of fragment DNA(~3 DTT, twice again (3 h each) against dialysis
µg), 5 µl of 10x Pfu buffer, 8 µl of dNTP (2.5 buffer, and then once again (12 h) against 25
mM each) and 1 µl (2.5 units) of Pfu polymerase. sample volumes of dialysis buffer supplemented
A progressive hybridization PCR cycling with 1.0 mM reduced glutathione/0.2 mM
program was used for reassembly. The oxidized glutathione. All dialysis steps were
reassembled reaction mixture (1 µl) along with performed at 4°C.
primers Lip24aF/R was used to amplify the
Soluble protein was incubated with
full-length genes using PCR. Then the Nde
Ni-NTA agarose (Qiagen, Hilden, Germany) for
I-Xho I restriction fragment was inserted into
1 h at 4°C, and the mixture was loaded onto a
pET-24a(+) at the corresponding sites, and the
chromatography column. The column was
resulting plasmid was used to transform E. coli
washed with a buffer containing 100 mM
BL21-CodonPlus (DE3)RIPL by
sodium phosphate (pH 8.0), 200 mM sodium
electroporation.
chloride, and 10 mM imidazole. The N-terminal
Expression, refolding and purification - A single
His-tagged lipase was eluted from the column
colony of selected mutants was used to inoculate
with the same buffer containing 500 mM
5 mL LB, containing 50 µg/ml kanamycin, 50
imidazole. The protein was dialyzed overnight
µg/ml streptomycin sulfate, 10 µg/ml
against 20 mM sodium phosphate (pH 8.0) and
tetracycline HCl and 12.5 µg/ml
concentrated with polyethylene glycol (PEG)
chloramphenicol, and the culture was agitated at
20000. Then the purfied enzyme was stored at
225 rpm overnight at 37°C. The cells were
4°C and used within three days for enzyme
grown in 1 L of LB media at 37°C until
assays. The purified enzymes were assayed by
OD600 nm=0.8, then protein expression was
SDS-PAGE followed by Coomassie brilliant
induced using 0.5 mM
blue G-250 staining and the protein
isopropyl-β-D-thiogalactopyranoside (IPTG),
4
concentration was determined by the Bradford protein (LipS05) with a molecular mass of 48
method. kDa. A BLAST search in the Lipase Engineering
Activity assay - The time course of the Database (http://www.led.uni-stuttgart.de/)
esterase-catalyzed hydrolysis of p-nitrophenyl revealed that the LipS05 can be aligned to
alkanoate esters was followed by monitoring the several lipases around the catalytic site (E value
production of p-nitrophenyl at 405 nm in 1 cm <0.01). Those sequences showing the highest
pathlength cells with UV-Visible homology were from Rhodococcus sp.
spectrophotometer (Shimadzu Co., Kyoto, (gi|111021394|; identity, 25/76 32%; similarity,
Japan). In the standard assay, 20µl of 10 mM 39/76 51%), Coprinopsis cinerea (gi|169844727|;
p-nitrophenyl butyrate (pNPC4), p-nitrophenyl identity, 27/82 32%; similarity, 45/89 50%),
caprylate (pNPC8) or p-nitrophenyl laurate Streptomyces griseus (gi|182439565|; identity,
(pNPC12) solution was added to the reaction 19/58, 32%; similarity, 41/82 50%) and
system for a final concentration of 0.2 mM in 50 Cryptococcus neoformans (gi|134109957|;
mM phosphate buffer (pH 8.0) and incubated at identity, 25/89 28%; similarity, 48/89 53%).
25°C. Then, the reaction was started with the LipS05 and the homologous uncharacterized
lipases have a conserved active-site motif

addition of 20 µl of the enzymatic solution. The
background hydrolysis of the substrate was consisting of the pentapeptide GXSXG (Fig. 2B)
deducted with a reference sample of identical that is characteristic of lipases (28).
composition to the incubation mixture, except The LipS05 gene was subcloned into the
that lipase was omitted. One unit of enzymatic pET-24a(+) vector and overexpressed in E. coli
activity was defined as the amount of protein BL21-CodonPlus (DE3)-RIPL. The supernatant
releasing 1 µmol of p-nitrophenyl from of cell extract from E. coli harboring
p-nitrophenyl alkanoate esters per minute (27). pET-24a-LipS05 showed a significantly elevated
activity compared with that of E. coli harboring
Nucleotide sequence accession numbers -
pET-24a without the subcloned lipase sequence.
Nucleotide sequences described in this paper
Therefore, high-throughput functional screening
have been submitted to GenBank under the
can be applied to detect the phenotypic
following accession numbers: GU942683 to
variability of mutants of LipS05 using this
GU942706.
expression system (29).
RESULTS Recovery of homologous sequences - Since
there are no conserved domains in the LipS05
Isolation of starting lipase gene - To isolate
equence, only sequences that correspond to tens
the starting lipase gene from a metagenomic
of amino acids that allow alignment, it is not
sample, S05 environmental DNA library was
possible to design degenerate primers to isolate
constructed using a CopyControl fosmid library
related family genes from other environmental
production kit (Epicentre) according to the
samples as in traditional PCR-based
manufacturer’s instructions. This process was
metagenomic screening. Sets of primers specific
followed by the screening of the lipolytic
to arbitrary locations of the LipS05 sequence
activity on a tributyrin agar plate. A positive
were designed (Table 1). Initially, overlapping
clone showing the lipolytic activity among
primers targeted terminal to the LipS05
approximately 30,000 colonies was selected (Fig.
sequence were designed to amplify full length
2A). Sequence analysis of the short insert DNA
homologous genes from environmental DNA
obtained by subsequent subcloning experiments
samples. For full length Lip1311 primer sets, no
revealed the presence of one open reading frame
more than 3 specific amplification products of
consisting of 1,311 nucleotides, encoding a
5
the correct size (1311 bp) were generated from the starting gene LipS05 were chosen for use as
reactions for 60 templates. For the other two genetic materials for DNA family shuffling.
overlapping terminal primers, Lip1275 and Sequence analysis of LipS05 and subfamily
Lip1247, none of the products were observed by sequences - A multiple alignment was
gel analysis. Despite repeated attempts to constructed to illustrate conserved and variable
optimize PCR conditions, only nontarget sites within the retrieved lipase subfamily. The
amplification products were obtained for the classical signature of lipases, the catalytic site
above-mentioned primers. Therefore, sets of (GXSXG), was also conserved in the subfamily.
primers with a series of progressively longer The pairwise identities of the protein sequences
deletions were designed to collect more ranged from 64% to 99% (68–99% DNA
homologous genes. For the Lip1095 primer set, sequence identity). This high sequence identity
four specific amplification products with a size showed the close relationship between these
of 1095 bp were found under similar PCR proteins and indicated they can be considered to
conditions. While for Lip923 primer set, 20 belong to the same subfamily. In spite of this
specific amplification products was generated high pairwise identity between sequences, 141
under the same PCR condition. A total of 60 variable sites with different homology levels

inserts were sequenced from those 20 clone were also observed in multiple alignments (Fig.
libraries with inserts from at least 3 randomly 4). Therefore, a combination of 141 variable
selected clones sequenced from each library. sites with a minimum of two amino acids change
Twenty-three novel lipase gene fragments, with in one site would comprise at least 2141, or
conserved pentapeptide GXSXG and between 68 2.8×1042, different sequences. Although the vast
to 97% nucleotide identity to LipS05, were majority of the sequence space that can be
retrieved. Corresponding to soil sample name, explored will be contingent on the recombining
these lipase genes were named LipS01-1, capacity of DNA family shuffling, it can be
LipS01-2, LipS03, LipS05-2, LipS07, LipS08-1, assumed that the DNA family shuffling of those
LipS08-2, LipS09, LipS10, LipS11, LipS12, homologous genes will result in extensive
LipS13, LipS16, LipS17, LipS19, LipS30, genetic diversity.
LipS33, LipS48-1, LipS48-2, LipS49, LipS50-1, The phylogenetic tree constructed by the NJ
LipS50-2 and LipS51, respectively. To confirm method for the lipase family proteins is shown in
the correlation between the number of specific Fig. 5. All of the lipases from environmental
PCR products and the length of the truncated samples cluster closely together on the left,
fragments, the Lip481 primer set, which had while all of the known lipases cluster loosely
further truncation of sequence length, was together on the right. The lipases map together
applied in PCR screening of those environmental on a branch of the tree separate from those
DNA samples, which resulted in 16 specific known lipases and form a clearly distinct group.
amplification products under similar reaction The phylogenetic tree clearly indicates that
conditions. Optimized amplification reaction novel lipases form a well-supported clade (NJ
conditions and an increase in the number of bootstrap, 100%) that is loosely related to the
sequenced clones may result in more novel other known lipases (bootstrap, <50%).
sequences retrieved. As the isolated sequences Analysis of shuffled clones - To explore
were of the appropriate number and length for whether those homologous genes can be used as
following DNA family shuffling, no attempts suitable genetic materials for DNA family
were made to isolate more sequences. shuffling, we created a shuffled library with 24
Twenty-three 923 bp retrieved lipase genes and homologues and characterized a small number of
6
chimeric clones. The shuffled library of substrates, the value of its specific activity (56.2
chimeras was transformed into E. coli and U mg-1) was slightly higher (about 1.3 fold) than
screened for lipolytic activity on a tributyrin agar that of the other two mutants. For the long-chain
plate (Fig. 6A). About 10% of the transformants substrate pNPC12, the mutant C3 showed the
formed clearing halos, indicating expression of highest specific activity with 17.4 U mg-1, which
active enzyme. The size of the resulting clear was more than 2 fold more than that of the other
halos in active clones was uniform, indicating two mutants. To investigate the correlation
that phenotypic diversity was generated from the between structure and observed activity, these
chimeric library. To gain a deep insight into the three clones were sequenced. The sequence of
functional diversity of the chimeric mutants, the C3 is a chimera of LipS33, LipS11, LipS30,
three active clones with the largest halos were LipS11, LipS01-2, LipS08-2 and LipS11. The
overexpressed and purified to apparent E5 gene encodes a chimeric enzyme with
homogeneity. The clones’ specific activity on sequence elements originating from LipS01-2,
pNPC4, pNPC8 and pNPC12 were measured to LipS08-2, LipS30, LipS33, LipS11, LipS30,
demonstrate varied activity on short-, medium-, LipS08-2, LipS33, LipS13, LipS12, LipS13,
or long-chain p-nitrophenyl alkanoate esters LipS30 and LipS16. D5 contained sequence

substrates of the mutant enzymes. The results elements from LipS09, LipS48-2, LipS30,
demonstrated that the mutants exhibited a LipS13 and LipS30 (Fig. 6C). As such strikingly
distinct substrate specificity, with significantly high crossover frequency was observed in the
varied specific activity on different substrates selected clones, another 7 unselected chimeric
(Fig. 6B). The mutants C3 and E5 showed a clones in small library were sequenced to
preference for the short-chain pNPC4, while determine the average crossover frequency in the
mutant D5 had its maximal activity on the library. A per-gene average of 8 crossovers and
medium-chain pNPC8. Although both C3 and 3 spontaneous mutations were observed in
E5 exhibited maximal activity on pNPC4, an sequenced genes.
obvious difference in activity on pNPC4 and
pNPC8 was observed between them: for mutant DISCUSSION
C3, the activity on pNPC8 was 54% relative to
In the present study, a novel metagenomic
that of pNPC4, while mutant E5 demonstrated a
gene-specific strategy was chosen to retrieve
comparable activity on pNPC4 and pNPC8. All
related genes, in contrast to traditional
of the mutants indicated similar substrate
PCR-based metagenomic screening, which uses
specificity for the long-chain substrate pNPC12,
known gene information to design degenerate
with no more than 20% of maximal activity on
primers corresponding to conserved regions of
their preferred substrates. In addition to this
an enzyme family (21). Total DNA extracted
distinct substrate specificity, the mutants also
directly from environmental samples does not
had a definite difference in their specific activity
typically contain an even representation of the
on those substrates. For the short-chain substrate
population's genomes within the sample. (30).
pNPC4, the mutant C3 showed the highest
Therefore, the targeted homologous genes of
specific activity (87.4 U mg-1) among the
known sequences might be overshadowed by
mutants, which was 1.5 and 3.5 fold higher than
sequences from unknown dominant microbial
that of D5 and E5, respectively. For the
populations in environmental DNA, which would
medium-chain substrate pNPC8, although the
severely decrease the efficiency and specificity of
mutant D5 showed the maximal activity relative
gene amplification and affect the product yield
to its activity on the short- and long-chain
(18). In fact, LeCleir G et al. failed to retrieve
7
any chitinase sequences from environmental to shed light on a novel gene family (33-37).
samples using known sequences based PCR The functional diversity of chimeric
approach, despite evidence suggesting high phenotypes generated by DNA-family-shuffling
levels of chitinolytic activity in the sample (31). was evident in experiments assessing both
A previous study retrieved only one lipase activity on indicative plates and substrate
sequence from the environmental DNA of olive specificity of purified mutants. The varied size
oil percolation using known sequences based of the clear halos around the mutant clones
PCR approach, even with extensive analysis of indicated an apparently varied lipolytic activity.
conserved regions and careful primer design (32). In addition, characterization of the specific
In contrast, metagenomic gene-specific PCR activity on substrates with various chain lengths
uses an gene identified by functional screening provides definite evidence for the functional
of soil DNA library, instead of a known gene in diversity of chimeric clones. Such a property of
database as the startpoint for retrieving family diversity is expected and is indeed a hallmark of
genes, which has a inherent property of biasing successful libraries created by DNA family
for genomes of dominant microorganisms of soil shuffling (6,7). The crossover frequency of
DNA. This distinguishing feature make chimeric clones in this study is higher than that

metagenomic gene a good start point for reported for normal enzyme shuffling and lower
retrieving metagenomic homolgous family genes that that of a shuffled library created with
from soil DNA samples by PCR. It is worth methods for generating highly recombined genes,
noting that 20 bands of correct size and without such as RICHT (38). Given that we used the
any non-specific bands was observed in the gel same shuffling procedure as in normal enzyme
analysis of the products amplified from DNA of shuffling reports to create our chimeric library,
60 soil samples with metagenomic gene specific the high crossover frequency can be attributed to
Lip923 primer set (Fig. 3). That was achieved by the diversity of the genetic material we used. In
only one experiment without any attempts at fact, there are usually no more than 4
optimizing reaction conditions. Nevertheless, homologous genes of enzymes from
neither our method nor traditional metagenomic microorganisms used to shuffle in reports using
sequence-based PCR guarantees acquisition of DNA family shuffling (38). Additionally, in the
full-length family genes because identical 13,310 bases sequenced, 3 spontaneous
sequences are conserved in the interior regions mutations were found. This rate of 1 mutation
of subfamily genes rather than at terminal per 4,436 bases is well within the number of
regions (6). spontaneous mutations expected from the rounds
Phylogenetic analysis indicated that the of PCR used to generate the fragment and
metagenomic gene specific amplification template DNA and to amplify the chimeric
strategy offers a chance to access to a novel products. Further, no unexpected insertions,
subfamily that was previously unknown and deletions, or rearrangements were detected by
completely uncharacterized. Traditional DNA sequencing. These data suggest that the
degenerate primer-based PCR for metagenomic DNA shuffling procedure using genetic material
screening is based on the conserved DNA collected by TMGS-PCR can creates a fair
sequences of known family genes. Generally, the qualified chimerical library.
amino acid identities of newly discovered
sequences from a metagenome to those in the CONCLUSION
protein databases have been above 50% and
In conclusion, we have presented a novel
usually around 80–98%, which is rarely possible
TMGS-PCR approach with unique features that
8
distinguish it from the traditional PCR-based metagenomic homologous genes of microbial
metagenomic screening method. Functional enzymes for DNA family shuffling, which is an
analysis of the shuffled library, coupled with the excellent tool for the irrational design of
sequence information from the selected chimeric biocatalysts with tailored properties. Meanwhile,
progeny, indicated that this method can provide one can also use TMSG-PCR to generate
high-quality genetic materials for DNA family breeding stock and then use “synthetic
shuffling. We expect that TMGS-PCR could shuffling” to breed divergent sequences for
provide a new alternative to collecting efficient directed enzyme evolution.
REFERENCES
1. Arnold, F., and Volkov, A. (1999) Current opinion in chemical biology 3, 54-59
2. Tao, H., and Cornish, V. (2002) Current opinion in chemical biology 6, 858-864
3. Cherry, J., and Fidantsef, A. (2003) Current opinion in biotechnology 14, 438-443
4. Turner, N. (2009) Nature Chemical Biology 5, 567-573
5. Cohen, N., Abramov, S., Dror, Y., and Freeman, A. (2001) Trends in Biotechnology

19, 507-510
6. Ness, J. E., Welch, M., Giver, L., Bueno, M., Cherry, J. R., Borchert, T. V., Stemmer,
W. P., and Minshull, J. (1999) Nat Biotechnol 17, 893-896
7. Crameri, A., Raillard, S. A., Bermudez, E., and Stemmer, W. P. (1998) Nature 391,
288-291
8. Chang, C. C., Chen, T. T., Cox, B. W., Dawes, G. N., Stemmer, W. P., Punnonen, J.,
and Patten, P. A. (1999) Nat Biotechnol 17, 793-797
9. Christians, F. C., Scapozza, L., Crameri, A., Folkers, G., and Stemmer, W. P. (1999)
Nat Biotechnol 17, 259-264
10. Bittker, J., Le, B., Liu, J., and Liu, D. (2004) Proceedings of the National Academy of
Sciences 101, 7011
11. Sieber, V., Martinez, C., and Arnold, F. (2001) nature biotechnology 19, 456-460
12. Ostermeier, M., Nixon, A., Shim, J., and Benkovic, S. (1999) Proceedings of the
National Academy of Sciences 96, 3562
13. Lutz, S., Ostermeier, M., Moore, G., Maranas, C., and Benkovic, S. (2001)
Proceedings of the National Academy of Sciences of the United States of America 98,
11248
14. Griswold, K., Kawarasaki, Y., Ghoneim, N., Benkovic, S., Iverson, B., and Georgiou,
G. (2005) Proceedings of the National Academy of Sciences of the United States of
America 102, 10082
15. Castle, L. A., Siehl, D. L., Gorton, R., Patten, P. A., Chen, Y. H., Bertain, S., Cho, H.
J., Duck, N., Wong, J., Liu, D., and Lassner, M. W. (2004) Science 304, 1151-1154
16. Handelsman, J., Rondon, M. R., Brady, S. F., Clardy, J., and Goodman, R. M. (1998)
Chem Biol 5, R245-249
17. Ferrer, M., Martinez-Abarca, F., and Golyshin, P. N. (2005) Curr Opin Biotechnol 16,
588-593
18. Cowan, D., Meyer, Q., Stafford, W., Muyanga, S., Cameron, R., and Wittwer, P.
(2005) Trends Biotechnol 23, 321-329
9
19. Lorenz, P., and Eck, J. (2005) Nat Rev Microbiol 3, 510-516
20. Blow, N. (2008) Nature 453, 687-690
21. Yun, J., and Ryu, S. (2005) Microbial cell factories 4, 8
22. Schmidt, M., and Bornscheuer, U. (2005) Biomolecular engineering 22, 51-56
23. Fischer, M., and Pleiss, J. (2003) Nucleic acids research 31, 319
24. Thompson, J., Higgins, D., and Gibson, T. (1994) Nucleic acids research 22, 4673
25. Tamura, K., Dudley, J., Nei, M., and Kumar, S. (2007) Molecular biology and
evolution 24, 1596
26. Joern, J., Meinhold, P., and Arnold, F. (2002) Journal of molecular biology 316,
643-656
27. Ferrer, M., Plou, F., Nuero, O., Reyes, F., and Ballesteros, A. (2000) Journal of
Chemical Technology & Biotechnology 75, 569-576
28. Wong, H., and Schotz, M. (2002) The Journal of Lipid Research 43, 993
29. Moore, J., and Arnold, F. (1996) nature biotechnology 14, 458
30. Tringe, S., and Rubin, E. (2005) Nature Reviews Genetics 6, 805-814
31. LeCleir, G., Buchan, A., Maurer, J., Moran, M., and Hollibaugh, J. (2006)

Environmental microbiology 9, 197-205
32. Bell, P., Sunna, A., Gibbs, M., Curach, N., Nevalainen, H., and Bergquist, P. (2002)
Microbiology 148, 2283
33. Cottrell, M., Yu, L., and Kirchman, D. (2005) Applied and environmental
microbiology 71, 8506
34. Roh, C., Villatte, F., Kim, B., and Schmid, R. (2007) Letters in applied microbiology
44, 475-480
35. Chen, Z., Liu, Y., Wu, J., She, Q., Jiang, C., and Liu, S. (2007) Applied microbiology
and biotechnology 74, 688-698
36. Jiang, C., and Wu, B. (2007) Biochemical and Biophysical Research Communications
357, 421-426
37. Chae, J., Song, B., and Zylstra, G. (2008) FEMS microbiology letters 281, 203-209
38. Coco, W., Levinson, W., Crist, M., Hektor, H., Darzins, A., Pienkos, P., Squires, C.,
and Monticello, D. (2001) nature biotechnology 19, 354-359
FOOTNOTES
Acknowledgement: The work was supported by National Natural Science Foundation of

China (30900253, 20906016), Natural Science Foundation of Zhejiang Province (Y4080317).
The abbreviations used are: TMGS-PCR, truncated metagenomic gene specific
polymerase chain reaction; p-nitrophenyl butyrate (pNPC4); p-nitrophenyl caprylate (pNPC8);
p-nitrophenyl laurate (pNPC12).
10
FIGURE LEGENDS
Fig. 1. Schematic overview of homologous genes collected by truncated metagenomic

gene specific amplification. Initially, the metagenomic starting gene was isolated from
environmental samples by functional screening. Based on the sequence of this identified gene,
truncated gene specific primers were used to amplify homologous genes from different
environmental samples. The retrieved homologous family genes were subjected to shuffling
to generate the chimeric libraries for isolated clones with desired properties.
Fig. 2. Isolation of starting lipase from metagenomic library. (A) Detection of lipase on
the plates containing tributyrin. (B) Multiple alignment of the partial amino acid sequences
containing the conserved GxSxG motifs of LipS05 and other lipases. GI number and
organisms: gi|111021394|, Rhodococcus sp; gi|169844727|, Coprinopsis cinerea;
gi|182439565|, Streptomyces griseus; gi|134109957|, Cryptococcus neoformans. The putative
catalytic triad residues are shaded and denoted by arrowheads.
Fig. 3. Agarose gel electrophoresis of TMGS-PCR products amplified from DNA of 60
soil samples using Lip923F/R primer set. M indicates the 5.0 kb ladder lane.

Fig. 4. Amino acid sequence alignment of lipases. The alignment quality curve is
displayed below the alignment. Asterisks, colons and dots indicate completely, strongly and
weakly conserved positions, respectively.
Fig. 5. Phylogenetic analysis of lipase sequences (partial sequence, 307 aa). The data are
illustrated as a neighbor-joining tree based on 24 new lipases and 50 previously known
sequences from the Lipase Engineering Database. The name at the nodes is assigned
according to those in the Lipase Engineering Database and is shown for the major clades
only. Each geographic origin is represented by a grey dot.
Figure 6. Characterization of chimerical clones. (A) lipase activity screens. The chimeric
lipase library was plated to rich medium containing 1% tributyrin, grown one day, and
incubated for 5-7 additional days at 4°C. Transformants exhibiting diverse activity produce
clear variably-sized halos. (B) Specific activity on varied substrates for selected chimeric
mutants obtained by family shuffling. Data reported are the mean of at least two independent
experiments. (C) Sequence of selected chimaeric mutants obtained by family shuffling. The
segments derived from different enviromental lipases are shown as white bars and noted as
their name above the bars. Because of sequence identity in crossover, shown as black bars, the
exact derivation of the sequence cannot be determined exactly. The aminoacid point
mutations is indicated by carets (v) above the bars.
11
TABLE LEGENDS
Table 1 Primers used in environmental sample amplification.
12
13
Figure 1
14
Figure 2
15
Figure 3
Figure 4
. ***** ******* : : . ***: . *. **** ** : : * *: : : ** **: **: *: **. *: * ***: *******: *: **: : : * : : : *. *: . **: . *************: **. : * . * ********. ** **** : . . . * : * * **
LipS08-1 P YKE I I QE L R YNL C T S E DQP VP VF P F AYDWR L P L E I I E T QF AE F VE E VI DR T KL MAHYVNAGYVAHP T VNL I GHS MGGL I I T GYL DKKGR S AP VS KVAT L GT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L YHL L P AI QDAL E L DDP S L P T 160
LipS33 P YKE I I QE L R YNL C T S E DQP VP VF P F AYDWR L P L E I I E T QF AE F VE E VI DR T KL MAHYVNAGYVDHP T VNL I GHS MGGL I I T GYL DKKGR S AP VS KVAT L GT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L YHL L P AI QDAL E L DDP S L P T 160
LipS19 P YKE I I QE L R YNL C T S E DQP VP VF P F AYDWR L P L E I I E T QF AE F VE E VI DR T KL MAHYVS AGYVDHP T VNL I GHS MGGL I I T GYL DKKGR S AP VS KVAT L GT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L NHL L P AI QDAL GL DDP S L P T 160
LipS16 P YKE I I QE L R YNL C T S E DQP VP VF P F AYDWR L P L E I I E AQF AE F VE E VI DR T KL MAHYVS AGYVDHP T VNL I GHS MGGL I I T GYL DKKGR S AP VS KVAT L GT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L YHL L P AI QDAL E L DDP S L P T 160
LipS11 P YKE I I QE L R YNL C P S E DQP VP VYP F AYDWR L P L E I I E R QF AE F VE E VI DR T R L MAHYVQAGYI KR P T VNL VGHS MGGL I I T GYL DKKGR S AP VS KVVT L GT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L YHL L P E I QDAL E L DDP S L P N 160
LipS13 P YKE I I QE L R YNL C P S E DQP VP VF P F AYDWR L P L E I I E E QF AE F VDE VI DR T KL MAHYVQAGYT E HP T VNL I GHS MGGL I I T GYL DKKGKS AP VS KVVT MGT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L YHL L P AI QDAL E L DDP S L P T 160
LipS30 P YKE I I QE L R YNL C S S E DQP VP VF P F AYDWR L P L E I I E KQF AE F VDE VI DR T KL MVHYVQAGYAE HP T VNL I GHS MGGL I I T GYL DKKGKS AP VS KVVT MGT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L YHL L P AL E NAL E L DDP S L P T 160
LipS48-1 P YKE I I QE L R YNL C ANE E QP VP VF P F AYDWR L P L AI L E KQL ADF VE E VI AR T QL MS HYVE AGYR E NP R VNL I GHS MGGL L I T GYL DR KGKS AR VAKVVT L GT P YKGS F E AVI KI AT GT ANL GADS P NS R E R E AAR L T S S L YHL L P AI KGAL E L DP P E L P D 160
LipS08-2 P YKE I I QE L R YNL C P S E E QP VP VF AF AYDWR L P L E I I E R QF S DF VE E VI AR T KL I AHYVE T GYVE NP R VNL I GHS MGGL I I AGYL DKKGKS AP VS R VVT L AT P YKGS F E AVI KI AT GT ANL GS DT P NS R E R E AAR L T S S L YHL L P S I E DAL E VDDP AL P T 160
LipS17 P YKE I I QE L R YNL C P S E DQP VP VF P F AYDWR L P L E I I E KQF S DF VDE VI AR T R L VGHYVE S GF L E NP KVNL I GHS MGGL I I T GYL DKKGKT AP VS KVVT L AT P YKGS F E AVI KI AT GT ANL GS DAP NS R E R E AAR L T S S L YHL L P T I S DAL E VDDP E L P T 160
LipS50-1 P YKE I I QE L R YNL C P S NDQP VP VF P F AYDWR L P L E I I E R QF S DF VAE VI DR T KL I NHYVE KGYVE NP KVNL I GHS MGGL I I T GYL DKKGKS AP VS KVVT L AT P F HGS F E AVI KI AT GT ANL GS DP P NS R E R E AAR L T S S L YHL L P AI KNAL E I DDP KL P A 160
LipS09 P YKE I I QE L R YNL C P S E DHL VP VF P F AYDWR L P L GI I E KQF S NF VE E VI DR T KL I GHYVE AGYVE NP T VNL I GHS MGGL I I T GYL DKKGKAAP VS KVVT L AT P YE GS F E AVI KI AT GT ANL GS DQP NS R E R E AAR L T S S L YHL L P AI KDAL E VDDP T L P A 160
LipS05 P YKE I I QE L R YNL C HDE DQQVP VF P F AYDWR L P L E I I E KQF S DF VDE VI DR T KL I R HYVE AGYVNNP KVNL I GHS MGGL I I T GYL DKKGT S AP VS KVVT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI QDAL E VDDP KL P T 160
LipS05-2 P YKE I I QE L R YNL C HDE DQQVP VF P F AYDWR L P L E I I E KQF S DF VDE VI DR T KL I R HYVE AGYVDNP KVNL I GHS MGGL I I T GYL DKKGT S AP VS KVVT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI QDAL E VDDP KL P T 160
LipS07 S YKE I I QE L R YNL C HDE DQQVP VF P F AYDWR L P L E I I E R QF S DF VDE VI DR T KL I AHYVE AGYVDHP T VNL I GHS MGGL I I T GYL DKKGT S AP VAKVAT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI QDAL E VDDP NL P T 160
LipS01-1 P YKE I I QE L R YNL C HDE DQQVP VF P F AYDWR L P L E I I E R QF S DF VDE VI DR T R L I S HYVE AGYVQNP KVNL I GHS MGGL I I AGYL DT KKDS AR VAKVAT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI QDAL E I DDP NL P T 160
LipS01-2 P YKE I I QE L R YNL C T NE DQQVP VF P F AYDWR L P L DI I E KQF S DF VDE VI DR T KL I S HYVE AGF VQNP KVNL I GHS MGGL I I AGYL DT KKDS AR VAKVAT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI QDAL E I DDP NL P T 160
LipS10 P YKE I I QE L R YNL C T YE DQQVP VF P F AYDWR L P L DI I E KQF S DF VDE VI DR T KL I AHYVE AGYVE NP KVNL I GHS MGGL I I AGYL DT KKE S AQVAKVAT L AT P YKGS F E AVI KI AT GT S NL S S DT S NS R E R E AAR L T S S L YHL L P AI QDAL E VDDP NL P T 160
LipS51 P YKE I I QE L R YNL C T NE DQL VP VF P F AYDWR L P L E I I E KQF S DF VDE VI DR T KL I T HYVE AGYVE NP KVNL I GHS MGGL I I AGYL DT KKDS AR VAKVAT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI QDAL E VDDP NL P R 160
LipS49 P YKE I I QE L R YNL C T NE DE QVP VF P F AYDWR L P L DI I E KQF S DF VDE VI NR T R L I S HYVE AGYVE NP KVNL I GHS MGGL I I AGYL DT KKDS AR VAKVAT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI QDAL E VDDP S L P K 160
LipS03 P YKE I I QE L R YNL C HDE DKQVP VF P F AYDWR L P L E I I E R QF S DF VDE VI DR T KL I S HYVE AGYVE NP KVNL I GHS MGGL I I AGYL DT KKDS AR VAKVAT L AT P YKGS F E AVI KI AT GT S NL GS DT S NS R E R E AAR L T S S L YHL L P AI KGAL E I DDP KL P N 160
LipS12 P YKE I I QE L R YNL C T DE NHQVP VYP F AYDWR L P L DMI E KQF S DF VDE VVE R T KL I S HYVE DGYVDHP T VNL I GHS MGGL I I AGYL DT KGKS AP I S KVAT L AT P YKGS F E AVI KI AT GT S NL GADAS NS S E R E AAR L T S S L YHL L P AI QDAL E VDDP KL P T 160
LipS48-2 P YKE I I QE L R YNL C P S E DE P VP VF P F AYDWR L P L E I I E KQF S DF VE E VI DR T KL I R HYAL AGYAANP KVNL I GHS MGGL I I T GYVDKKGKS AP VS KVAT L AT P YHGS F E AVI KI AT GT ANL GS DT S QS R E R E AAR L T AS L YHL L P DI KNAL E VDDP KL P L 160
LipS50-2 P YKE I I QE L R YNL R P KE DKP VP VF P F GYDWR QP L DL I E AQF ADF VDE VI AR T KL L R HYAE AGYADDP KVNL VGHS MGGL I I AGYL E R I GKS AP VAKVAT L AT P YR GS F E AVI KI AT GT ANL GT DE P T S R E R E AAR L T P S L YHL L P DI E GGL E I DDP KL P K 160
ruler 1. . . . . . . 10. . . . . . . . 20. . . . . . . . 30. . . . . . . . 40. . . . . . . . 50. . . . . . . . 60. . . . . . . . 70. . . . . . . . 80. . . . . . . . 90. . . . . . . 100. . . . . . . 110. . . . . . . 120. . . . . . . 130. . . . . . . 140. . . . . . . 150. . . . . . . 160
. : *. . . *** . : : *: : *: * . : ** ** : *: . . : * *: : * : . : * . ***** **: * ***: : : : *. **: ***. *. : * * *: . . * ******* : : * * * ***: **********

LipS08-1 S F F NP GL WQL S I I AS VL E YVR T QMT F I T E QE - - - QKAQAL F L R F L E AAQAYR T R I DNF R L S R T R L KAADWL C VVGVNS E T R VR MR I T KT E R GP MF DL S S KYR QNL WR KNAE NQAE WR L T GDGT VP F E AAL P NF L AL E NI VC VT P E DYGYW 307
LipS33 S F F NP GL WQL S I I AS VL E YVR T QMT F I T E QE - - - QKAQAL F L R F L E AAQAYR T R I DNF R L S R T R L KAADWL C VVGVNS E T R VR MR I T KT E R GP MF DL S S KYR QNL WR KNAE NQAE WR L T GDGT VP F E AAL P NF L AL E NI VC VT P E DYGYW 307
LipS19 S F F NP GL WQL S I I AS VL E YVR T QMT F I T E QE - - - QKAQAL F L R F L E AAQAYR T R I DNF R F S R T R L KAADWL C VVGVNS E T R VR MR I T KT E R GP MF DL S S KYR QNL WR KNAE NQAE WR L T GDGT VP F E AAL P NF L AL E NI VC VT P E DYGYW 307
LipS16 S F F NP GL WQL S I I AS VL E YVR T QMT F I T E QE - - - QKAQAL F L R F L E AAQAYR T R I DNF R F S R T R L KAADWL C VVGVNS E T R VR MR I T KT E R GP MF DL S S KYR QNL WR KNAE NQAE WR L T GDGT VP F E AAL P NF L AL E NI VC VT P E DYGYW 307
LipS11 S F F NP GL WQL S I VAS VME YVR T QMAF I T E KE - - - QKAQAL F L R F L E AAE AYR S R I DNF R L S T T R L KAADWL C VVGVNS E T R VR MR VT KT E R GP MF DL GS KYR L NL WR KNP E NQAE WR L T GDGT VP F E AAL P NF L AP E NI VC VT P E DYGYW 307
LipS13 S F F NP GL WQL S VVAS VL E YVR T QMT F I I E QE - - - QKAQAL F L R F L E AAQAYR T R I DNF R L S KT KL R AE DWL C VVGVNS E T R VR MR I AKT E R GP L F DL S S KYR L NL WR KNP E HE ME WR L T GDGT VP F E AAL P NF L KL E NI VC VT P E DYGYW 307
LipS30 S F F S P GL WQL S VVAS VL E YVR T QMT F I T E QE - - - QKAQAL F L QF L E AAQVYR T R I DNF R L S KT NL KAGDWL C VVGVNS E T R VR MR I AKT E R GP L F DL S S KYR L NL WR KNP E NE ME WR L T GDGT VP F E AAL P DF L KL E NI VC VT P E DYGYW 307
LipS48-1 NF F DP AL WQL S I VAS VL AYVQKQMS F I S E QD- - - QKAQE L F I R F L QAGE AYR NR I DR F R L T KT NL T AADWL C VVGVDS E T R VR MR I AKT QR GP MF DL S S KYR L NL WR KNL E NP AE WR L T GDGT VP F E AAL P NF L DL E NI L C VT P E DYGYW 307
LipS08-2 S L F DP AL WQL S VVAS VL AYVQR QMAF L T DQN- - - QKAQE L F AR F L E AAR AYR T R I DKF R L T KT NL KP E DWL C VVGVDS E T R VR MR I AMT E R GP MF DL S S KYR L NL WR QDP QNQAE WR WT GDGT VP F E AAL P NF L KL E NI VC VT P E DYGYW 307
LipS17 NL F DP AL WQL S VVAS VL AYVQR QMAF I T NQN- - - QKAQE L F T R F L E AAQAYR NR I DKF R L T R T AL QP E DWL C VAGVNS E T R VR MR I AKT E R GP L F DL S S KYR L NL WR KNP GNP AE WR L T GDGT VP F E AAVP NF L KP E NI VC VT P E DYGYW 307
LipS50-1 NL F DP AL WQL S VL AS VL AYVQAQMAF VS E KE - - - QKAQE L F VR F L E AAQAYR NR I DKF R L T KT GL E P E DWL C VVGVNS E T R VR MR I T S T E R GP MF DL S S KYR L NR WR S DL ANP AE WR L T GDGT VP F E AAL P NS L KL E NI VC VT P E DYGYW 307
LipS09 NL F DP AL WQL S VVAS VL AYVQR QMAF L T DHD- - - QKAQE L F AR F L KAAQAYR NR L DKF R L S KT NL KP E DWL C VVGVNS E T R VR MR VQR T E R GP L F DL S S KYR L NR WKS DL ANP ME WR L T GDGT VP F E AAL P NF L E L E NI I C VT P E DYGYW 307
LipS05 S L F DP AL WQS S VVAS VL AYVQKQMAF I AE QN- - - QKAQAL F AR F L E AAQAYR AR I DQF QL S KT NL KP E DWL C VAGVNS E T R VR MR I T S T E R GP L F DL S S KYR L NL WR KNKQDQS KWGL T GDGT VP F E AAL P NF L KP E NI VC VT P E DYGYW 307
LipS05-2 S L F DP AL WQS S VVAS VL AYVQKQMAF I AE QN- - - QKAQAL F AR F L E AAQAYR AR I DQF QL S KT NL KP E DWL C VAGVNS E T R VR MR I T S T E R GP L F DL S S KYR L NL WR KNKQDQS KWGL T GDGT VP F E AAL P NF L KP E NI VC VT P E DYGYW 307
LipS07 S L F DP AL WQS S VVAS VL AYVQKQMAF I T E QD- - - QKAQE L F AR F L E AAQAYR AR I DQF QL S KT NL KP E DWL C VAGVNS E T R VR MR I T R T E R GP L F DL S S KYR L NL WR KNKKNP T E WQL T GDGT VP F E AAL P NF L KP E NI VC VT P E DYGYW 307
LipS01-1 S L F DP AL WQS S I L AS VL AYVQKQMAF I KE QDE QT QKAQAL F AKF L E AAQAYR AR I DQF QL S T T NL KP E DWL C VAGVNS E T R VR MR I S R T E R GP L F DL S S KYR L NL WR KNKKNP T E WGL T GDGT VP F E AAL P NF L KP E NI VC VT P E DYGYW 310
LipS01-2 S L F DP AL WQS S I L AS VL AYVQKQMAYI KE QDE QT QKAQAL F AKF L E AAQAYR AR I DQF QL S T T NL KP ADWL C VAGVNS E T R VR MR I S R T E R GP L F DL S S KYR L NL WR KNKKNP T AWGL T GDGT VP F E AAL P NF L KP E NI VC VT P E DYGYW 310
LipS10 S L F DP AL WQS S I L AS VL AYVQKQMAS I KE QDE QT QKAQAL F T KF L E AAR AYR AR I DQF QL S P T NL KP E DWL C VAGVNS E T R VR MR I T R T E R GP L F DL S S KYR L NL WR KNKANP S DWGL T GDGT VP F E AAL P NF L KP E NI VC VT P E DYGYW 310
LipS51 S L F DP AL WQS S I VAS VL AYVQKQMAF I KE QDQQT QKAQT L F AQF L E AAQAYR T R I DQF QL T E T NL QP E DWL C VAGVNS E T R VR MR I S R T E R GP L F DL S S KHR L NL WR KNNVNP T E WKL T GDGT VP F E AAQP T F L KP E NI VC VT P E DYGYW 310
LipS49 S L F DP AL WQS S I L AS VL AYVQKQMAF I KE QDQQT QKAQAL F AKF L E AAQAYR T R I DQF QL P E T NL QP KDWL C VAGVNS DT R VR MR I T R T E R GP L F DL S S KYR L NL WR KNKQNP T E WGL T GDGT VP F E AAL P NF L QP E NI VC VT P E DYGYW 310
LipS03 S L F DP GL WQT S VI AS VL AYVQKQMAF I T KQE - - - QKAQE L F T KF L E VAQAYR AR I DKF KL S KT NL NS DDWL C VVGVNS E T R VR MR I T S T DHGP L F DL S S KYR L NL WR KNAGNQT E WGL T GDGT VP F E AAL P NF L KP E NI VC VT P E DYGYW 307
LipS12 S L F NP AL WQS S I I AS VL AYVQQQMAF I KGQDE KT QKAQAL F AQF L E R AQMYR S R I DQF QL S KT NL T ADDWL C VVGVNS QT R VR MR I S S T DS GP L F DL S S KYR L NL WR NKDYS P E QWR WT GDGT VP YE AAL P NF L GP E NI VC VT P E DYGYW 310
LipS48-2 S L F DP AL WQAS VI AS VL AYVQKQMAVI E QE S - - - - KAQAL F AR F L E AAQAYR T R I DNF DL S T T NL QS GDWL C VMGVNS E T R VR MR I T S T DR GP L F DL GS KYR L NL WR KKS E AE ANWR L T GDGT VP F E AAL P NF L GL E NI I C VT P E DYGYW 306
LipS50-2 S L F DAGL WQR GI I DS I T E F VR L QAVS KI DR P - - - QQAQDL F NS L L QDAQAHR KR VE S F R L S KAGL R P E DWL C VAGVDS VT R VKL KI T KT S R GP MF DL GS DYR QNL WR KDP R DQT DWHL T GDGT VP L DS AVP T F L KP E NI VC VT P E DYGYW 307
ruler . . . . . . . 170. . . . . . . 180. . . . . . . 190. . . . . . . 200. . . . . . . 210. . . . . . . 220. . . . . . . 230. . . . . . . 240. . . . . . . 250. . . . . . . 260. . . . . . . 270. . . . . . . 280. . . . . . . 290. . . . . . . 300. . . . . . . 310
16

17
Figure 5
18
Figure 6
Table 1
Location Size of No. of
Tm
Name Sequence(5’→3) within Product amplification
(°C)
gene (bp) products
Lip1311F ATGGAATTTATTCCCCCTGTCATCTTTGTG 62.03 1-30 1311 0
Lip1311R TTATTTGGCACGCAGCGGCTTGAC 63.69 1287-1311 1311 0
Lip1275F TCATCTTTGTGCCGGGCATCACC 63.73 20-42 1275 0
Lip1275R GCTTGACCGCCGGATCCCATTT 63.8 1273-1294 1275 0
Lip1247F GCATCACCGGGACGTATTTGCGA 63.73 35-57 1247 0
Lip1247R ATCCCATTTTTCAGCCGTGACGC 61.96 1259-1280 1247 0
Lip1095F CGTGTGGCGCTTCACCCAGATG 65.66 115-136 1095 4
Lip1095R GAAGTGACGGACAATGAGTCGATGC 63.62 1185-1209 1095 4
Lip923F CCCTACAAGGAAATCATTCAGGAACTGC 63.48 196-223 923 20
Lip923R TCCCAGTATCCATAATCCTCAGGTGTCAC 64.85 1090-1118 923 20
Lip481F CATTCGATGGGCGGATTGATCATCACC 65.04 415-441 481 16
Lip481R TAACGCCTGCTACGCACAGCCAGTCTTCT 67.68 867-896 481 16
19

Genetic Engggggg

Hochgeladen von

Dokumentinformationen

Originalbeschreibung:

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Genetic Engggggg

Hochgeladen von

Copyright:

Verfügbare Formate

JBC Papers in Press. Published on October 20, 2010 as Manuscript M110.

PROSPECTING METAGENOMIC ENZYME SUBFAMILY GENES FOR DNA

DNA family shuffling is a powerful Directed evolution mimics the processes of

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Acknowledgement: The work was supported by National Natural Science Foundation of

Fig. 1. Schematic overview of homologous genes collected by truncated metagenomic

Downloaded from www.jbc.org by guest, on October 25, 2010

Table 1 Primers used in environmental sample amplification.

Downloaded from www.jbc.org by guest, on October 25, 2010

. : *. . . *** . : : : : : * . : : : . . : : : : . : * . *** : * ***: : : : . : . . : * : . . ******* : : * * * *: ********

Downloaded from www.jbc.org by guest, on October 25, 2010

Downloaded from www.jbc.org by guest, on October 25, 2010

Das könnte Ihnen auch gefallen