Beruflich Dokumente
Kultur Dokumente
Graham F. Hatfull1
Molecular Genetics
of Mycobacteriophages
INTRODUCTION
Mycobacteriophages are viruses that infect mycobacterial hosts including Mycobacterium smegmatis and
Mycobacterium tuberculosis. The first mycobacteriophages were isolated in the late 1940s using M. smegmatis as a host (1, 2), followed by isolation of phages
that infect M. tuberculosis (3). The application of
phages with distinct host preferences to typing clinical
mycobacterial isolates was recognized, and numerous
studies on mycobacteriophage typing were published
over the subsequent 30 years (412). In the 1950s a variety of further investigations focusing on the biology
of these phages and their potential applications were
initiated including studies on generalized transduction
(13), viral morphology (14, 15), lysogeny (1620),
transfection of phage DNA (21, 22), and other biochemical features (2340). These early contributions
provided a critical foundation for the further characterization and application of mycobacteriophages to tuberculosis research that emerged from them.
Prior to the mid-1990s, the lack of methods for
efficient and reproducible introduction of DNA into
mycobacteriacoupled with the lack of simple plasmid vectorsrepresented substantial impediments to
the development of facile systems for genetic manipulation of M. tuberculosis and other mycobacteria (41).
81
82
and applications for tuberculosis genetics. The focus
will primarily be on recent developments, and there are
a number of other reviews that the reader may find useful (18, 59, 60, 6278).
GENERAL ASPECTS
OF MYCOBACTERIOPHAGES
Mycobacteriophage Isolation
Bacteriophages can typically be isolated from any environment in which their bacterial hostor close
relatives of their hostis present. There is no obvious
reservoir of M. tuberculosis outside of its human host,
but mycobacteriophages have been isolated from various patient samples including stool (7981). However,
because there are numerous saprophytic mycobacterial
relatives, mycobacteriophages can be readily isolated
from environmental samples such as soil or compost,
using M. smegmatis or other nonpathogenic mycobacteria as hosts; a subset of these phages also infect
M. tuberculosis (discussed further below). Phages can
also be isolated by release from lysogenic host bacteria
(82, 83), and phage genomic sequences can be identified in sequenced mycobacterial genomes (8388).
M. smegmatis has proven to be a useful surrogate
for phage isolation, using either direct plating from environmental samples or by enrichment, in which the
sample is first incubated with M. smegmatis to promote
amplification of phages for that host. Typically, direct
plating yields only a small number of plaques from
<10% of the samples that are tested, whereas enrichment generates plaques from a higher proportion of
samples, and the phage titers can be greater than 109
plaque forming units (pfu)/ml. Although enrichment
might be anticipated to reduce the diversity of the
phages isolated due to the potential growth advantages
that a subset of phages might enjoy under these conditions, there is little evidence to support this with the
mycobacteriophages. However, it does appear to alter
the prevalence with which different phage types are recovered. For example, phages with one particular genome type (Cluster A; see below for details) represent
over 40% of the phages recovered after enrichment,
whereas they are only 25% isolated by direct plating.
However, phages of a second genome type (Cluster B)
are more prevalently recovered by direct plating (27%)
than by enrichment (13%). But both of these general
phage types themselves encompass considerable diversity and can be divided into several subtypes (i.e.,
subclusters), and enrichment also influences the abundance of particular subclusters. For example, phages of
83
84
impact on science education is indicated by numerous
parameters including student surveys such as the Classroom Undergraduate Research Experience (CURE) instrument (91).
Successful implementation of the PHIRE and SEAPHAGES programs is predicated on a suite of features
that characterize phage discovery and genomics. The
enormous diversity of the phage population stacks the
deck in favor of individual phage hunters isolating a
phage with novel features, in some instances being
completely unrelated to previously sequenced phages.
This, however, is only apparent from the genomic characterization, because little or nothing can be learned
about phage diversity and evolution without the genomic sequence. Students name their phages following
a nonsystematic nomenclature that reflects the individualistic aspects of the phage genomes (see below). With
over 500 completely sequenced genomes of phages isolated on M. smegmatis mc2155, the prospects of isolating a phage with a completely novel DNA sequence is
greatly diminished, although it is still extremely rare to
find two phages with identical or near-identical genomes. However, the diversity of phages isolated using
otherbut closely relatedhosts is likely to be just as
great, and it is easy to envisage that large collections of
sequenced phage genomes using numerous hosts within
the Actinomycetales are likely to substantially advance
our understanding of viral evolution.
Seven additional attributes of the PHIRE program
have been described that contribute to its productivity
(55, 56). These are (i) technical simplicity in beginning
the project, (ii) the lack of requirement for prior advanced conceptual understanding, (iii) flexibility in timing and implementation, (iv) multiple milestones for
success, (v) a parallel project design with strong peermentoring opportunities and the potential to engage
large numbers of students, (vi) authentic research leading to peer-reviewed publications, and (vii) project ownership that powerfully engages students in their own
individual contributions. These attributes should be
transferable to other research and education platforms,
and there are several excellent examples (9295).
MYCOBACTERIOPHAGE GENOMICS
At the time of writing, there are a total of 285 complete
mycobacteriophage genomes available in GenBank,
and we will focus on these in this chapter (Table 1).
The total number of sequenced mycobacteriophages is
largercurrently 531 (www.phagesdb.org)and the
outstanding genomes will be available in GenBank
pending completion and review of the annotations. Of
85
Phage
Aeneas
Bethlehem
BillKnuckles
BPBiebs31
Bruns
Bxb1
DD5
Doom
Dreamboat
Euphoria
Jasper
JC27
KBG
KSSJEB
Kugel
Lesedi
Lockley
Marcell
MrGordo
Museum
PattyP
Perseus
RidgeCB
SarFire
SkiPole
Solon
Switzer
Trouble
U2
Violet
Che12
D29
L5
Odin
Pukovnik
RedRock
Trixie
Turbido
Bxz2
HelDan
JHC117
Jobu08
Methuselah
Microwolf
Rockstar
Vix
Arturo
Backyardigan
Dhanush
Eagle
Flux
ICleared
LHTSCC
Cluster
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A1
A2
A2
A2
A2
A2
A2
A2
A2
A3
A3
A3
A3
A3
A3
A3
A3
A4
A4
A4
A4
A4
A4
A4
Accession #
JQ809703
AY500153
JN699000
JF957057
JN698998
AF271693
EU744252
JN153085
JN660814
JN153086
EU744251
JF937099
EU744248
JF937110
JN699016
JF937100
EU744249
JX307705
JN020140
JF937103
KC661273
JN572689
JN398369
KF024726
GU247132
EU826470
JF937108
KF024724
AY500152
JN687951
DQ398043
AF022214
Z18946
KF017927
EU744250
GU339467
JN408461
JN408460
AY129332
JF957058
JF704098
KC661281
KC661272
JF704101
JF704111
JF704114
JX307702
JF704093
KC661271
HM152766.1
JQ809701
JQ896627
JN699015
Length (bp)
GC %
53684
52250
51821
53171
53003
50550
51621
51421
51083
53597
50968
52169
53572
51381
52379
50486
51478
49186
50988
51426
52057
53142
50844
53701
53137
49487
52298
52102
51277
52481
52047
49136
52297
52807
52892
53332
53526
53169
50913
50364
50877
50679
50891
50864
47780
50963
51500
51308
51373
51436
51370
51440
51813
63.6
63.2
63.4
63.4
63.6
63.6
63.4
63.8
63.9
63.7
63.7
63.6
63.6
63.6
63.8
63.8
63.4
64.0
63.8
63.6
63.6
63.7
64.0
63.8
63.6
63.8
63.8
63.6
63.7
63.8
62.9
63.5
62.3
62.3
63.3
64.5
64.5
63.3
64.2
64.0
64.0
64.0
64.2
64.0
64.3
64.0
64.1
63.7
63.9
63.9
63.9
63.9
63.9
86
Table 1
Phage
Medusa
MeeZee
Peaches
Sabertooth
Shaka
TiroTheta9
Wile
Airmid
Benedict
Cuco
ElTiger69
George
LittleCherry
Tiger
Blue7
DaVinci
EricB
Gladiator
Hammer
Jeffabunny
HINdeR
Timshel
Astro
Saintus
Alma
PackMan
Goose
Rebeuca
Severus
Twister
ABU
Chah
Colbert
Fang
Harvey
Hertubise
IsaacEli
JacAttac
Kikipoo
KLucky39
Morgushi
Murdoc
Newman
Oline
Oosterbaan
Orion
OSmaximus
PG1
Phipps
Puhltonio
Scoot17C
SDcharge11
Serendipity
Cluster
A4
A4
A4
A4
A4
A4
A4
A5
A5
A5
A5
A5
A5
A5
A6
A6
A6
A6
A6
A6
A7
A7
A8
A8
A9
A9
A10
A10
A10
A10
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
B1
Accession #
KF024733
JN243856
GQ303263.1
JX307703
JF792674
JN561150
JN243857
JN083853
JN083852
JN408459
JX042578
JF704107
KF017001
JQ684677
JN698999
JF937092
JN049605
JF704097
JF937094
JN699019
KC661275
JF957060
JX015524.1
JN831654
JN699005
JF704110
JX307704
JX411619
KC661279
JQ512844
JF704091
FJ174694
GQ303259.1
GU247133
JF937095
JF937097
JN698990
JN698989
JN699017
JF704099
JN638753
JN638752
KC691258
JN192463
JF704109
DQ398046
JN006064
AF547430
JF704102
GQ303264.1
GU247134
KC661274
JN006063
Length (bp)
GC %
51384
51368
51376
51377
51369
51367
51308
51241
51083
50965
51505
51578
50690
50332
52288
51547
51702
52213
51889
48963
52617
53278
52494
49228
53177
51339
50645
51235
49894
51094
68850
68450
67774
68569
68193
68675
68839
68311
68839
68138
68307
68600
68598
68720
68735
68427
69118
68999
68293
68323
68432
67702
68804
63.9
63.9
63.9
63.9
63.9
63.9
63.7
60.0
59.8
60.9
59.8
61.0
60.9
60.7
61.4
61.5
61.5
61.4
61.3
61.6
62.8
63.1
61.4
61.2
62.5
62.6
65.1
65.1
64.4
65.0
66.5
66.5
66.5
66.5
66.5
66.4
66.5
66.5
66.5
66.5
66.4
66.4
66.5
66.4
66.5
66.5
66.3
66.5
66.5
66.4
66.5
66.5
66.5
87
(Continued)
Phage
ShiVal
TallGrassMM
Thora
ThreeOh3D2
UncleHowie
Vista
Vortex
Yoshand
Arbiter
Ares
Hedgerow
Qyrzula
Rosebush
Akoma
Athena
Daisy
Gadjet
Kamiyu
Phaedrus
Phlyer
Pipefish
ChrisnMich
Cooper
KayaCho
Nigel
Stinger
Zemanar
Acadian
Reprobate
Alice
ArcherS7
Astraea
Ava3
Breeniome
Bxz1
Cali
Catera
Dandelion
Drazdys
ET08
Ghost
Gizmo
LinStu
LRRHood
MoMoMixon
Nappy
Pio
Pleione
Rizal
ScottMcG
Sebata
Shrimp
Spud
Cluster
B1
B1
B1
B1
B1
B1
B1
B1
B2
B2
B2
B2
B2
B3
B3
B3
B3
B3
B3
B3
B3
B4
B4
B4
B4
B4
B4
B5
B5
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
C1
Accession #
KC576784
JN699010
JF957056
JN699009
GQ303266.1
JN699008
JF704103
JF937109
JN618996
JN699004
JN698991
DQ398048
AY129334
JN699006
JN699003
JF704095
JN698992
JN699018
EU816589
FJ641182.1
DQ398049
JF704094
DQ398044
KF024729
EU770221
JN699011
JF704104
JN699007
KF024727
JF704092
KC748970
KC691257
JQ911768
KF006817
AY129337
EU826471
DQ398053
JN412588
JF704116
GQ303260.1
JF704096
KC748968
JN412592
GQ303262.1
JN699626
JN699627
JN699013
JN624850
EU826467
EU826469
JN204348
KF024734
EU826468
Length (bp)
GC %
68355
68133
68839
68992
68016
68494
68346
68719
67169
67436
67451
67188
67480
68711
69409
68245
67949
68633
68090
69378
69059
70428
70654
70838
69904
69641
71092
69864
70120
153401
156558
154872
154466
154434
156102
155372
153766
157568
156281
155445
155167
157482
153882
154349
154573
156646
156758
155586
153894
154017
155286
155714
154906
66.5
66.5
66.5
66.5
66.5
66.5
66.5
66.5
68.9
69.0
69.0
68.9
68.9
67.5
67.5
67.6
67.5
67.5
67.6
67.5
67.3
69.1
69.1
70.0
68.3
68.6
68.9
68.4
68.3
64.7
64.7
64.7
64.8
64.8
64.8
64.7
64.7
64.7
64.7
64.6
64.6
64.6
64.8
64.7
64.8
64.7
64.8
64.7
64.7
64.8
64.8
64.7
64.8
88
Table 1
Phage
Wally
Myrna
Adjutor
Butterscotch
Gumball
Nova
PBI1
PLot
SirHarley
Troll4
244
ABCat
Bask21
Cjw1
Contagion
Dumbo
Elph10
Eureka
Henry
Kostya
Lilac
Murphy
Phaux
Phrux
Porky
Pumpkin
Rakim
SirDuracell
Toto
Ardmore
Bobi
Boomer
Che8
Daenerys
DeadP
DLane
Dorothy
DotProduct
Drago
Fruitloop
GUmbie
Hamulus
Ibhubesi
Job42
Llij
Mozy
Mutaforma13
Pacc40
PMC
Ramsey
RockyHorror
SG4
Shauna1
Cluster
C1
C2
D
D
D
D
D
D
D
D
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
E
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
F1
Accession #
JN699625
EU826466
EU676000
FJ168660
FJ168661
JN699014
DQ398047
DQ398051
JF937107
FJ168662
DQ398041
KF188414
JF937091
AY129331
KF024732
KC691255
JN391441
JN412590
JF937096
EU816591
JN382248
KC748971
KC748969
KC661277
EU816588
GQ303265.1
JN006062
JF937106
JN006061
GU060500
KF114874
EU816590
AY129330
KF017005
JN698996
JF937093
JX411620
JN859129
JN542517
FJ174690
JN398368
KF024723
JF937098
KC661280
DQ398045
JF937102
JN020142
FJ174692
DQ398050
FJ174693
JF704117
JN699012
JN020141
Length (bp)
GC %
155299
164602
64511
64562
64807
65108
64494
64787
64791
64618
74483
76131
74997
75931
74533
75736
74675
76174
76049
75811
76260
76179
76479
74711
76312
74491
75706
75793
75933
52141
59179
58037
59471
58043
56461
58899
58866
55363
54411
58471
57387
57155
55600
59626
56852
57278
57701
58554
56692
58578
56719
59016
59315
64.7
65.4
59.9
59.7
59.6
59.7
59.8
59.8
59.6
59.6
63.4
63.0
62.9
63.7
63.1
63.0
63.0
62.9
63.0
63.5
63.0
62.9
62.9
63.1
63.5
63.0
62.9
62.9
63.0
61.5
61.7
61.1
61.3
61.6
61.6
61.9
61.4
61.8
61.2
61.8
61.4
61.8
61.2
61.2
61.5
61.1
61.3
61.3
61.4
61.2
61.1
61.9
61.7
89
(Continued)
Phage
ShiLan
SiSi
Spartacus
Taj
Tweety
Velveteen
Wee
Avani
Che9d
Jabbawokkie
Yoshi
Angel
Avrafan
BPs
Halo
Hope
Liefie
Konstantine
Predator
Barnyard
Babsiella
Brujita
Island3
Che9c
BAKA
Courthouse
LittleE
Omega
Optimus
Redno2
Thibault
Wanda
Adephagia
Anaya
Angelica
BarrelRoll
CrimD
JAWS
TM4
MacnCheese
Pixie
Fionnbharth
Larva
JoeDirt
LeBron
UPIE
Breezona
Crossroads
Faith1
Rumpelstiltskin
Winky
Whirlwind
Bongo
Cluster
F1
F1
F1
F1
F1
F1
F1
F2
F2
F2
F2
G
G
G
G
G
G
H1
H1
H2
I1
I1
I1
I2
J
J
J
J
J
J
J
J
K1
K1
K1
K1
K1
K1
K2
K3
K3
K4
K5
L1
L1
L1
L2
L2
L2
L2
L2
L3
M
Accession #
JN020143
KC661278
JQ300538
JX121091
EF536069
KF017004
HQ728524
JQ809702
AY129336
KF017003
JF704115
FJ973624
JN699002
EU568876
DQ398042
GQ303261.1
JN412593
FJ174691
EU770222
AY129339
JN699001
FJ168659
HM152765
AY129333
JF937090
JN698997
JF937101
AY129338
JF957059
KF114875
JN201525
KF006818
JF704105
JF704106
HM152764
JN643714
HM152767
JN185608
AF068845
JX042579
JF937104
JN831653
JN243855
JF704108
HM152763
JF704113
KC691254
KF024731
JF744988
JN680858
KC661276
KF024725
JN699628
Length (bp)
GC %
59794
56279
61164
58550
58692
54314
59230
54470
56276
55213
58714
41441
41901
41901
42289
41901
41650
68952
70110
70797
48420
47057
47287
57050
111688
110569
109086
110865
109270
108297
106327
109960
59646
60835
59598
59672
59798
59749
52797
61567
61147
58076
62991
74914
73453
73784
76652
76129
75960
69279
76653
76050
80228
61.4
61.5
61.7
61.9
61.7
61.5
61.8
61.0
60.9
61.1
61.0
66.7
66.6
66.6
66.7
66.6
66.8
57.4
56.4
57.5
67.1
66.8
66.8
65.4
60.7
60.9
61.3
61.4
60.8
60.9
60.8
60.8
66.6
66.4
66.4
66.6
66.9
66.6
68.1
67.3
67.3
68.0
65.3
58.8
58.8
58.8
58.9
58.9
58.9
58.9
58.9
59.3
61.6
90
Table 1
Phage
PegLeg
Rey
Butters
Charlie
Redi
Catdawg
Corndog
Dylan
Firecracker
BigNuz
Fishburne
Jebeks
Giles
Send513
Marvin
Dori
DS6A
Muddy
Patience
Wildcat
Cluster
Accession #
Length (bp)
GC %
M
M
N
N
N
O
O
O
O
P
P
P
Q
R
S
Single
Single
Single
Single
Single
KC900379
JF937105
KC576783
JN256079
JN624851
KF017002
AY129335
KF024730
JN698993
JN412591
KC691256
JN572061
EU203571
JF704112
JF704100
JN698995
JN698994
KF024728
JN412589
DQ398052
80955
83724
41491
43036
42594
72108
69777
69815
71341
48984
47109
45580
53746
71547
65100
64613
60588
48228
70506
78296
61.5
60.9
65.8
66.3
66.1
65.4
65.4
65.4
65.5
66.7
67.3
67.3
67.3
56.0
63.4
66.0
68.4
58.8
50.3
57.2
91
92
93
typically represented, following the Lambda precedent,
with these genes at the left end of the genome and transcribed rightward (see Fig. 3, 4, and 5). Sometimes this
structural operon is situated such that the terminase
genes at its left end are close to the physical ends of the
genome, but there are many examples (e.g., Cluster A
phages) where other genes are situated in this interval.
Large subunit terminase genes can be readily identified
in most of these phages, and manybut not allhave
an identifiable small terminase subunit. The other virion structural genes typically follow in the order: terminase, portal, capsid maturation protease, scaffolding,
capsid subunit, 4 to 6 head-tail connector proteins, major tail subunit, tail assembly chaperones, Tapemeasure
protein, and 5 to 10 minor tail proteins.
Although the gene order is very well conserved,
there is substantial sequence diversity, to the extent that
genes conferring some of the specific functions cannot
be predicted from their sequence alone, although their
position is also informative. However, because it is relatively simple to characterize the virions themselves,
by SDS-PAGE, N-terminal sequencing, or mass spectrometry, correlations between the structural genes
and proteins can be readily determined (50, 53, 90,
102, 103). In general, these studies show that the major
capsid and tail subunits are the most abundant and
that in some phages (e.g., L5) the capsid subunit is extensively covalently cross-linked, similar to the wellstudied phage HK97 (50, 104). A scaffold protein gene
involved in head assembly is absent from some mycobacteriophages, although its function may be provided
by a domain of the capsid subunit.
The Tapemeasure protein is simple to identify because it is typically encoded by the longest gene in the
genome, and the two reading frames immediately upstream are expressed via a programmed translational
frameshift (105). Although the sequences of the
Tapemeasure proteins are very diverse, they often contain small conserved domains corresponding to peptidoglycan hydrolysis motifs (54). The precise roles for
these is not clear, but in TM4 it has been shown that
94
Figure 4 Functional genomics of mycobacteriophage Giles. A map of the mycobacteriophage Giles
was generated using Phamerator and annotated as described for Fig. 3. Boxes below the genome indicate whether the gene is nonessential for lytic growth (yellow), likely essential (blue), or essential
(green). Arrows indicate genes expressed in lysogeny (red) or early (green) or late (purple) lytic
growth, with line thickness reflecting transcription strength. Reproduced with permission from
Dedrick et al. (107). doi:10.1128/microbiolspec.MGM2-0032-2013.f4
Nonstructural Genes
The right parts of the genomes encoding nonstructural
functions are characterized by an abundance of small
open reading frames of unknown functions. They usually include a subset of genes whose functions can be
bioinformatically predicted, and these are often associated with nucleotide metabolism or DNA replication.
For example, some phages encode a DNA polymerase
similar to E. coli DNA Polymerase I, whereas others
encode alpha subunits of bacterial DNA Polymerase III.
While it is simple to reason that these are involved in
phage DNA replication, this has not been shown directly, and there are many mycobacteriophages that
do not encode their own DNA polymerase at all. So
why it would be needed in some phages but not in
others is not clear. It is also common to find helicase
and primase-like proteins, and sometimes recombination functions, including both recA and recET-like
genes. In phage Giles, proteins of previously unknown
95
function are implicated in DNA replication from mutagenesis studies (see Fig. 4) (107); D29 gp65predicted
to be part of the RecA/DnaB helicase superfamilyhas
been demonstrated to be an exonuclease (108). But
overall, very little is known about the mechanics of regulation of DNA replication in mycobacteriophages or
many of the possible genes that are involved.
Some of the nonstructural genes are cytotoxic and
kill the host when expressed or overexpressed. This has
been extensively characterized in staphylococcal phages
(109), where it has been developed as part of a drug
development pipeline, but several mycobacteriophageencoded cytotoxic proteins have also been identified.
Initially, segments of the phage L5 genome were shown
to be not tolerated on plasmid vectors and could not be
transformed into M. smegmatis (110). Further dissection showed that L5 genes 77, 78, and 79 all have cytotoxic properties, with 77 being the most potent (111).
Interestingly, expression of L5 gp79 seems to specifically inhibit cell division of M. smegmatis and promote
filamentation (111). L5 gp77 appears to act by interacting directly with Msmeg_3532, a pyridoxal- 5phosphate-dependent
L-serine
dehydratase
that
converts L-serine to pyruvate (112), although it is unclear what the consequences of the interaction are. The
role of such an interaction in the growth of L5or the
many other Cluster A phages encoding homologues of
L5 gp77is unclear.
96
97
readily interpreted. The 439-bp MPME1 elements contain 11-bp imperfect inverted repeats (IRs) near their
ends and a small (125-codon) open reading frame
encoding a putative transposase. The MPME1 elements
in phages BPs and Hope are 100% identical to each
other (from IR-L to IR-R), but the insertions differ in a
6-bp segment between IR-L and the preintegration site.
The origin of this 6-bp sequence is unclear but does not
correspond to a target duplication (117).
MPME1 elements are found in a variety of other
mycobacteriophage genomes including those in
Clusters F and I and are either identical or have no
more than a single base pair difference; there are also
truncated versions of the element in the Cluster O
phages (117). MPME2 elements are 1 base pair longer
and share 79% nucleotide sequence identity to
MPME1. They are present in the Cluster G phage Halo
and phages within Clusters F, I, and N, being either
identical to each other or having no more than a single
base pair difference. As yet there is no direct evidence
for the mobility of the MPME elements, the frequency
of movement, or the mechanism involved, although the
comparative genomics suggests that these are active and
moving at a respectable rate. No MPME elements have
been identified outside of the mycobacteriophages.
Comparative genomics also reveals an IS110-like element in phage Omega, which is absent from phage
LittleE, and the genomes are sufficiently similar in
these regions to indicate the preintegration site and a 5bp target duplication (90). There are no closely related
copies in other mycobacteriophages, but there are some
more distantly related segments in some Subcluster A1
phages (90). It is unclear if this is a remnant or an active transposon.
There are few examples of self-splicing introns in the
mycobacteriophages, but two have been recently described, both in Cluster J phages (90) and both within
virion structural protein genes. In phage BAKA the intron is small (265 bp) and is within a putative tail protein gene; in phage LittleE the intron is within the
capsid subunit gene and is larger (819 bp) due to inclusion of a small open reading frame on the opposite
Figure 5 Genome map of mycobacteriophage Alma. The genome map of mycobacteriophage Alma was generated using Phamerator and is illustrated as described for Fig. 3. Alma
is a Subcluster A6 phage and shares the features of other Cluster A phages in having multiple
binding sites for its repressor protein (gp75). These stoperator sites are indicated by vertical
arrows, and the orientation of the asymmetric sites relative to genome orientation are shown
as (-) or (+). Stoperators were identified as sequences corresponding to the consensus sequence 5-GATGAGTGTCAAG with no more than a single mismatch. Note that the
stoperator consensus sequences can differ for different Subcluster A phages (98).
doi:10.1128/microbiolspec.MGM2-0032-2013.f5
98
Figure 6 A model for mycobacteriophage diversity. The large number of different types of
mycobacteriophages isolated on M. smegmatis mc2155 can be explained by a model in
which phages can readily infect new bacterial hostseither by a switch or an expansion of
host rangeusing a highly diverse bacterial population that includes many closely related
strains. As such, phages with distinctly different genome sequences and GC% contents
infecting distantly related bacterial hosts, such as those to the left (red) or right (blue)
extremes of a spectrum of hosts, can migrate across a microbial landscape through multiple
steps. Each host switch occurs at a relatively high frequency (<1 in 105 particles, or an average of about one every 103 bursts of lytic growth) and much faster than either amelioration
of phage GC% to its new host or genetic recombination. Two phages (such as those shown
in red and blue) can thus arrive at a common host (M. smegmatis mc2155) but be of distinctly different types (clusters, subclusters, and singletons). Reproduced with permission
from Jacobs-Sera et al. (99). doi:10.1128/microbiolspec.MGM2-0032-2013.f6
strand with similarities to homing endonucleases. Because both these proteins are required for virion assembly, and the capsid must be expressed at very high
levels, the splicing events are expected to occur extremely efficiently. They are probably variants of group
I introns, but little is known about their mechanisms
of splicing (90). Other introns may be present in other
genomes but have escaped detection because of insertion in genes of unknown function.
The rarity of self-splicing RNA introns contrasts
with the relative abundance of inteins that splice out
at the protein level. These are typically identifiable
through conserved domains associated with intein splicing and are often apparent by comparative genomics.
MYCOBACTERIOPHAGEHOST INTERACTIONS
99
Cluster K phagesregardless of subclusterefficiently
infect M. tuberculosis; among the Cluster A phages,
only phages within specific subclusters can infect M. tuberculosis, including Subclusters A2 and A3. Interestingly, the Cluster G phages (e.g., BPs) do not efficiently
infect M. tuberculosis, although mutants that do can be
isolated at a frequency of <105, and the mutants
maintain the ability to infect M. smegmatis. Several of
these mutants have been mapped, and all have single
amino acid substitutions in gene 22, encoding a putative tail fiber protein (107, 117). A simple interpretation is that the mutants overcome the need for a
specific interaction with receptors on the M. tuberculosis cell surface. However, adsorption assays suggest
there is a more complex explanation, because the mutants do not appear to adsorb to M. tuberculosis more
efficiently than the wild-type parent phage, and surprisingly, they adsorb substantially better to M. smegmatis,
in some cases, dramatically so (99). This is not easy to
explain, because the mutants appear to infect M. smegmatis quite normally and were selected for infection of
M. tuberculosis.
The same set of phages was also tested for their ability to infect two other strains of M. smegmatis (99).
Many, but not all, of the phages efficiently infect these
strains, and although there is a correlation between
infection and cluster/subcluster type, it is statically
weaker than with M. tuberculosis infection. But similarly, there are instances where phages are able to overcome the host barrier at moderate frequencies (105),
and examination of phage Rosebush mutants capable
of infecting M. smegmatis Jucho (99) reveals amino
acid substitutions in putative tail fibers, reminiscent of
the Cluster G mutants that infect M. tuberculosis.
100
mycobacteriophage receptors and how phage tail structures specifically recognize them.
However, there are many other determinants of host
range beyond the availability of cell wall receptors, including restriction/modification systems, lysogenic immunity, CRISPR elements (125), and various abortive
infection systems including those mediated by toxinantitoxin systems (126). M. smegmatis mc2155 has no
known prophages, restriction modification systems, or
CRISPR arrays, which may contribute to its suitability
as a permissive host for phage isolation. However,
M. tuberculosis H37Rv contains a type III-A family
CRISPR array (127) and more than 80 putative toxin/
antitoxin systems (128), perhaps suggesting development within a phage-rich environment in its recent
evolutionary history. None of the spacer elements within the M. tuberculosis CRISPR array (which are used
for spoligotyping M. tuberculosis isolates) closely
match known phage sequences, and it is unclear whether the CRISPR arrays are currently active or contribute
in any way to phage resistance (129). M. tuberculosis
H37Rv has two small (<10 kbp) prophage-like
elements in its genome (Rv1 and Rv2), but it is
unlikely that these confer immunity to other phages
(96). However, a likely intact 55-kbp prophage has
been identified in a Mycobacterium canettii strain (88),
and other prophages are present in the genomes of
M. avium 104 (130), Mycobacterium marinum (87),
M. ulcerans (85), and Mycobacterium abscessus (83,
84). Resistance of M. smegmatis to D29 infection can
result from overexpression of the host multicopy phage
resistance (mpr) gene, perhaps by alteration of the cell
surface, although the specific mechanism is not known
(131, 132).
Because phages can easily replicate from a single
particle to vast numbers (there are 106 to 108 particles
in a typical plaque), and mutants arise at moderate frequencies, host range mutants can readily arise
within environmental populations. As such, phages
are expected to move from one host to another at
frequencies that are vastly greater than the time required to ameliorate their genomic features, such as
coding biases and GC%, to a specific host. This leads
to a model (Fig. 6) in which two key parameters contribute to the diversity of phages: the ability of phages
to rapidly switch hosts, and the availability of a broad
spectrum of closely related hosts in the environment
from which the phages are isolated (99). This model
can therefore account for the nature of mycobacteriophage genomes and predicts that similar phage diversity will be seen using other hosts and similar
sampling (99).
Lysis Systems
Host cell lysis is a critical step in phage lytic growth, and
in the prototype lambda system is both efficient and precisely timed (133, 134). Understanding the lysis systems
of mycobacteriophages is of particular interest in that
these can illuminate features of mycobacterial cell walls.
Mycobacteriophage lysis systems are described in detail
in reference 243 and will be discussed only briefly here.
Most mycobacteriophages encode at least three proteins required for lysis: a peptidoglycan-cleaving endolysin (also called Lysin A), Lysin B, and a holin; a few
lack Lysin B, some either lack a holin or it is difficult to
identify bioinformatically, but all encode an endolysin
(135138). Phage Ms6 and its relatives encode an additional protein that acts as a chaperone for delivery of
the endolysin to its peptidoglycan target (139). The
endolysins are diverse and modular in nature, reflecting
an intragenic mosaicism that is a microcosm of the generally mosaic nature of the phage genomes (138). Many
are composed of three segments: an N-terminal domain
with predicted peptidase activity, a central domain that
cleaves the peptidoglycan sugar backbone, and a Cterminal cell wall binding domain; there are, however,
numerous departures from this general organization
(138). In total, there are at least 25 different organizations (Org-A to Org-Y) with unique combinations of
the constituent domains. Interestingly, the Ms6 lysA
(Org-J) gene encodes a second lysis gene that is wholly
embedded within lysA and is expressed by translation
initiation from an internal start codon; phage mutants
that express only the longer (Lysin384) or the shorter
(Lysin241) endolysin are viable (140). Lysin B encodes
an esterase that cleaves the linkage of the mycolylarabinogalactan to the peptidoglycan (136, 137) and,
unlike lysin A, is dispensable for lytic growth (137).
However, it is required for optimal lysis and efficient
phage reproduction (137) and likely plays an analogous
role to the spanins that facilitate fusion of the inner and
outer membranes of Gram-negative bacteria during
phage lysis (134, 141). Analysis of the Ms6 gp4 shows
that it is a likely holin, with a signal-arrest-release
(SAR) domain followed by a transmembrane domain
(142). However, Ms6 gp5 also has a transmembrane domain, and gp4 and gp5 interact to facilitate lysis (142).
INTEGRATION SYSTEMS
Phages within more than one-half of the different
clusters encode a phage integrase, and there are numerous examples of both tyrosine integrases (Int-Y) and
serine integrases (Int-S); these include Clusters A, E, F,
101
integrates into the groEL1 gene of M. smegmatis, and
Bxz2, which integrates into the gene Msmeg_5156
(145, 147). Bxb1 integration has been especially useful
for gaining insights into mycobacterial physiology because Bxb1 lysogens are defective in the formation of
mature biofilms (149). This results specifically from
inactivation of the groEL1 gene by integrative disruption and led to the demonstration that GroEL1 plays
a role as a dedicated chaperone for mycolic acid biosynthesis (149, 150). It is not known if disruption of
Msmeg_5156 or any other host gene as a consequence
of phage integration via a serine integrase has specific
physiological consequences.
Phage integration systems typically are carefully regulated in their recombination directionality, such that
integrase catalyzes integrative recombination using attP
and attB sites (252 bp and 29 bp, respectively [151,
152]) but in the absence of accessory factors does not
mediate excisive recombination utilizing attL and attR
sites (which are themselves the products of integration)
(153). Directional control is enabled by a recombinational directionality factor (RDF), a phage-encoded accessory actor that determines which pairs of attachment
sites can undergo recombination. The L5 RDF has been
identified (gp36) and acts by binding to specific DNA
sites within attP and attR to bend DNA and alter the
nature of higher-order protein-DNA architectures that
can be formed by the Int-Y, the RDF, and the host integration factor, mIHF (154157). Directional control is
determined by a different mechanism for the serine
integrases, as illustrated for Bxb1. The attachment sites
are relatively small (<50 bp), and Int binds as a dimer
to each site, and the choice of pairs of sites competent
for recombination is determined by which proteinDNA complexes can undergo synapsis, presumably
predicated on compatible conformations of Int bound
to specific sites (158, 159). A phage-encoded RDF,
Bxb1 gp47, associates with Int-DNA complexes not
through DNA binding, but by direct protein-protein interactions and presumably alters the conformations
such that attL and attR can undergo synapsis, but attP
and attB cannot (160, 161). A curious feature of Bxb1
is that the RDF is encoded by gene 47, which is situated among DNA replication genes, and is not closely
linked to int (gene 35). Numerous mycobacteriophages
have homologues of Bxb1 47, including many (such
as L5) that utilize tyrosine integrases and for which
there is no obvious role in recombination. Presumably,
these genes play alternative roles in phage growth,
most likely in DNA replication, and have been coopted
by the integration systems for directional regulation (162).
102
MYCOBACTERIOPHAGE GENE
EXPRESSION AND ITS REGULATION
None of the sequenced mycobacteriophage genomes
encode single-subunit RNA polymerases, and it is likely
that all phage transcription utilizes the host RNA polymerase. In some examples, phage growth has specifically been shown to be sensitive to the transcription
inhibitor, rifampin (33, 50). Promoters corresponding
to those used by the host major sigma factor (SigA)
have also been identified in several mycobacteriophages
and can be predicted in many others. For example, the
strong Pleft promoter in phage L5 closely corresponds
to 10 and 35 SigA sequences (163), as do both the
early lytic promoter (PR) and the repressor promoter
(Prep) of BPs (164), and in all three examples, the transcription start site has been mapped. Interestingly, in
both the BPs promoters, these transcripts provide a
leaderless mRNA for the first gene in the operon, with
the first base of the translation initiation codon corresponding to the 5 end of the mRNA. The use of leaderless mRNA transcripts has been shown previously in
expression of the firefly luciferase reporter gene in an
L5 recombinant phage (165). A SigA-like promoter has
also been described for expression of the Ms6 lysis cassette, but this does include a leader between the transcription initiation site and the predicted translational
start codon (166).
The use of host SigA-like promoters does not, however, appear to be universal for mycobacteriophage transcription. For example, a search for SigA-like promoters
in mycobacteriophage Rey reveals no close matches and
none that closely correlate with positions where promoters are anticipated to be. In phage Giles, where the transcripts in lysogeny as well as early and late lytic growth
have been mapped by RNAseq (Fig. 4), there are no evident SigA-like promoter sequences upstream of where
transcription starts. Although late transcripts are likely
to utilize phage-encoded activators, it is unclear whether
the early promoters in Giles use an alternative host sigma
factor or if these also are regulated by phage-encoded
functions. There is clearly much to be learned about how
transcription initiation is regulated in mycobacteriophage
growth. The Cluster A phages all encode a potential sigma factor (Phams 1448, 1922, 2982 in the Phamerator
database Mycobacteriophage_285), as indicated by
HHPred searches with the closest similarity to SigK of
M. tuberculosis, which could regulate phage gene expression. Many phages also encode one or more WhiB
proteins, but whether these regulate host or phage expression is not clear. In TM4, it has been shown that
the phage-encoded WhiB protein is not required for
phage growth, although it contains an Fe-S cluster, is a
Transcriptomics
RNAseq analysis of transcription in phage Giles is quite
informative. In lysogeny, few regions are expressedas
expectedand these include gene 47, which encodes
the phage repressor. Somewhat surprisingly, the repressor is expressed at high levels, equivalent to the 0.5
percentile of highest expressed genes in M. smegmatis
(107), in noted contrast to the lower expression of other phage repressors (168). Presumably, the Giles repressor binds with relatively low affinity to its binding site
(s), although these have yet to be identified. Lysogenic
transcription proceeds through the four downstream
genes, although at a much lower level (Fig. 4). The leftward-transcribed genes at the left end of the genome
(34) are also expressed at a low level in lysogeny. Perhaps most surprising from this RNAseq analysis is the
expression of an apparent noncoding RNA near the
right end of the genome (Fig. 4), which is made during
both lysogenic and early lytic growth. It is expressed at
a high level, but its function is unclear, and the DNA
segment encoding it is not required for phage growth
or lysogeny.
103
heteroimmunity evolves in this system is unclear, because although amino substitutions within the repressor could readily lead to altered recognition of
positions 2 to 8 in the stoperator sites, it is unclear
what selection pressure leads to coordinate evolution of
over 30 genomically dispersed binding sites to new
specificities (see Fig. 6). Interestingly, although the repressor/stoperator system is restricted to the Cluster A
phages within the collection of sequenced mycobacteriophages, it appears to be shared with phages of some
other Actinomycetales hosts. For example, there are
multiple stoperator-like sites in Streptomyces phage R4
and its relatives (244), as well as in Rhodococcus phage
RER2 (179); in the latter phage the invariant positions
(1, 9 to 13) are the same as in the Cluster A mycobacteriophages, but the R4-like Streptomyces phage
stoperators are substantially different. The prevalence
of this regulatory system among the broader Actinomycetales phage population remains to be explored.
Although this particular regulatory system is confined to Cluster A phages, within the mycobacteriophages, other phages contain repeated sequences that
are likely involved in the regulation of gene expression.
One example is the Cluster K phages that contain a
conserved sequence (5-GGGATAGGAGCCC) positioned 2 to 8 bp upstream of putative translation start
sites and are thus referred to as start-associated sequences (SASs); there are 10 to 19 copies per genome
(97). The sequence is asymmetric and is situated in
the expected location of the ribosome binding site,
and eight of the conserved positions can pair with the
3 end of the 16S rRNA. However, there are wellconserved positions at the edges of the site that cannot
pair with the 16SrRNA, suggesting that it is not
just a common variation of a ribosome binding site.
Moreover, the sites are situated exclusively next to
nonvirion structural genes, and the consensus site is
present neither in other mycobacteriophages nor in the
M. smegmatis genome (97). So if these SASs are involved in translation initiation, they likely also require
a phage-encoded regulator. An intriguing hypothesis is
that the SAS-associated genes are highly expressed in
early lytic growth and that the SASs play a role in releasing ribosomes from these transcripts during late
lytic growth to optimize ribosome availability for late
gene expression. Curiously, a subset of the SAS sites has
a second conserved sequence composed of imperfect
17-bp IRs separated by a variable spacer. Because of
their tight association with SASstypically located
with a few base pairs upstreamthese are referred to
as extended start-associated sequences, and their roles
are unknown (97).
104
Integration-Dependent Immunity
A novel system of immunity regulation has been described in several different types of mycobacteriophages, including BPs (Cluster G), Brujita (Cluster I),
Charlie (Cluster N) and BigNuz (Cluster P) (164,
180). The characteristic featurereadily recognizable
bioinformaticallyis that the phage attachment site for
integration (attP) is located within the coding sequence
for the phage repressor (e.g., BPs gp33). In these systems, the repressor is located immediately upstream of
the integrase gene (int), in contrast to the immunity system of Cluster A phages described above, in which int
and the repressor are separated by <20 kbp (see Fig. 6).
The immunity functions are thus compactly organized,
with all of the required functions situated within <2
kbp of the genomes. Because of the specific location of
attP with the repressor, integration of the phage
genome plays a central role in the lytic-lysogenic decision, generating two alternative forms of the repressor:
a longer viral form (e.g., BPs gp33136) and a shorter
prophage form (BPs gp33103) that lacks the C-terminal
33 residues of the viral form. As expected, the shorter
prophage form is active in conferring superinfection
immunity, but interestingly, the longer viral form is not.
The reason for this is that the extreme C-terminus of
the viral form contains an ssrA-like tag that targets it
for degradation, presumably by the ClpXP protease
(164). This is a functionally important distinction in
the two forms of the repressor and directly determines
the frequency at which lysogeny is established; a mutant phage encoding a stabilized viral repressor form
has a greatly elevated frequency of lysogenization
(164). Nonetheless, establishment of immunity is dependent on phage integration, because otherwise the
active prophage form of the repressor cannot be expressed (164). The activity of the integration system
thus must also play a central role in the lytic-lysogenic
decision, because if Integrase (Int) was always expressed and fully active, then lytic growth would
not occur.
The resolution to this conundrum is that the integrase
also contains a C-terminal ssrA-like tag, such that degradation of Int is anticipated to also determine the lysogenization frequency. This feature is illustrated by the
behavior of integration-proficient plasmid vectors (see
below) that transform at unexpectedly low frequencies
due to Int degradation; stabilization of Int leads to large
increases in the transformation frequencies (164). The
relative simplicity of these immunity systems with a few
genes and DNA sites serving multiple functions suggests
that these may have evolved relatively early in the development of phage immunity systems (180).
APPLICATIONS FOR
MYCOBACTERIAL GENETICS
Mycobacteriophages have proven to be essential tools
for the development of mycobacterial genetics and to
provide potential clinical tools for tuberculosis control.
Given the well-established advantages of viral approaches to a variety of biological problems, mycobacteriophages would seem to offer considerable promise
for genetic and clinical applications. Like most viruses,
mycobacteriophages show host specificity, efficiently
introduce their DNA into the host, replicate efficiently,
express genes at high levels, utilize a variety of regulatory strategies, and are simple to grow. Mycobacteriophages offer an additional advantage in that their
rapid growth contrasts to the extremely slow growth
rate of M. tuberculosis. The tools for mycobacterial
genetics are typically of two general classes. First,
there are those that take advantage of the phages
themselves, such as for efficient delivery of foreign
DNA to the hosts. The second is the exploitation of
phage components such as plasmid vectors or selectable
markers. Applications in clinical microbiology have
largely focused on simple and rapid tools for tuberculosis diagnosis.
Generalized Transduction
Generalized transduction of genetic markers in
M. smegmatis was one of the earliest applications
of mycobacteriophages (13). Although transduction
was initially demonstrated using phage I3which has
yet to be fully genomically characterized but liklely is
grouped within Cluster Cit has also been shown for
other Cluster C phages such as Bxz1 (246). Transduc-
105
tion using these phages typically generates relatively
few transductants, depending on the locus, but a number of markers have been moved between M. smegmatis strains. Unfortunately, the Cluster C phages do
not infect M. tuberculosis and there currently is no generalized transducing phage for slow-growing mycobacteria. Moreover, all of the phages that infect
M. tuberculosis contain genomic cohesive ends and are
thus not good candidates for generalized transducers.
Presumably phages of M. tuberculosis that have terminally redundant genomesand are thus candidates for
generalized transducersexist in nature, but have yet
to be isolated.
Mycobacteriophages for
Efficient Gene Delivery
A key feature of mycobacteriophages is that when
added to host cells at a multiplicity of infection (moi)
greater than about 3, every cell in the culture is infected. This is vastly more efficient than introducing
DNA by electroporation and when using recombinant
phages provides a simple means of introducing foreign
DNA. The first strategy to exploit this was the construction of shuttle phasmids that replicate as large
(<50 kbp) cosmids in E. coli and as phages in
mycobacteria (44), and these continue to be indispensible tools of mycobacterial genetics; see reference
245 for a detailed discussion. Shuttle phasmids are typically constructed by ligation of phage DNA to a
cosmid vector and packaging into phage lambda
particles in vitro. These are used to infect E. coli, the library of colonies is harvested, and DNA is isolated and
then used to electroporate M. smegmatis. The plaques
recovered are purified and demonstrated to retain the
cosmid vector (76). The strategy relies on the use of
mycobacteriophages with appropriately sized genomes,
as the packaging constraints for phage lambda are
<53 kbp and there must be a region of nonessential
genes that can be replaced by the cosmid vector. Shuttle
phasmids have been constructed from phages TM4
and D29, both of which satisfy these constraints, as
well as having the ability to efficiently infect both
M. smegmatis and M. tuberculosis. TM4 shuttle phasmids are the most widely used, and conditionally
replicating temperature-sensitive mutants have been developed that facilitate the recovery of survivors following shuttle phasmid infection (49); these mutations
have been mapped through analysis of temperatureresistant revertants (97).
Shuttle phasmids can be readily manipulated in
E. coli by replacing the cosmid moiety with another
106
cosmid carrying desired genes. The three primary
applications have been in the construction of reporter
phages for the delivery of luciferase or fluorescent
genes (188190), for the delivery of allelic exchange
substrates for constructing M. tuberculosis mutants
(48), and for transposon delivery (49). Because transposon insertion is typically a low-frequency event, the
high efficiency of shuttle phasmid infection is critical
and has enabled the construction of high-density transposon libraries and their use for determining gene
requirements under specific conditions (191194).
Recombinant phages have been constructed for the
delivery of reporter genes by two other methods. First,
a derivative of phage L5 that carries the firefly luciferase gene was constructed by crossing over from a plasmid and identifying the recombinants by hybridization
(165); these infect both fast- and slow-growing mycobacteria (195). Second, a reporter phage derivative of
D29 has been constructed carrying green fluorescent
protein (GFP) using recombineering (196). These approaches are more directed than using shuttle phasmids
but contribute toward a suite of strategies for using
phages to deliver DNA efficiently to mycobacterial hosts.
Selectable Markers
Although they have yet to find widespread use, phage
immunity systems provide effective selectable markers
that avoid the use of antibiotics and antibiotic resistance genes. These have advantages both for use in
combination with other antibiotic resistance genes
when constructing complex recombinants and few antibiotic resistance markers are available, as well as for
constructing recombinants of virulent strains where introducing additional antibiotic resistance genes can enhance the biosafety concerns. The first example of
immunity selection was demonstrated using the L5 repressor, which confers immunity to both L5 and D29
(110), but the strategy is adaptable for any phage in
which the repressor has been identified and shown
to confer immunity to superinfection (97, 107, 164).
Patterns of phage heteroimmunity indicate which of
these can be used in combination with each other. In
principle, there are dozens of additional immunity systems and thus the potential to create a large suite of
compatible selectable systems.
Selection for transformants carrying the immunity
marker typically uses a clear-plaque derivative of the
107
parent phage from which the repressor gene was derived (110). Such mutants can usually be isolated easily
and efficiently kill nontransformed cells, although
phage-resistant mutants can arise at low frequency.
Selection can be accomplished by first spreading 108 to
109 virion particles onto the surface of solid media,
followed by plating of the electroporated cells, or by
first adding the phage to the recovering electroporated
cells and plating the mixture onto solid media (164).
When using such selectable schemes, it is helpful to be
mindful of the nature of the recombinants recovered. If
the phage is competent for integration (the more usual
circumstance), then the recombinants recovered will also harbor an integrated complete prophage. For many
experimental purposes this may not matter and has the
potential advantage of providing a system for complete
maintenance of the repressor plasmid, even when replicating extrachromosomally, because should plasmid
loss occur, the phage immediately begins lytic growth
leading to phage propagation and cell death. If this is
undesirable, then phages defective in integration can be
used, allowing the recovery of transformants that are
completely phage free. The availability of the BRED engineering approach (182) facilitates the construction of
such integration-defective mutants (164; G. W. Broussard
and G. F. Hatfull, unpublished observations).
Mycobacterial Recombineering
Recombineeringgenetic engineering by recombination (215)is a method developed originally for manipulation of E. coli, taking advantage of the ability of
phage-encoded recombination systems to greatly stimulate the levels of recombination, such that introduction
of DNA substrates containing homology to the chromosome are efficiently incorporated (215, 216). The
commonly used recombination systems are the Red
function of phage Lambda, which contains three components, an exonuclease (Exo), a DNA pairing enzyme
(Bet), and a RecBCD inhibitor (Gam), or the rac
prophage-derived RecET system (215, 217). There are
anecdotal reports of attempts to use the Lambda Red
system in mycobacteria, all without success, although
this is perhaps not surprising given the species specificities seen in similar recombineering systems (218).
A mycobacterial recombineering system was developed using recombinases encoded by mycobacteriophages (184186), and this strategy of using
species-cognate recombination systems has been used
for other bacteria (219). However, not all phages
encode recombination systems that are bioinformatically identifiable, and it was not until a substantial
collection of sequenced genomes was available (56)
108
that recET-like genes were identified in mycobacteriophage Che9c (184). Induced expression of Che9c 60
and 61 promotes recombination levels such that recombinants can be recovered following electroporation of a
dsDNA substrate with homology to the chromosome,
both in M. smegmatis and in M. tuberculosis (184).
The requirements for homology seem less permissive
than in E. coli, and the frequency of recombination
drops substantially once the length falls below about
500 bp. Recombineering can also use ssDNA substrates, and then only Che9c gp61 is required (185).
This enables the introduction of point mutations, especially when they can be selected for directly, such as
those that confer antibiotic resistance, where the total
number of recombinants recovered can be greater than
105 (185). A mutant that cannot be selected for directly
may also be constructed by co-electroporation with one
that does, because a high proportion of the selected
transformants also carry the unselected mutation and
can be screened for using PCR (185). This approach
mirrors that developed for manipulation of phage genomes (182), as described above.
Phage Therapy?
No discussion of mycobacteriophages is complete without some consideration of their use for direct therapeutic control of mycobacterial infections including
tuberculosis. The general concept of phage therapy is an
old one and was championed by Felix DHerelle for several decades following his discovery of bacteriophages
(237). Phage therapy has been widely employed in the
former Soviet Union but has been slow to gain acceptance elsewhere, although recently, phages have been
approved for control of both Listeria and E. coli contaminations in meat. The use of mycobacteriophages for
control of tuberculosis has considerable appeal given
the predominance of multidrug-resistant (MDR) strains
and the appearance of extensively drug-resistant (XDR)
strains (238). Although there has been some laboratory assessment of phage therapy in M. tuberculosis
infected guinea pigs (239), and the possible use of
surrogate systems for phage delivery to infected macrophages (240, 241), there have been no human trials.
109
of mycobacteria (18). These words were quite prescient, especially with the emphasis on the need for a
stronger basic understanding of mycobacteriophages
and the ways in which this would be useful. The discussion of mycobacteriophages in the 2000 first edition of
Molecular Genetics of Mycobacteria (64) reflected the
considerable advances in both basic and applied studies
over the subsequent dozen years.
However, the 14 years since the first edition in 2000
have quite dramatically advanced our understanding of
mycobacteriophages and ways in which they can be
exploited for tuberculosis genetics. In 2000, the complete genome sequences of only four mycobacteriophages (L5, D29, Bxb1, TM4) had been determined,
dwarfed now by the <285 complete genome sequences
in GenBank (61, 90, 97, 98) and the total of over 500
(http://www.phagesdb.com). This wealth of genomic
data has changed our perspectives of these phages
and our expectations for exploiting them for mycobacterial genetics. The diversity is clearly enormous,
and although we may now be closer to saturating the
isolation of the vast majority of genome types that infect M. smegmatis mc2155, the use of other mycobacterial hosts is likely to reveal further expansion of this
diversity.
The concluding sentence of the chapter on mycobacteriophages in the first edition stated, We can
therefore expect a rich and exciting future in elucidating mycobacteriophage evolution, novel systems of
genetic regulation, tools for genetic and clinical applications, and the role of phages in mycobacterial
physiology and virulence (64). The ensuing 14 years
more than adequately met these expectations; phage
genomics has elucidated evolutionary mechanisms, integration-dependent immunity is an excellent example
of novel regulation, recombineering is an effective genetic application, and Bxb1 integration has revealed
aspects of host physiology and biofilm formation.
What will the next period of mycobacteriophage research bring? There are numerous areas in which
advances might be anticipated. Currently, we have only
limited insights into the host ranges of the currently
characterized phages, know very little about what receptors are used and what determines specificity, and
have only limited information on the rates and mechanisms by which phages switch host ranges. It is also
reasonable to expect substantial advances in determining the functions of mycobacteriophage genes. There
are a vast number of genes (>40,000) with unknown
functions, and with new mutagenesis approaches we
should be poised to find out what these do and why
they are carried in phage genomes. Likewise, although
110
were learning more about phage gene expression and
the regulation of gene expression, this is mostly confined to small numbers of phages, and there is a great
deal more to learn. The genomic information also suggests numerous ways in which these phages can be
exploited for mycobacterial genetics and further insights into mycobacterial physiology.
Finally, the disseminated use of mycobacteriophage
research for science education and research training has
engaged hundreds of faculty and thousands of undergraduate students. Their involvement has had a huge
impact over the past 10 years, and the continued engagement of these students and researchers will secure
an active future for mycobacteriophage investigations.
I am grateful to Roger Hendrix, Craig Peebles, and Jeffrey
Lawrence for discussions; to all members of my laboratory
past and present for their insights and contributions; to
Welkin Pope, Bekah Dedrick, Greg Broussard, and Debbie
Jacobs-Sera for comments on the manuscript; and to all student phage hunters everywhere. Studies in the Hatfull laboratory were supported by National Institutes of Health grant
GM51975 and by the Howard Hughes Medical Institute
through its Professorship grant to GFH.
Citation. Hatfull GF. 2014. Molecular genetics of mycobacteriophages. Microbiol Spectrum 2(2):MGM2-0032-2013.
References
1. Gardner GM, Weiser RS. 1947. A bacteriophage for
Mycobacterium smegmatis. Proc Soc Exp Biol Med 66:
205206.
2. Whittaker E. 1950. Two bacteriophages for Mycobacterium smegmatis. Can J Public Health 41:431436.
3. Froman S, Will DW, Bogen E. 1954. Bacteriophage active against Mycobacterium tuberculosis. I. Isolation
and activity. Am J Pub Health 44:13261333.
4. Grange JM. 1975. Proceedings: bacteriophage typing of
strains of Mycobacterium tuberculosis isolated in
south-east England. J Med Microbiol 8:ix(2).
5. Grange JM. 1975. The genetics of mycobacteria and
mycobacteriophages: a review. Tubercle 56:227238.
6. Jones WD Jr. 1975. Phage typing report of 125 strains
of Mycobacterium tuberculosis. Ann Sclavo 17:
599604.
7. Jones WD Jr. 1988. Bacteriophage typing of Mycobacterium tuberculosis cultures from incidents of suspected
laboratory cross-contamination. Tubercle 69:4346.
8. Snider DE Jr, Jones WD, Good RC. 1984. The usefulness of phage typing Mycobacterium tuberculosis
isolates. Am Rev Respir Dis 130:10951099.
9. Goode D. 1983. Bacteriophage typing of strains of
Mycobacterium tuberculosis from Nepal. Tubercle 64:
1521.
10. Murohashi T, Tokunaga T, Mizuguchi Y, Maruyama
Y. 1963. Phage typing of slow-growing mycobacteria.
Am Rev Respir Dis 88:664669.
111
46. Snapper SB, Lugosi L, Jekkel A, Melton RE, Kieser T,
Bloom BR, Jacobs WR Jr. 1988. Lysogeny and transformation in mycobacteria: stable expression of foreign
genes. Proc Natl Acad Sci USA 85:69876991.
47. Snapper SB, Melton RE, Mustafa S, Kieser T, Jacobs
WR Jr. 1990. Isolation and characterization of efficient
plasmid transformation mutants of Mycobacterium
smegmatis. Mol Microbiol 4:19111919.
48. Bardarov S, Bardarov S Jr, Pavelka MS Jr,
Sambandamurthy V, Larsen M, Tufariello J, Chan J,
Hatfull G, Jacobs WR Jr. 2002. Specialized transduction: an efficient method for generating marked and unmarked targeted gene disruptions in Mycobacterium
tuberculosis, M. bovis BCG and M. smegmatis. Microbiology 148:30073017.
49. Bardarov S, Kriakov J, Carriere C, Yu S, Vaamonde C,
McAdam RA, Bloom BR, Hatfull GF, Jacobs WR Jr.
1997. Conditionally replicating mycobacteriophages: a
system for transposon delivery to Mycobacterium tuberculosis. Proc Natl Acad Sci USA 94:1096110966.
50. Hatfull GF, Sarkis GJ. 1993. DNA sequence, structure
and gene expression of mycobacteriophage L5: a phage
system for mycobacterial genetics. Mol Microbiol 7:
395405.
51. Mediavilla J, Jain S, Kriakov J, Ford ME, Duda RL,
Jacobs WR Jr, Hendrix RW, Hatfull GF. 2000. Genome
organization and characterization of mycobacteriophage Bxb1. Mol Microbiol 38:955970.
52. Ford ME, Sarkis GJ, Belanger AE, Hendrix RW, Hatfull
GF. 1998. Genome structure of mycobacteriophage
D29: implications for phage evolution. J Mol Biol 279:
143164.
53. Ford ME, Stenstrom C, Hendrix RW, Hatfull GF. 1998.
Mycobacteriophage TM4: genome structure and gene
expression. Tuber Lung Dis 79:6373.
54. Pedulla ML, Ford ME, Houtz JM, Karthikeyan T,
Wadsworth C, Lewis JA, Jacobs-Sera D, Falbo J, Gross
J, Pannunzio NR, Brucker W, Kumar V, Kandasamy J,
Keenan L, Bardarov S, Kriakov J, Lawrence JG, Jacobs
WR, Hendrix RW, Hatfull GF. 2003. Origins of highly
mosaic mycobacteriophage genomes. Cell 113:171182.
55. Hatfull GF. 2010. Bacteriophage research: gateway to
learning science. Microbe 5:243250.
56. Hatfull GF, Pedulla ML, Jacobs-Sera D, Cichon PM,
Foley A, Ford ME, Gonda RM, Houtz JM, Hryckowian
AJ, Kelchner VA, Namburi S, Pajcini KV, Popovich
MG, Schleicher DT, Simanek BZ, Smith AL,
Zdanowicz GM, Kumar V, Peebles CL, Jacobs WR Jr,
Lawrence JG, Hendrix RW. 2006. Exploring the
mycobacteriophage metaproteome: phage genomics as
an educational platform. PLoS Genet 2:e92.
57. Hanauer D, Hatfull GF, Jacobs-Sera D. 2009. Active
Assessment: Assessing Scientific Inquiry. Springer, New
York, NY.
58. Hanauer DI, Jacobs-Sera D, Pedulla ML, Cresawn SG,
Hendrix RW, Hatfull GF. 2006. Inquiry learning.
Teaching scientific inquiry. Science 314:18801881.
59. Hatfull GF. 2008. Bacteriophage genomics. Curr Opin
Microbiol 11:447453.
112
60. Hatfull GF. 2010. Mycobacteriophages: genes and genomes. Annu Rev Microbiol 64:331356.
61. Hatfull GF. 2012. Complete genome sequences of 138
mycobacteriophages. J Virol 86:23822384.
62. Hatfull GF. 1994. Mycobacteriophage L5: a toolbox for
tuberculosis. ASM News 60:255260.
63. Hatfull GF. 1999. Mycobacteriophages, p 3858. In
Ratledge C, Dale J (ed), Mycobacteria: Molecular Biology and Virulence. Chapman and Hall, London.
64. Hatfull GF. 2000. Molecular genetics of mycobacteriophages, p 3754. In Hatfull GF, Jacobs WR, Jr (ed),
Molecular Genetics of Mycobacteria. ASM Press,
Washington, DC.
65. Hatfull GF. 2004. Mycobacteriophages and tuberculosis, p 203218. In Eisenach K, Cole ST, Jacobs
WR, Jr, McMurray D (ed), Tuberculosis. ASM Press,
Washington, DC.
66. Hatfull GF. 2006. Mycobacteriophages, p 602620.
In Calendar R (ed), The Bacteriophages. Oxford
University Press, New York, NY.
67. Hatfull GF. 2012. The secret lives of mycobacteriophages. Adv Virus Res 82:179288.
68. Hatfull GF, Barsom L, Chang L, Donnelly-Wu M,
Lee MH, Levin M, Nesbit C, Sarkis GJ. 1994. Bacteriophages as tools for vaccine development. Dev Biol
Stand 82:4347.
69. Hatfull GF, Cresawn SG, Hendrix RW. 2008. Comparative genomics of the mycobacteriophages: insights
into bacteriophage evolution. Res Microbiol 159:332
339.
70. Hatfull GF, Hendrix RW. 2011. Bacteriophages and
their genomes. Curr Opin Virol 1:298303.
71. Hatfull GF, Jacobs WR Jr. 1994. Mycobacteriophages:
cornerstones of mycobacterial research, p 165183. In
Bloom BR (ed), Tuberculosis: Pathogenesis, Protection
and Control. ASM Press, Washington, DC.
72. Stella EJ, de la Iglesia AI, Morbidoni HR. 2009. Mycobacteriophages as versatile tools for genetic manipulation of mycobacteria and development of simple
methods for diagnosis of mycobacterial diseases. Rev
Argent Microbiol 41:4555.
73. McNerney R. 1999. TB: the return of the phage. A review of fifty years of mycobacteriophage research. Int J
Tuberc Lung Dis 3:179184.
74. McNerney R. 2002. Phage tests for diagnosis and drug
susceptibility testing. Int J Tuberc Lung Dis 6:1129
1130; author reply 1130.
75. McNerney R, Traore H. 2005. Mycobacteriophage and
their application to disease control. J Appl Microbiol
99:223233.
76. Jacobs WR Jr, Snapper SB, Tuckman M, Bloom BR.
1989. Mycobacteriophage vector systems. Rev Infect
Dis 11(Suppl 2):S404S410.
77. Redmond WB. 1963. Bacteriophages of mycobacteria:
a review. Adv Tuberc Rev 12:191229.
78. Sarkis GJ, Hatfull GF. 1998. Mycobacteriophages.
Methods Mol Biol 101:145173.
113
bacteriophages and prophages: all the worlds a phage.
Proc Natl Acad Sci USA 96:21922197.
97. Pope WH, Ferreira CM, Jacobs-Sera D, Benjamin RC,
Davis AJ, DeJong RJ, Elgin SCR, Guilfoile FR, Forsyth
MH, Harris AD, Harvey SE, Hughes LE, Hynes PM,
Jackson AS, Jalal MD, MacMurray EA, Manley CM,
McDonough MJ, Mosier JL, Osterbann LJ, Rabinowitz
HS, Rhyan CN, Russell DA, Saha MS, Shaffer CD,
Simon SE, Sims EF, Tovar IG, Weisser EG, Wertz JT,
Weston-Hafer KA, Williamson KE, Zhang B, Cresawn
SG, Jain P, Piuri M, Jacobs WR Jr, Hendrix RW,
Hatfull GF. 2011. Cluster K mycobacteriophages:
insights into the evolutionary origins of mycobacteriophage TM4. PLoS One 6:e26750.
98. Pope WH, Jacobs-Sera D, Russell DA, Peebles CL,
Al-Atrache Z, Alcoser TA, Alexander LM, Alfano MB,
Alford ST, Amy NE, Anderson MD, Anderson AG, Ang
AAS, Ares M Jr, Barber AJ, Barker LP, Barrett JM,
Barshop WD, Bauerle CM, Bayles IM, Belfield KL, Best
AA, Borjon A Jr, Bowman CA, Boyer CA, Bradley KW,
Bradley VA, Broadway LN, Budwal K, Busby KN,
Campbell IW, Campbell AM, Carey A, Caruso SM,
Chew RD, Cockburn CL, Cohen LB, Corajod JM,
Cresawn SG, Davis KR, Deng L, Denver DR, Dixon
BR, Ekram S, Elgin SCR, Engelsen AE, English BEV,
Erb ML, Estrada C, Filliger LZ, Findley AM, Forbes L,
Forsyth MH, Fox TM, Fritz MJ, Garcia R, George ZD,
Georges AE, Gissendanner CR, Goff S, Goldstein R,
Gordon KC, Green RD, Guerra SL, Guiney-Olsen KR,
Guiza BG, Haghighat L, Hagopian GV, Harmon CJ,
Harmson JS, Hartzog GA, Harvey SE, He S, He KJ,
Healy KE, Higinbotham ER, Hildebrandt EN, Ho JH,
Hogan GM, Hohenstein VG, Holz NA, Huang VJ,
Hufford EL, Hynes PM, Jackson AS, Jansen EC, Jarvik
J, Jasinto PG, Jordan TC, Kasza T, Katelyn MA, Kelsey
JS, Kerrigan LA, Khaw D, Kim J, Knutter JZ, Ko C-C,
Larkin GV, Laroche JR, Latif A, Leuba KD, Leuba SI,
Lewis LO, Loesser-Casey KE, Long CA, Lopez AJ,
Lowery N, Lu TQ, Mac V, Masters IR, McCloud JJ,
McDonough MJ, Medenbach AJ, Menon A, Miller R,
Morgan BK, Ng PC, Nguyen E, Nguyen KT, Nguyen
ET, Nicholson KM, Parnell LA, Peirce CE, Perz AM,
Peterson LJ, Pferdehirt RE, Philip SV, Pogliano K,
Pogliano J, Polley T, Puopolo EJ, Rabinowitz HS,
Resiss MJ, Rhyan CN, Robinson YM, Rodriguez LL,
Rose AC, Rubin JD, Ruby JA, Saha MS, Sandoz JW,
Savitskaya J, Schipper DJ, Schnitzler CE, Schott AR,
Segal JB, Shaffer CD, Sheldon KE, Shepard EM,
Shepardson JW, Shroff MK, Simmons JM, Simms EF,
Simpson BM, Sinclair KM, Sjoholm RL, Slette IJ,
Spaulding BC, Straub CL, Stukey J, Sughrue T, Tang
T-Y, Tatyana LM, Taylor SB, Taylor BJ, Temple LM,
Thompson JV, Tokarz MP, Trapani SE, Troum AP,
Tsay J, Tubbs AT, Walton JM, Wang DH, Wang H,
Warner JR, Weisser EG, Wendler SC, Weston-Hafer KA,
Whelan HM, Williamson KE, Willis AN, Wirtshafter
HS, Wong TW, Wu P, Yang YJ, Yee BC, Zaidins DA,
Zhang B, Zuniga MY, Hendrix RW, Hatfull GF. 2011.
Expanding the diversity of mycobacteriophages: insights
into genome architecture and evolution. PLoS One 6:
e16329.
114
99. Jacobs-Sera D, Marinelli LJ, Bowman C, Broussard
GW, Guerrero Bustamante C, Boyle MM, Petrova ZO,
Dedrick RM, Pope WH, Science Education Alliance
Phage Hunters Advancing Genomics and Evolutionary
Science Sea-Phages Program, Modlin RL, Hendrix RW,
Hatfull GF. 2012. On the nature of mycobacteriophage
diversity and host preference. Virology 434:187201.
100. Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix
RW, Hatfull GF. 2011. Phamerator: a bioinformatic
tool for comparative bacteriophage genomics. BMC
Bioinformatics 12:395.
101. Huson DH, Bryant D. 2006. Application of phylogenetic
networks in evolutionary studies. Mol Biol Evol 23:
254267.
102. Mageeney C, Pope WH, Harrison M, Moran D, Cross
T, Jacobs-Sera D, Hendrix RW, Dunbar D, Hatfull GF.
2012. Mycobacteriophage Marvin: a new singleton
phage with an unusual genome organization. J Virol 86:
47624775.
103. Morris P, Marinelli LJ, Jacobs-Sera D, Hendrix RW,
Hatfull GF. 2008. Genomic characterization of mycobacteriophage Giles: evidence for phage acquisition of
host DNA by illegitimate recombination. J Bacteriol
190:21722182.
104. Duda RL, Hempel J, Michel H, Shabanowitz J, Hunt D,
Hendrix RW. 1995. Structural transitions during bacteriophage HK97 head assembly. J Mol Biol 247:
618635.
105. Xu J, Hendrix RW, Duda RL. 2004. Conserved translational frameshift in dsDNA bacteriophage tail assembly
genes. Mol Cell 16:1121.
106. Piuri M, Hatfull GF. 2006. A peptidoglycan hydrolase
motif within the mycobacteriophage TM4 tape measure
protein promotes efficient infection of stationary phase
cells. Mol Microbiol 62:15691585.
107. Dedrick RM, Marinelli LJ, Newton GL, Pogliano K,
Pogliano J, Hatfull GF. 2013. Functional requirements
for bacteriophage growth: gene essentiality and expression in mycobacteriophage Giles. Mol Microbiol 88:
577589.
108. Giri N, Bhowmik P, Bhattacharya B, Mitra M, Das
Gupta SK. 2009. The mycobacteriophage D29 gene 65
encodes an early-expressed protein that functions as a
structure-specific nuclease. J Bacteriol 191:959967.
109. Liu J, Dehbi M, Moeck G, Arhin F, Bauda P,
Bergeron D, Callejo M, Ferretti V, Ha N, Kwan T,
McCarty J, Srikumar R, Williams D, Wu JJ, Gros P,
Pelletier J, DuBow M. 2004. Antimicrobial drug discovery through bacteriophage genomics. Nat Biotechnol
22:185191.
110. Donnelly-Wu MK, Jacobs WR Jr, Hatfull GF. 1993. Superinfection immunity of mycobacteriophage L5:
applications for genetic transformation of mycobacteria. Mol Microbiol 7:407417.
111. Rybniker J, Plum G, Robinson N, Small PL, Hartmann
P. 2008. Identification of three cytotoxic early proteins
of mycobacteriophage L5 leading to growth inhibition
in Mycobacterium smegmatis. Microbiology 154:
23042314.
115
137. Payne K, Sun Q, Sacchettini J, Hatfull GF. 2009.
Mycobacteriophage Lysin B is a novel mycolylarabinogalactan esterase. Mol Microbiol 73:367381.
138. Payne KM, Hatfull GF. 2012. Mycobacteriophage
endolysins: diverse and modular enzymes with multiple
catalytic activities. PLoS One 7:e34052.
139. Catalao MJ, Gil F, Moniz-Pereira J, Pimentel M. 2010.
The mycobacteriophage Ms6 encodes a chaperone-like
protein involved in the endolysin delivery to the peptidoglycan. Mol Microbiol 77:672686.
140. Catalao MJ, Gil F, Moniz-Pereira J, Pimentel M. 2011.
The endolysin-binding domain encompasses the Nterminal region of the mycobacteriophage Ms6 Gp1
chaperone. J Bacteriol 193:50025006.
141. Berry J, Rajaure M, Pang T, Young R. 2012. The
spanin complex is essential for lambda lysis. J Bacteriol
194:56675674.
142. Catalao MJ, Gil F, Moniz-Pereira J, Pimentel M. 2011.
Functional analysis of the holin-like proteins of
mycobacteriophage Ms6. J Bacteriol 193:27932803.
143. Stella EJ, Franceschelli JJ, Tasselli SE, Morbidoni HR.
2013. Analysis of novel mycobacteriophages indicates
the existence of different strategies for phage inheritance in mycobacteria. PLoS One 8:e56384.
144. Lee MH, Pascopella L, Jacobs WR Jr, Hatfull GF.
1991. Site-specific integration of mycobacteriophage
L5: integration-proficient vectors for Mycobacterium
smegmatis, Mycobacterium tuberculosis, and bacille
Calmette-Guerin. Proc Natl Acad Sci USA 88:
31113115.
145. Pham TT, Jacobs-Sera D, Pedulla ML, Hendrix RW,
Hatfull GF. 2007. Comparative genomic analysis of
mycobacteriophage Tweety: evolutionary insights and
construction of compatible site-specific integration
vectors for mycobacteria. Microbiology 153:27112723.
146. Freitas-Vieira A, Anes E, Moniz-Pereira J. 1998. The
site-specific recombination locus of mycobacteriophage
Ms6 determines DNA integration at the tRNA(Ala)
gene of Mycobacterium spp. Microbiology 144:
33973406.
147. Kim AI, Ghosh P, Aaron MA, Bibb LA, Jain S, Hatfull
GF. 2003. Mycobacteriophage Bxb1 integrates into the
Mycobacterium smegmatis groEL1 gene. Mol Microbiol
50:463473.
148. Smith MC, Brown WR, McEwan AR, Rowley PA.
2010. Site-specific recombination by phiC31 integrase
and other large serine recombinases. Biochem Soc Trans
38:388394.
149. Ojha A, Anand M, Bhatt A, Kremer L, Jacobs WR Jr,
Hatfull GF. 2005. GroEL1: a dedicated chaperone involved in mycolic acid biosynthesis during biofilm formation in mycobacteria. Cell 123:861873.
150. Ojha AK, Trivelli X, Guerardel Y, Kremer L, Hatfull
GF. 2010. Enzymatic hydrolysis of trehalose dimycolate
releases free mycolic acids during mycobacterial growth
in biofilms. J Biol Chem 285:1738017389.
151. Pena CE, Lee MH, Pedulla ML, Hatfull GF. 1997.
Characterization of the mycobacteriophage L5 attachment site, attP. J Mol Biol 266:7692.
116
152. Pena CE, Stoner JE, Hatfull GF. 1996. Positions of
strand exchange in mycobacteriophage L5 integration
and characterization of the attB site. J Bacteriol 178:
55335536.
153. Lewis JA, Hatfull GF. 2001. Control of directionality in
integrase-mediated recombination: examination of recombination directionality factors (RDFs) including Xis
and Cox proteins. Nucleic Acids Res 29:22052216.
154. Lewis JA, Hatfull GF. 2000. Identification and characterization of mycobacteriophage L5 excisionase. Mol
Microbiol 35:350360.
155. Lewis JA, Hatfull GF. 2003. Control of directionality in
L5 integrase-mediated site-specific recombination. J
Mol Biol 326:805821.
156. Pedulla ML, Lee MH, Lever DC, Hatfull GF. 1996. A
novel host factor for integration of mycobacteriophage
L5. Proc Natl Acad Sci USA 93:1541115416.
157. Pena CEA, Kahlenberg JM, Hatfull GF. 1999.
Protein-DNA complexes in mycobacteriophage L5 integrative recombination. J Bacteriol 181:454461.
158. Ghosh P, Kim AI, Hatfull GF. 2003. The orientation of
mycobacteriophage Bxb1 integration is solely dependent on the central dinucleotide of attP and attB. Mol
Cell 12:11011111.
159. Ghosh P, Pannunzio NR, Hatfull GF. 2005. Synapsis in
phage Bxb1 integration: selection mechanism for the
correct pair of recombination sites. J Mol Biol 349:
331348.
160. Ghosh P, Bibb LA, Hatfull GF. 2008. Two-step site selection for serine-integrase-mediated excision: DNA-directed
integrase conformation and central dinucleotide proofreading. Proc Natl Acad Sci USA 105:32383243.
161. Ghosh P, Wasil LR, Hatfull GF. 2006. Control of phage
Bxb1 excision by a novel recombination directionality
factor. PLoS Biol 4:e186.
162. Savinov A, Pan J, Ghosh P, Hatfull GF. 2012. The Bxb1
gp47 recombination directionality factor is required not
only for prophage excision, but also for phage DNA
replication. Gene 495:4248.
163. Nesbit CE, Levin ME, Donnelly-Wu MK, Hatfull GF.
1995. Transcriptional regulation of repressor synthesis
in mycobacteriophage L5. Mol Microbiol 17:
10451056.
164. Broussard GW, Oldfield LM, Villanueva VM, Lunt BL,
Shine EE, Hatfull GF. 2013. Integration-dependent bacteriophage immunity provides insights into the evolution of genetic switches. Mol Cell 49:237248.
165. Sarkis GJ, Jacobs WR Jr, Hatfull GF. 1995. L5 luciferase reporter mycobacteriophages: a sensitive tool for the
detection and assay of live mycobacteria. Mol
Microbiol 15:10551067.
166. Garcia M, Pimentel M, Moniz-Pereira J. 2002. Expression of mycobacteriophage Ms6 lysis genes is driven by
two sigma(70)-like promoters and is dependent on a
transcription termination signal present in the leader
RNA. J Bacteriol 184:30343043.
167. Rybniker J, Nowag A, van Gumpel E, Nissen N,
Robinson N, Plum G, Hartmann P. 2010. Insights into
the function of the WhiB-like protein of mycobac-
168.
169.
170.
171.
172.
173.
174.
175.
176.
177.
178.
179.
180.
181.
182.
183.
184.
185.
186.
187.
188.
189.
190.
191.
192.
193.
194.
195.
196.
117
197.
198.
199.
200.
201.
202.
203.
204.
205.
206.
207.
208.
209.
210.
118
211.
212.
213.
214.
215.
216.
217.
218.
219.
220.
221.
222.
223.
224.
225.
237.
238.
239.
240.
241.
119
242.
243.
244.
245.
246.