Beruflich Dokumente
Kultur Dokumente
ABSTRACT
The tumor suppressor p53 is a sequence-specific
transcription factor, which regulates the expression
of target genes involved in different stress responses.
To understand p53s essential transcriptional functions, unbiased analysis of its DNA-binding repertoire
is pivotal. In a genome-wide tiling ChIP-on-chip
approach, we have identified and characterized
1546 binding sites of p53 upon Actinomycin D
treatment. Among those binding sites were known
as well as novel p53 target sites, which included
regulatory regions of potentially novel transcripts.
Using this collection of genome-wide binding sites, a
new high-confidence algorithm was developed,
p53scan, to identify the p53 consensus-binding
motif. Strikingly, this motif was present in the majority
of all bound sequences with 83% of all binding sites
containing the motif. In the surrounding sequences of
the binding sites, several motifs for potential regulatory cobinders were identified. Finally, we show that
the majority of the genome-wide p53 target sites can
also be bound by overexpressed p63 and p73 in vivo,
suggesting that they can possibly play an important
role at p53 binding sites. This emphasizes the
possible interplay of p53 and its family members in
the context of target gene binding. Our study greatly
expands the known, experimentally validated p53
binding site repertoire and serves as a valuable
knowledgebase for future research.
INTRODUCTION
The tumor suppressor gene p53 is the most frequently
mutated gene in human cancers (1). It can be activated by
*To whom correspondence should be addressed. Tel: +31 24 3610541; Fax: +31 24 3610520; Email: m.lohrum@ncmls.ru.nl
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors
2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/
by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Peak detection
200
120
Counts
120
80
0
500
5.000
400
00
00
00
37
02
00
00
99
36
36
96
00
00
00
36
93
00
00
00
00
90
36
87
00
36
00
00
00
36
78
00
00
75
00
36
72
00
36
69
00
36
66
63
36
36
600
36
60
00
00
00
PI
U2OS + Act.D
00
PI
U2OS
104
00
103
00
102
84
101
00
100
36
104
81
103
36
102
00
101
00
chr6
0.000
300
3.500
200
1.000
100
0.000
1.000
p21
00
76
00
00
00
75
74
36
36
00
00
GADD45A
36
100
Fold enrichment
U2OS
+ Act.D
U2OS
40
40
Counts
B
Sub-G1: 12.1%
G1: 34.2%
S: 4.2%
G2: 49.0%
160
Sub-G1: 6.7%
G1: 53.5%
S: 12.5%
G2: 26.8%
160
80
200
E
5.000
0.000
chr6
p53 ChIP Replicate 1
3.500
5.000
This study
1473
0.000
3.500
5.000
0.000
73
33
PET5+
Wei et al.
3.500
1.000
0.000
p21
1.000
Figure 1. Genome-wide identication of p53-binding sites using ChIP-on-chip. (A) Representative cell cycle prole of U2OS cells untreated or treated
for 24 h with 5 nM Actinomycin D. (B) Western blot showing p53 expression levels of U2OS cells, untreated or treated for 24 h with 5 nM
Actinomycin D. (C) ChIP enrichment (fold over negative control, myoglobin) of p53 at the p21 promoter and the intronic binding site of GADD45A.
(D) ChIP-on-chip prole of p53 binding to chromosome 6 visualized using Signalmap (NimbleGen Inc.). Shown are the log2 ratio of ChIP/Total
signal derived from the genome-wide tiling data and a zoomed-in view of binding to the p21 promoter for all three biological replicates. Genes are
represented by thick horizontal bars (plus-strand positive, minus-strand negative). (E) Overlap of the ChIP-on-chip derived p53-binding sites with
PET5+ data from Wei et al.
was performed by quantitative PCR with three independent ChIP experiments. In total, 50 potential binding sites
were randomly selected and tested. This resulted in a
conrmation of 48 out of 50 sites. We conclude that we
identied 1546 genome-wide p53-binding sites with a false
positive rate of 4%.
Since several studies, using dierent ChIP-based techniques, have identied binding sites for p53 in various
cell systems (3538,62), we compared these to the
collection of our binding sites (Table 1). The overlap
between our data set and the PET5 cluster in the
ChIP-PET data set for p53 by Wei et al. (37) was 69%,
even though dierent cell lines and treatments were used
(Figure 1E). The extensive overlap with the highest ranked
targets of the ChIP-PET data (69%) and lower overlap
(17%) in the low ranked targets with ChIP-PET data is in
concordance with the study of Euskirchen et al. (63),
where they compared ChIP-sequencing with ChIP-on-chip
under the same conditions for STAT1 and also found the
most overlap in the highest ranked regions. From the
genes that were identied by a ChIP-based screen in yeast
(41), 50% are found in our study. Thus, even when using
Total number
Overlap with p53
of binding sites of this study
(1546 total)
PET2+
PET2+
with
p53PET
motif
PET3+
PET4+
PET5+
Low
Mid
High
5807
1773
542
383
301
262
327
169
106
113
34
8
48
37
38
170
111
73
3
1
1
15
16
26
p53
21%
genome wide
32%
16%
intragenic
39%
downstream 5kb
29%
13%
28%
3%
intergenic
16% *
3%
C 12
intragenic
.6
.8
r8
ch
8
.8
r8
ch
r8
ch
ch
r2
2.
1.
r2
.5
r8
ch
ch
.7
.1
AD
L3
pG
ch
00
00
99
91
90
00
80
98
98
00
70
91
91
98
76
10
5
.1
r9
ch
.1
r8
ch
r2
ch
r1
ch
.1
.1
Fold induction
15
1
ch
r9
.
76
0.000
2.000
Expression analysis
ch
r2
.
chr9.1
20
20
5.000
0.000
2.000
40
ch
r1
.
13
13
chr8.1
Targeted ChIP
60
11 0
00
0
1
13 200
76
0
13
00
00
10
76
13
5.000
r4
pr
D4
5A
om
6
.1
r4
.6
28
r6
1.
ch
r1
r1
ch
00
80
47
10
intergenic
00
00
10
47
79
0
00
77
47
chr2.1
0.000
2.000
0.000
2.000
Fold enrichment
10
47
10
5.000
78
0
55
00
0
89
14
89
54
00
00
14
00
53
89
14
chr1.1
ch
r8
.
5.000
52
51
89
14
14
89
00
ChIP-on-chip
91
8.
26
ch
ch
ch
r1
r4
1.
.7
27
7
r1
ch
r6
ch
5.
.6
.1
8
ch
ch
r8
7.
.5
r1
r3
p2
ch
0
1
0
5
ch
intergenic
r3
ch
fold RLU
B 6
Figure 2. Characterization of identied p53 binding sites. (A) Distribution of the p53-binding site location relative to Ensembl genes (upper panel)
compared to the genome-wide distribution (lower panel). Locations of binding sites are divided in TSS anking region (5 kb upstream of TSS + rst
exon + rst intron), intragenic region (all introns and exons except rst), 5 kb downstream (5 kb downstream of last exon), 525 kb up- or downstream or intergenic regions (everything else). The asterisk represents signicant enrichment. (B) Expression change of genes which have a p53
binding site in the TSS anking region. The expression change is shown after 24 h of 5 nM Actinomycin D treatment in fold over untreated U2OS
cells. Error bars represent standard deviation of three independent experiments. (C) Transactivation assay of intronic (yellow) and intergenic (blue)
p53-binding sites. The relative luciferase activity is plotted in fold over empty vector. Error bars represent standard deviations of three independent
biological replicates. (D) Binding prole and expression change of four ESTs bound by p53. In the upper panel, the ChIP-on-chip data is visualized.
In the lower left panel, these binding sites are conrmed by targeted ChIP for p53. Shown is the enrichment in fold over negative control
(myoglobin). In the lower right panel, the expression change of the EST is shown after 24 h of 5 nM Actinomycin D treatment in fold over untreated
U2OS cells. Error bars represent standard deviation of three independent experiments.
150
*
*
*
* *
* *
15
10
5
0
ke res
p p
d po
re ns pho rot rote
ce e
tra
sp ein in
pt to
ns
or D hor mo bind
m
pr NA us di in
em
ot d m fic g
tra b
ei a e at
ns ra
n m t a io
m ne
si ag bo n
gn e li
em re
al s sm
br ce
in t im
an pt
g u
o
e
p
re r pr tra de ath lus
ce ot n v
w
pt ein sfe elo ay
or
ra pm
p
pr ho se e
o s
a n
bi tein pha cti t
op k ta vi
bi ol in se ty
op ym as
ol e e act
ym r m ac .
er et tivi
m ab ty
D
N odi olis
f m
A
m icat
et io
ab n
D ol
N is
t ra A r m
e
ub nsc pa
iq rip ir
ui ti
tin on
c
tra ycl
ns e
a p
si ce po ort
es
gn ll pt
o
ta
a
l ad s
bl
ce tra he is
is
hm
ll ns sio
di d n
en
ffe uc
t/m reg
re t io
nt n
ai ula
ia
nt ti
t io
en on
an o gr n
f
ce ce ow
of ll c th
ch yc
ro le
m
at
in
um A
x
si on
gn g
al uid
R
e g MA
in a
ul P Fo g p nc
e
at K
io si cal ath
n gn a w
of a dh ay
ac lin es
tin g p io
cy ath n
to w
sk ay
el
et
on
50
20
ci
100
number of genes
200
25
al
number of genes
250
KEGG pathway
5.000
ChIP-on-chip
SEMA3C
5.000
0.000
2.000
0.000
2.000
0.000
2.000
1.000
1.000
1.000
0.000
0.000
0.000
1.000
1.000
1.000
00
00
48
83
47
60
00
83
47
20
00
83
46
80
00
46
83
5.000
83
40
00
0
00
84
11
58
58
83
00
0
00
SEMA6A
11
58
82
0
11
00
0
58
81
00
0
11
58
11
80
80
18
40
00
80
18
50
00
80
18
60
00
80
18
70
00
80
18
80
00
en
zy
lin
Gene Ontology
SEMA3A
Targeted ChIP
Fold enrichment
50
40
38
30
20
8
SEMA6A
SEMA3A
10
0
SEMA3C
Figure 3. Functional annotation of identied p53-binding sites. (A) All Ensembl genes within 5 kb of an identied p53-binding site were annotated
according to the Gene Ontology (GO) using Ontologizer. Signicantly enriched groups (P < 0.05) are indicated with an asterisk. Shown is a selection
of GO categories. (B) All Ensembl genes within 5 kb of an identied p53-binding site were annotated according to Kyoto Encyclopedia of Genes and
Genomes (KEGG) using FatiGO+. Signicantly enriched pathways (P < 0.05) are indicated with an asterisk. Shown are the ve pathways with the
highest number of p53-bound genes. (C) Binding prole of p53 target genes involved in the axon guidance pathway. In the upper panel the ChIP-onchip data is visualized. In the lower panel these binding sites are conrmed by targeted ChIP for p53. Shown is the enrichment in fold over negative
control (myoglobin).
20
19
18
17
16
15
14
13
12
11
10
bits
p53 ChIP-on-chip
p53 ChIP-PET 3+
Sensitivity
AUC
F-measure
MNCP
AUC
F-measure
8.67
5.62
7.35
0.90
0.83
0.82
0.86
0.76
0.82
3.97
3.37
3.49
0.93
0.86
0.87
0.91
0.81
0.81
0.8
0.6
0.4
p53scan
p53MH
Match
0.2
0.05
0.1
0.15
0.2
1-Specificity
C 100%
% of sequences
p53scan
p53MH
Match
MNCP
80%
60%
40%
20%
0%
0
5 6 7 8 9 10 11 12 13
spacer length
A 18%
16%
% of peaks
14%
12%
10%
8%
6%
4%
2%
0%
250
0
distance p53 motif to peak centre (bp)
250
B 100%
80%
60%
40%
20%
0%
low
medium
high
4
3
log2 ChIP/Total
0
1
2
3
4
5
6
number of mismatches to motif
Motif
2
19.77
2.11e-21
576
Significance
KLF/PAX4/
Sp1
P-value
Number
of
sequences
bits
Factor
weblogo.berkeley.edu
18.37
5.34e-20
96
Sp1
bits
weblogo.berkeley.edu
7.34
5.78e-09
681
bHLH
bits
weblogo.berkeley.edu
12
11
10
4.73
2.31e-06
14
AP2
bits
weblogo.berkeley.edu
3.58
3.32e-05
576
MZF1
bits
weblogo.berkeley.edu
11
10
3.06
1.09e-04
72
CP2
bits
weblogo.berkeley.edu
14
13
12
11
10
2.72
2.38e-04
33
ETS2
bits
weblogo.berkeley.edu
0.86
1.74e-02
269
AP1
bits
weblogo.berkeley.edu
Figure 6. Overrepresented transcription factor binding motifs in the vicinity of identied p53-binding sites. Known transcription factors motifs
identied in the 500 bp region centered around p53-binding sites based on TRANSFAC 6 database. Statistical signicance (P < 0.05) was calculated
using TAMO. Motifs were visualized using WebLogo.
cooperation of Sp1 or a Sp1-like factor for the transcriptional activation of the human BAX promoter (69) as well
as for p21 (70). To nd out if the potentially cobinding
transcription factors might inuence p53-transcriptional
activity towards a specic direction of the response
pathway, we looked at the dierent subsets of the p53binding sites containing a specic motif. We analyzed the
potential biological signicance of the genes, which are
within 5 kb of these binding sites using GO annotations, as
described above. Three signicantly enriched GO categories were found for the cobinding factors: developmental, metabolic and cellcell signaling pathways
(Table 3). In all three pathways p53 has been reported
to play a role. Our data thus supports the notion that the
response pathways of p53 might be inuenced by the
identied potential cobinding transcription factors.
p63 and p73 binding to p53 targets
The p53 family members p63 and p73 have been reported
to contribute to the p53 stress response in certain tissues
p63
0.04
0.04
0.05
DISCUSSION
Genome-wide identification of p53-binding sites
To characterize the transcriptional mechanisms of the
p53-mediated stress response, we analyzed p53 binding to
shared
B
2
13
14
15
16
17
18
19
20
14
15
16
17
18
19
20
12
13
11
10
p53 only
2
12
11
10
CP2
AP1
p73
0.04
bHLH
0.03
bHLH
GO:0007275 Multicellular
organismal development
GO:0007275 Multicellular organismal
development
GO:0019219 Regulation of nucleobase, nucleoside,
nucleotide and nucleic acid
metabolic process
GO:0007267 Cellcell signaling
GO:0007275 Multicellular organismal
development
P-value
bits
MZF1
GO description
p53
28%
bits
TRANSFAC GO term
Figure 7. p63 and p73 binding to p53 targets. (A) Overlap of p53binding sites also bound by p63 and p73 in Saos2 inducible cell lines as
determined by ChIP-on-chip on the dedicated p53 array. (B) p53 motif
identied with p53scan, visualized using WebLogo for p53-binding sites
also bound by p63 or p73 (upper panel) and binding sites only bound
by p53 (lower panel).
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
ACKNOWLEDGEMENTS
We thank Dr K. Vousden for p53-constructs; Dr H. van
Bokhoven for p63-constructs and Drs Shinji Takagi and
G. Melino for p73-constructs. We thank Colin Logie, Gert
Jan Veenstra and Willem Welboren for helpful discussions
and critical reading of the article. Funding for this work
was provided by Dutch Cancer Foundation (KWF)
(KUN 2003-2926 to L.S., S.D. and M.K., KUN 20053347 to L.S.); Netherlands Organisation for Scientic
Research (NWO) (Vidi 846.05.002 to S.J.vH., S.J.J.B. and
M.L.); EPIgenetic TReatment Of Neoplastic disease
(EPITRON) (LSH-2004-2.2.0-2 to MAvD); X-TRANET (LSHG-CT-2005-018882 to M.A.vD.). Funding to
pay the Open Access publication charges for this article
was provided by KWF and NWO.
Conict of interest statement. None declared.
REFERENCES
1. Hollstein,M., Sidransky,D., Vogelstein,B. and Harris,C.C. (1991)
p53 mutations in human cancers. Science, 253, 4953.
2. Kern,S.E., Kinzler,K.W., Bruskin,A., Jarosz,D., Friedman,P.,
Prives,C. and Vogelstein,B. (1991) Identication of p53 as a
sequence-specic DNA-binding protein. Science, 252, 17081711.
3. Vogelstein,B., Lane,D. and Levine,A.J. (2000) Surng the p53
network. Nature, 408, 307310.
4. Laptenko,O. and Prives,C. (2006) Transcriptional regulation by
p53: one protein, many possibilities. Cell Death Dier., 13, 951961.
5. Erster,S. and Moll,U.M. (2005) Stress-induced p53 runs a
transcription-independent death program. Biochem. Biophys. Res.
Commun., 331, 843850.