Beruflich Dokumente
Kultur Dokumente
GSQLPCEPEPDvaVlt.SMLtDPSHITAEAAgRRLaRGSpP
3.2 Mapping of the Interacting HCV
SmASSsASQL
NS5A Segments
where a capital letter indicates that the position is
To simplify the study of the interactions between HCV fully conserved, and a lowercase letter denotes that the
NS5A and human proteins, we mapped all the interacting particular residue is predominant at that position. The
segments of the five NS5A protein variants to NP 751927.1, sequence logo and structure of this sequence motif are
which is the representative protein of HCV NS5A pro- illustrated in Figure 1. The highly conserved sequence
tein variants. The two segments 1973-2272 and 2272-2419 segments and sequence motif are in good agreement.
of NS5A CAB46677.1, for example, share high sequence Of the three highly conserved segments, two segments
identities of 95.11% and 96.35% with sequence segments (G185 -D196 and R216 -L239 ) match perfectly the sequence
1-300 and 300-438 of NP 751927.1, respectively. This sug- motif G185 -L234 . The other sequence segment S414 -S437 ,
gests that the similar sequence segments may share similar according to an earlier report [3], lies in the segment of
3D structures and functions, so we consider the sequence aa 412-477, which is essential to NS5A binding ability.
segment 1-438 of NP 751927.1 as a substitute for the bind- Hence, it can be concluded that the three highly conserved
ing segment 1973-2419 of CAB46677.1 for further study. sequence segments are important for the interaction of
Whereas, the other three sequence segments 1-173, 1-234, HCV NS5A with other proteins and that the sequence motif
and 277-447 of NP 751927.1 belong to the HCV E1 or E2 plays an important role in the interaction of HCV NS5A
envelope protein variants. Hence, these 3 sequence seg- and human proteins.
ments were excluded from further study. In this manner, The NS5A sequence motif identified in this study is also
we could map all interacting segments to NP 751927.1 vi- consistent with the experimental results of previous stud-
ral regulatory protein (Table 2). ies [2, 9]. Evans et al. [2] reported that the sequence seg-
886
in Table 1 to homologous proteins in PDB. Only 12 hu-
man proteins out of 32 have homologous human proteins
with known structures found in PDB. The remainder of this
paper discusses the interactions between the 12 human pro-
teins and HCV NS5A viral regulatory protein.
887
the 500 random sequences. Hence, the probability of find- ACKNOWLEDGEMENTS
ing the binding motif of HCV NS5A is as low as 0.002,
suggesting that the motif is hard to find by chance. This work was supported by the Korea Research Foun-
In the same way, we generated a new set of 500 random dation Grant funded by the Korean Government (KRF-
sequences of equal length as the average human proteins 2006-D00038) and by KOSEF through the Systems Bio-
interacting with HCV NS5A (99 aa). We also predicted Dynamics Research Center.
secondary structures of the random sequences using GOR
[11]. In the 500 random sequences, there were 44 hits which
References
match the binding motif both in sequences and structure pat-
terns. Any of the 4 structure patterns, full helix (H), full
extended strand (E), full loop (L), and extended strand-loop [1] E.C. H SU , B. H SI , M. H IROTA -T SUCHIHARA ,
(EL), was allowed. Hence, the probability of finding the J. RULAN , C. I ORIO , F. S ARANGI , J. D IAO ,
binding motif of human proteins is as low as 0.096, again G. M IGLIACCIO , D.L. T YRRELL , N. K NETE -
MAN , C.D. R ICHARDSON , Nature Biotechnology,
suggesting that the motif is hard to find by chance.
We also attempted to find the proposed motif in 21, 2003, pp. 519-525.
three human proteins, which were not included in the [2] M.J. E VANS , C.M. R ICE , S.P. G OFF, Proc. Natl.
initial 12 human proteins used for deriving the bind- Acad. Sci. USA, 101(35), 2004, pp. 13038-13043.
ing motif: NP 036289.2 (F-box and leucine-rich re-
peat protein 2), NP 002101.2 (tyrosine protein kinase [3] Y. S HIROTA , H. L UO , W. Q IN , S. K ANEKO , T. YA -
HCK), and NP 003185.1 (TATA box Binding Protein). MASHITA , K. KOBAYASHI , S. M URAKAMI , J. Biol.
We found 4 occurrences of the motif in NP 036289.2 Chem., 277, 2002, pp. 11149-11155.
(L148 kgisEG154 , L262 qilEA267 , L288 ekmdlEE295 , and
L359 ehlEN364 ), and only one occurrence in the other hu- [4] C. L IN , J.W. W U , K. H SIAO , M.S.S. S U, Journal
man proteins each (L126 lrrEG131 in NP 002101.2 and of Virology, 71, 1997, pp. 6465-6471.
L49 sileEQ55 in NP 003185.1). All these motifs were found [5] C. A LFARANO , C.E. A NDRADE , K. A NTHONY,
within the binding segments of the human proteins. This N. BAHROOS , M. BAJEC, Nucleic Acids Research,
suggests that the proposed motif is in a good agreement with 33, 2005, pp. D418-D424.
the experimental data.
[6] F. P UIG -BASAGOITI , X. F ORNS , I. F URCIC ,
4 CONCLUSIONS S. A MPURDANES , M. G IMENEZ -BARCONS ,
S. F RANCO , J.M. S ANCHEZ -TAPIAS , J.C. S AIZ, J.
Gen. Virol., 86, 2005, pp. 1067-1075.
Based on the analysis of the interactions between HCV
NS5A and human proteins, we identified three highly [7] G.E. C ROOKS , G. H ON , J.M. C HANDONIA ,
conserved sequence segments (G185 -D196 , R216 -L239 , and S.E. B RENNER, Genome Reserach, 14, 2004,
S414 -S437 ) and one binding motif (G185 -L234 ) in HCV pp. 1188-1190.
NS5A protein with NCBI ID 26053629. In addition to
the conserved segments, a sequence motif L-X(3,5)-E- [8] I. J ONASSEN , J.F. C OLLINS , D.G. H IGGINS, Pro-
[AEGNQST] and its structure were identified in the bind- tein Science, 4, 1995, pp. 1587-1595.
ing sites of human proteins. The binding sites of human [9] K.J. PARK , S.H. C HOI , S.Y. L EE , S.B. H WANG ,
proteins often form a full helix (H) or an extended strand- M.M. L AI, J. Biol. Chem., 277(15), 2002,
loop (EL) structure. Statistical analysis of the binding mo- pp. 13122-13128.
tifs, G185 -L234 in HCV NS5A proteins and L-X(3,5)-E-
[AEGNQST] in human proteins, shows that the binding mo- [10] T.K. J ENSSEN , A. LæGREID , J. KOMOROWSKI ,
tifs are statistically significant and hard to find by chance in E. H OVIG, Nature Genetics, 28, 2001, pp. 21-28.
random sequences. The binding motif and its exact role
in the interactions between NS5A and human proteins re- [11] C. C OMBET, C. B LANCHET, C. G EOURJON ,
main to be verified by biochemical experiments. However, G. D ELEAGE, Trends in Biochemical Sciences,
the observations reported in this study should have valu- 25(3), 2000, pp. 147-150.
able implications for the design of site-directed mutagenesis
experiments targeting HCV NS5A-human protein interac-
tions. In the future, we will expand this analysis method to
the interactions between other HCV nonstructural proteins
and human proteins.
888
Table 1. Interactions between HCV NS5A viral regulatory proteins and human proteins, extracted
from the BOND database
HCV NS5A proteins Human Proteins
NCBI Interacting NCBI Interacting
Interaction Descriptions Accession Segments GenInfo ID Segments
NS5A interacts with Grb2 CAB46677.1 277-447 4504111 –
NS5A interacts with Amph II2 CAB46677.1 – 21536411 393-482
NS5A interacts with Amph II2 short CAB46677.1 1-173 21536415 –
NS5A interacts with hVAP-A CAB46677.1 1-173 37588850 –
ApoE interacts with NS5A from HCV type 1b isolate Con1 CAB46677.1 – 4557325 –
NS5A interacts with FBL2 CAB46677.1 1-234 16306582 67-423
NS5A interacts with AHNAK CAB46677.1 – 535177 71-293
NS5A interacts with SFRP4 CAB46677.1 – 4506895 69-315
NS5A interacts with NDRG1 CAB46677.1 1973-2272 37655183 123-261
NS5A interacts with MGP CAB46677.1 1973-2272 13528966 1-103
NS5A interacts with CRABP-1 CAB46677.1 2272-2419 4758052 45-137
NS5A interacts with FTH1 heavy polypeptide 1 CAB46677.1 2272-2419 56682959 31-183
NS5A interacts with Translokin CAB46677.1 2272-2419 29469444 57-500
NS5A interacts with MGC2574 CAB46677.1 2272-2419 13129104 4-294
NS5A interacts with TACSTD2 CAB46677.1 2272-2419 14495611 –
NS5A interacts with Pl4K CAB46677.1 2272-2419 17105400 1799-1916
NS5A interacts with PTMA CAB46677.1 2272-2419 21359860 –
NS5A interacts with CENT-δ-2 CAB46677.1 2272-2419 16975484 850-1181
NS5A interacts with C9 ORF6 CAB46677.1 2272-2419 8923431 26-181
HCV NS5A interacts with apolipoprotein apoA1 NP 751927.1 – 4557321 1-167
HCV nonstructural 5A interacts with karyopherin beta 3 NP 751927.1 1973-2172 24797086 1007-1097
NS5A interacts with Hck NP 751927.1 – 30795229 82-135
NS5A interacts with Lck NP 751927.1 – 20428652 66-118
NS5A interacts with Lyn (Src-family protein tyrosine kinase) NP 751927.1 – 4505055 67-120
NS5A interacts with Fyn (Protein-tyrosine kinase fyn isoform) NP 751927.1 – 4503823 87-140
NS5A interacts with hVAP-33 NP 751927.1 – 37588850 156-242
NS5A interacts with SRCAP (Snf2-Related CBP Activator) NP 751927.1 – 5730067 –
NS5A interacts with p53 (Tumor protein p53) NP 751927.1 – 8400738 1-10 and 34-393
NS5A interacts with TRAF2 NP 751927.1 148-337 22027612 1-358
1-301 314-501
Human Vesicle-Associated Pro. (hVAP-33) interacts with NS5A NP 751927.1 – 36588850 1-48
p53 interacts with the Hepatitis C virus protein NS5A NP 751927.1 – 8400738 –
hTAFII32 interacts with the Hepatitis C virus protein NS5A NP 751927.1 – 4507351 1098-1620
NS5A interacts with Jak1 NP 751927.1 – 4504803 –
NS5A interacts with FBL2 NP 751927.1 – 16306582 –
NS5A interacts with La (autoimmune RNA-binding protein) NP 671491.1 1983-2025 10835067 –
1973-2050
Apolipoprotein E precursor (ApoE) interacts with NS5A NP 671491.1 1983-2025 4557325 314-501
NS5A interacts with TBP (TATA box Binding Protein) AAB87527.1 1-101 4507379 1-281
NS5A interacts with p53 (Tumor protein) AAB87527.1 – 8400738 –
Hepatitis C virus NS5A interacts with apolipoprotein apoA1 AAQ86584.1 1-224 4557321 1-167
889
Table 2. Interacting segments of HCV NS5A proteins. The last column shows the interacting seg-
ments mapped to NS5A NP 751927.1.
Fig. 7. Diagram of sequence motif structure of human proteins (PDB code: 2A01, 2CBR, 2FHA, 1YCS, 1W1F, and 1GRI) interacting with NS5A.
Figure 2. Structure of binding motifs of 6 human proteins (PDB codes: 2A01, 2CBR, 2FHA, 1YCS,
1W1F, and et1GRI) interacting
al., 2006). with HCV
But the biggest stumbling NS5A
block in protein. R:L , and S:S ) and one sequence motif ( G-L )
HCV research 216 239 414 437 185 234
is the lack of a cell culture and small animal model for studying were identified within the primary sequence of NS5A with NCBI ID
viral replication and pathogenesis, which are expensive and time 26053629. These residue fragments may play a critical role for HCV
consuming. Hence, we focused on the residue information of NS5A interacting with other proteins or with other parts of HCV
interaction between HCV NS5A and human proteins and tried to protein. Whether or not these analysis and discussion accurately
identify functional important residue segments. This may provide us depict the interactions between NS5A and human proteins remains
much useful residue-level information to depict HCV-human protein to be verified by experiments. The observations presented in the
interaction, as well as guides for future experiment design. 890
present paper have potential implications for the design of site-
HCV NS5A is a viral regulatory protein that modulates viral RNA directed mutagenesis experiments targeting HCV NS5A-human
replication and host process by interacting directly and indirectly protein interactions. Next step, we will expand this analysis method
with a variety of host regulatory factors. The important role of NS5A to the interactions between other HCV nonstructural proteins and
in HCV RNA replication and its interaction with other proteins human complexes.
has been clearly demonstrated by high or clustered incidence of
adaptive systems, although the molecular mechanism involved