Sie sind auf Seite 1von 18

A = promotor

1 accattagaa aatcacgaaa agagctatca acatggccta aagtgaccaa tgaattggag


61 tttatttcac ggcgttgtga aatatgctcc aactttaacc caaaaaccca cacacactca
121 ttgccaatgt attaatgttt ttttttattt ttatttatac ttgggggaga tgtgtatttt
181 tttagtgggg cgaatgttct agaaagctct ttaaagtacc tcgatttgtg tgggaatcta
241 acgctcgttg tttgcccctt cctcgcaata gttgtttgtg aatatgttgc gatgtatacg
301 tatccacgat ttcagtgcat cttcactacc aaatccaaga agaagaaact taaaattcat
361 caatggaaaa gtttttgcaa tccccctgtc tcgctcactt gctctggccc cctcaatcgc
421 cgctctcttt ccgccgcaag gcgcacaaca agtggcaaac cgaagcggcg gcgtcgtctt
481 caaaccagtt ttactgtagc gttgttgttg ctgcttcagc ttacactgaa aaaataacgc
541 agtcaataac gcaaattagt ccatagcttg ttggtaccat ttcgtttgta tattttataa
601 gcacattttg taaacaggaa ttaatggaga atacgcaaca caaatctaag gcacgatttc
661 taaaaaagtt taaatgatgt ggtgattaat gtttgtttat ttatcacgac acttaaattg
721 ttctctgtgt aattctgcgg ctgctgcgtc tccgcgataa ttggtgcgga ggttgcccct
781 tcagccctcc ccccaaactc ctccatctac cattccatct accgttccgc cgatcagtat
841 ttgcttttat gggcaaaagt tcttcaatga gtggtcacag catttgtgaa gggagggggg
901 cggagggggc tgcggtacgt gcatgcaatt ccttattggc gacaatgttg aaatgtgtgc
961 gcctgtttga aggaaaacac cagagatcaa accaatagct ttaaaacttt gattgccttt
1021 tgattaaagc catgtggaac ttagtgccat caccgttatt gggcagattc taattggttt
1081 tccaagtaag atgctgaaaa tgcaattttt tctcgtaacg ttaatcggtg aacagggaat
1141 ccccctcatt gatctcccag cgcagtggag ccaaagtcct aagttttttc agcatggatg
1201 acagtgccta ttagtgtgac cgctggtgtt tgtggtggct tgggccaaaa gtgaattatt
1261 catgttgttg ctgctgtcgc tctgccagtg gaaaattccc aaaaaaaatt cgcatgaact
1321 gcgccggctg ctcctccgct cctccgctgc cccacatgct tctccttcca ctcgttcttc
1381 ttcttctttg agtgcccccc aaaggcaaaa agtcggagtc gaagttactt ggtgtgttag
1441 ttgttgttgc tcttgcccac tcattcgctt cggttttttt ttattttctt ttcttgtctt
1501 taacattttt tctttttata aaaatagcac agacgaagaa atatttgcac acacacacac
1561 acaccacgca cacacgtgta ttgtagcatt tgctaatttt cttcgtttgt ttcgcttggc
1621 gttttttgtt tgttgattcg cctattgcct ccgtcttccc tctcgctctc cgccactttt
1681 atattttcaa agacagcaat ttgtatcttt agtgcgggga tcacttttct ctcttaaacc
1741 cgactttctc acgctctctt ccggcgctct ttcgtctcct tatccaccac ccacccacgc
1801 cctaaatgtg tcaatggttc aaggcggacg aatgaataat caaagaagaa ttaactcgga
1861 gcttttcaca ttaaaaagct tctcaattgt ggagctttgc tttcgtcaga cttttttcgg
1921 gggccaaaac tgcatgacgt gccgccaaaa tcaataaaca tagtattgtt tcgcactttg
1981 tgtatacata ctatgtactt tcacacttgt tcttcccatc caatttatgc aagatcatat
2041 gcttagttct tagtactgcc aaatccttac aaaaaaaagt cgggtttaat gcggctgtca
2101 ccgcctctta tctttcattt ccctccaata tgcctcccac ttgcaggcag ttgactttgt
2161 ttcactggaa gccacgcgtt gcttgtcgat ttgatcctct tataaatatt tgcacgtctc
2221 gatttacatt gtttgcttgt tgtttttgta tgtatggcat aatgacgttt acgatgacgc
2281 gcttgaaacg actaaaagtc ggaattatat ggctggcccc ccgtcgattt ctcacgccca
2341 tttttggaag tgaatcgcgg ctttggcatt tttggtatcc agaaatgttg tcgccactgc
2401 aaatacgaac aaaatataca taataacatg cggaatctgg cccatggcgt cgtaaaccgg
2461 tgcaacctta atgacaaatt gcaaatgtga atagagtgcg tttcataatg cacttccggt
2521 cataaaatat aataattcct aaggcgaatt aattgtttat caatcaattg ttttgataca
2581 acaatttgtc aataaagaag cacacatggg caccaagaat ctgctgacgt tgtcgcattc
2641 ttgttatcga tcaattaacg tttttcactt caatttgcaa taaagtaatt cttacgataa
2701 taatgacagg gcaggacggg atggtacgta ttcgaatcgg taatatgagg caagggaaag
2761 gaagACGGgt aaaaataaag ttaacgatat ggcagtaaac aatgtgggtt ggacacaagc
2821 aagctgtcca agaagccact tgatgcgacc gacaacttgg gttataaaac ttcgacaaca
2881 gttttcacag cacgcgctgt tccattccgt tttattccgt tcgaaaattg ctttttcgct
2941 caattaatta acttattttt tattgcacac aattctatta aatgtgtgtt gtctgttgat
3001 tggattgctg ggattgtgtt gggagcatac aaatctattg gagaatgtgc cccccatcat
3061 gctggcacta aaatcaactt aatagctgca cacagcttgg gaacatgttc attgtgtgga
3121 tttcttcagg acttactctt cctgtatgac gggaattaac tcgtcgatat ccactcgtcg
3181 tttgttcagt aactgtcatc gggtggtggt ttcgattgaa ttgattgaca gcgagtgaca
3241 agttaaaact aatctttaaa ttgagttcta aacgaaaata ttgtcaataa cgtgaggagg
3301 gtggggaggg ggggttgctc gatgccgctt gcttggcttt attgtgcccg tttttgcgga
3361 aactcgtagc gcagcacgaa aaacgcaatt aatttgtgcg agaatgtgga aatggcggaa
3421 aagcatttgc ctttgtgacg caccgctcct ccttacattt tatttgtgca ggtgaatgga
3481 atggaaaaag gtctatactt ctatgactta agttgtgaga ttacattata tatatattat
3541 atggtgtact tgtatcgcat cgaatgcgct tttaaccttt gcggtttaga tttggctttg
3601 gccatttgct gttcttttgt tggtgggctg gctgtttggt ttgcgagtgg cgaggccagt
3661 aagccacatt gaaaccagtt gtgtggtgtt cgcaactacg ccgcccacca gcccctttac
3721 catcgttccc gccccccaat tggcaaggcc gattatcgac cgctgaccga aggctgcgaa
3781 gaaaaagcaa acaacgagcc gtttagaaag ttaccaactg tttaagttgc cttaattgat
3841 ctgttgcgtt tattcctcgt ttcattttgc tgtttcaggt taacggtcgt gcctggcctt
3901 ggctgcgttc tattccgctg tgccaacgca acgtttgaaa tttagccagc taaaaactgt
3961 gcatttaccc aactgtaagg atttgtattt ggtttttatt gcgcttaatt tgtttcgttg
4021 tggttatcac ctccgggttt aatttcactg tttcaggcta aatcacattt ggccttgact
4081 gtccgtgatc cattccgatt cgtttcgccc attccactta gcacattgcc tggcgtcaat
4141 taattgatca acagtaggaa cgttttgcat ccaaaaatag acgccaaata ttatcacttg
4201 ccttttgtct atttttaatg caaattgtgt gggcaggtaa tggggagaaa aggacgttta
4261 atcagcggcg cgtggcgtct aacgaaatta tcgcatgacc aactaaatcg gctgcccccc
4321 cccccctccc ctcccctcct caaccaccga ttacactgag atttttcgcc gctcgccaaa
4381 actttccagt ctgctgcaca acaatgtgaa acggaacacg aaacgttcct cctcttcttg
4441 attcacatat gtgtgcatat atagaccgac agacagacac aaacgtattt tctgcgtttc
4501 gttgcgaatc ttttggcgca attagcgtgc acaattagca catctctagt actcgtagcg
4561 ccgtcctttt ccttccagct cgcttgcgtg caaatttaat tagcggccca tgctcgacga
4621 gaaagagacg gcatgagatg ggaagtacta gatgaattat tggatggaaa tttaatggct
4681 tcgaggtgaa gtcttttatt ccacgactcc caattatcga ttaaaaagag aaaagaaaga
4741 cacacaaatt ttccactggg tcgagatttg caattagcac ttcgttattt atgcgccgaa
4801 ttcattttcc gccttgcaat ttacctacga caaatataaa catacagtat tgggactagc
4861 acttagcgtg aatccatcaa aaaatccgct tatcacttat cacgtgcttc cccaaacgcc
4921 cccgagtgtc gtctacgagt atttgcgatg tcgtttgtaa ctcgcagtaa atttcgacca
4981 gacttttgtg ggtctggtgg tccggtggca tctggtctga accgaatctg acgcaacttg
5041 acccaagttg cttggcatcg atcttccagt gcactagcac cttgggcgaa aagtgggtct
5101 tgctgtcgct gttggatttt gtctgtctgt ggggaaatct tatcgcacgc catatagagt
5161 tttgggctcg cctgccgtct ggtgtcatca agtcatcaag tgtctgcgcc acctaacaga
5221 tacgatgtgt aaaaaactac ttgagtggaa tcttggtaaa gtttatttac tatacccaga
5281 ataatattga tttggttttc ttgcttttct actctagtac tgcgttcatg caatgcacta
5341 tactcgcatt ttgtttcaat ttaagttcgc gcttttacga aatcatgaaa ccttactgca
5401 taaacgtaca tagcgcatga agtcggtgtg tcggcgaact ttgtaagggg ttatgtggct
5461 aacttctgtg ctttcgattt taaacaataa tgcggggaag cacaatgaaa atgaaacata
5521 acgagaaaca agcaacagaa acaaaaacaa aaacaacaca aaggcgtttc atcaagaaac
5581 aggggaagcg gcgagaaaga agcagaaact tcagcattcg cctgacgtaa atgctgccac
5641 gcattgcgtc attgtggggc tgaagtggga gcgagagggg ctagactgtg ggagtgagat
5701 ggttgcaact gcgtgagagg gagagcagct gtcgttcctt ttctcatgag agcatttgtc
5761 ggctttttca gctcattaat gattcgattt ttcttttata ttttgtggtt ttgtgtgtgt
5821 gtgccttttg tttttttttt gttttgctgt tggatgacac atggttcact tattcaacga
5881 ggggggcggg ggtgtgagcc aaaaaataaa aacaaaaaaa agtggagaga ttgtctataa
5941 ataaaccaac attttggtaa actttatgcg gtgcaagcgc ttttctctgt gataaactgt
6001 gcgtaatttt tgtgggaaaa atgggagaaa aatgacgaaa atctaaattt catggtgacc
6061 tgaatttttg agttcactat cctaacgatc gtgcttattg aattggctga tgcaattcat
6121 tgatggcttg caaatattat tattaagtta taagtttttt tttttttttt ttttttgtta
6181 ttgttttact tgactaatag ctataagggt ctttaaaagt ctgggttgaa ggcagttcgc
6241 tggcttactt ttgcatattt tttcttcttt gttttgtttc tcaggtttct tggctcgagt
6301 ttttgcattt gggggacaac ccaagtactt ggaactccac atttagtcag cgctcgttga
6361 atttgtcttg gtttttttcc ttcggtcgtc tttgaatggg taaaaatggg cagtgactaa
6421 gccaatgtca atttaagtgg ccacttgaag tagtagaatt gtttcgacaa aaatcactgt
6481 gttaagcaac ataacataat acattcaaac actctttcgc tttatgagtt gactcactgt
6541 gttttttgtt ttttatgcat attgtgtttc gttttatttt tacaatcgca gtcgtcgtga
6601 ctttcccata aattccaaaa aataaacaaa aatattttgt aatttcttat caatgtggaa
6661 aatgcttctt ctcgtcgctg ccttgtttat ttgtttacat taagcaacaa cccaacattg
6721 ttgttgtttt gtcatgtaat tatgataatc aagtgcagta aaaacgaaaa caaagaactt
6781 ggcaacaaga aggggttttt tttcgagggg gttttgggga ttggcatacg ttcgcatata
6841 catatgtatg gtgtgaggtt tttagagtgg attgccgctt atcatcgcaa gttaatctac
6901 tttcccagtt catatactct cgtacacgat catgtactcc tatatgttct atatgcatac
6961 ttaaaaacga cgggccatat agataggcga tacatgatcc gaggtgcttc aatataaata
7021 acccccagtc ccatttcact ctgtttcccg ttcccaatta acagggaaac gtacttagca
7081 caacacacac atgaccaaag ttcacttttg gctaagcgtt ttatattatt tttgattaat
7141 taggcaattt gcctgaagcg ttgttgttgc tactttttat ttgctgataa gattgggcca
7201 tgaattgcga catgcacgtg tggttttggc ataagtcgag ttcctcctgc ttccgcccgc
7261 ccaccagaac agaagcaaag ccactccaga atcggtgctt tggtttggcg acaacttgac
7321 attcaggcca gccacccacc atcccacctg gttggtcttt ggttttgttg gttcgtatcg
7381 tattgagtgc acgactaagg accgaaagtg aatttcaagt agaagggagt gggtttttaa
7441 gtggatgagt gacacgaata gcgttattat ttgattggtc atagtttcat ggagtggagt
7501 ggctctctga aactgtttgg tagtctagca acataatatg ggctcctcat ttttttagag
7561 cttatatcta ctatatagct tttctggaat ggaatcgaat actagaattg ttagctcttg
7621 tttgcatcag ggttctttgt gagtagtgct ctctttatat ttcctcgcct tgagttcaat
7681 ttattttttt ttttttgttt ccattgtttg gccgttatct cttgctgcaa ttattgattt
7741 tcagctgaaa ttactaatca attgataatg gattgctctt aatgtatata tttaaagtca
7801 tcgtttcgaa gtcagagcga aaaactttgt tctccaagtt tcctcgtttc cattttatca
7861 ctttgttgtc atataagtgg gttaaacaat tatcatataa ttgccgttct aatgatacga
7921 gcaaacgcat acgccgatgc ctaatcaaca aatattgatt gttttcatta tacatgcatt
7981 ccgtttacat agaaggattt ctgaatatat ataaaatata tatatttaca tgtgttctag
8041 acgacgagaa tcaaggtatt attattacca caaattatcg cattttgaga agccagtttg
8101 gacgttgagc ttgttgtttc ggtttttacc agtttttggt tgttcttgct gccatttggg
8161 actataaagg tgtgggtggt ggtggtaatt ggtgattggg tggcggttgc caccagggtt
8221 gggcattcac taatcgcgtg ttggctttgg caccacctaa ccaccatcta cctcccctca
8281 catttctccc atcacctttc ctttgtgttg ttcgcttcct tttcctttag ttttccagag
8341 ctgctggctc gtgtttttgc cattggctct ctttaccatt ttgacggcat tgtgttggtg
8401 gctttgtgcc ctgtaacttt ccttttgtgt taaatgtccc attgtctttt gtttgtcgag
8461 ttccctggca agaggcaact acgcgttgca atatcaatta ttaaaggctt ttttctagaa
8521 gaatttgtac agatatggta tacgaaaaat gtaataaggt agagaatctt ctttgtttca
8581 atctatttct gcttttaacc agtttacgtt tgtttatcgt tatcgctgta atcggcattc
8641 gccaccattg ctatcgaagt catatgcctt taagatgtta atcgacgtcg gtgtgatgcg
8701 atgtgctgca gtctgtcatg ggttaattgt ctggtcagct ctaattattg attgccgact
8761 cccggtcatt agcagctggc gaaggaagtc aacgatagcg atagcgatag ccatgcacaa
8821 gctttttcgg tttcttcttt ttcgttatac atacatatgt atgtacatag tagaatatga
8881 tgtatgtatg tatgaatatg tttgatattt ttatttacca ccccctgcgt agggtgggct
8941 tttgtgtatc accacaccag ggcctcccac ttttggttgc ttcttctgtt gttgactggt
9001 tggtattctc aagtctttgg caaatatgta gatgcgcaaa atttcctgtg gcgctttact
9061 cttcaattcg tctcatttaa tgattcttca cctatttctt tcccaaatga aatgcaaatg
9121 tttagctgtt tgctattttc ttttgcacat ttcatctact gttctatttt atgctgttct
9181 atgcgctcgt tgttgttgta ctttcacttt cgcgcggttg atgaggtggc caacagatgg
9241 tgctggcgtg caagcaacag gcgttgatgg aactgtggaa tcggaacgca ttcgatgtat
9301 ttttgggtct cctttattcc ttttgctaat tatttgctgc ttatttatat ttcactttcg
9361 atttcgaatg ccgcgccgcg tctacagtat atttttagtt tcgattcagt tcgatagcct
9421 agacctcaaa aaggggcacg gtacgaatgg caaatggtaa aaaatatttt gctttttgac
9481 atttttcctg cattcggaat caatatattc cccagccggt ataaatagcc tgttaaagca
9541 tgtcttgagg caattaccgc tgttaataat accaaccacg gtaaatgtca gagcgcatta
9601 tcgacatgca agttcaagac caagtaaaca agaaccacag ccgcttggct cacaggttac
9661 cccccccccc ccccccctca caagaccacc ccaagacgac cagcaaaggc tttctctgtg
9721 gaaaccccac tacttaagcc cgacccaact catttggtat ttaaatggaa gttgactcaa
9781 aactggaacc caccggcacg tgggtaatat cgattgaaat ggaaaataaa cagtattgtc
9841 ctgttttgtg acacatttca aaattttccg tttcctgccg ctacagatgt gatactcaga
9901 agcagttgta atttgaggca ctgcacacga aagtatattt taatgcattt taatatcact
9961 ttcataaaaa ctaagctgca tgcgtatagc ggaaacttgt ttttcttact tacgtaactt
10021 aattcccagg tgttcattag agaaacaacc aatttgtaga tatttattac aatgaattat
10081 ttctagagtg aatgtcccga actagtacgg cctaataagt caccttctgc tcaagccaac
10141 aagagcccat tcgatggtgt atacagcaag taataacttg taggcattat atatttaagg
10201 attggggctc tgcgtctctt ttctaagctc caatgtcaag tcaataaaca ttaaaccaaa
10261 tgacgcagtg gcgaatggaa ttggttgctg ttagtagggg ttgcttgtag gcgaaagtgc
10321 aagaggtgaa tgggcttacc aatggaaagt tcgtgaacat ggaaccgatt gatataaatt
10381 atatgatatt taacaaagta agcattatta ttgtacaaaa agaaaagtat ttaaaagttg
10441 atgcccgagg tccgagtggc gacatgtaca ttgaacaccc tgtacgaact acatatcacc
10501 ctgtaggtag cttttcttca acccccaact agcagggttt cgtatttctg tttctttatt
10561 attattttcc cgtttcttgt tgtttttgtg ccatggcatg acctaaccaa ctaagccaac
10621 aaactgaccc aagcgcaccc ttcacagcta cccccgtctc gtcccgttgt tggacacttg
10681 tcatcgtctg caaaaccgct ttcaacctga aaccaacaac ccaccgtacg ccgtttaggg
10741 gaatttgtac ctacaggcgg agactcgtgc aataattgtg ttgcctcgtt gatggggttt
10801 gtctttggtc tattgcgcac tattggtgtt gaccattgtt ggtttttttt tgttaatgtt
10861 taccttctac acgtttgccg tagttccttg tgtagtagtt ttcttggata actactcaat
10921 cttatttaaa taaataagac ttgcggacta tccccttatt cgcagtaatg cggcgatgat
10981 gttccttaac catttaggat cgatatctat agatagaatc tagatggtac gataccaatt
11041 ggtatatagg aaggatatat gtccttctat aatttccgtt tttgatattg ttagttgcag
11101 tgctattttt tttttctttt gcacatcttt ccagatgccg ttaagctgtt gcttaattta
11161 taattcattc ctgtttgatt taacacctcc gcactttagg tctcttgata aatatgtatg
11221 tgcagttgta aattaatggg gccggcaaca aaaagatttc ctacccgccc cacccatagg
11281 cggcatgcat aagtaaataa aaaaagcctg ttgttttatt gttgtttctt tttttggctt
11341 ttagtgtagt atttttaatg ctgataaatg gcagtttaaa gtagtgataa gattaaggtt
11401 tttacggcaa gaggaataaa ggaagaagaa gataccagga gcttatcaca aatctgtttg
11461 ttccagaatt gcatctgtgt tccattccat tgtgttccgg tctctgcagg tgtttcgctg
11521 ccaaagcctt tacacacatt tgaccgagga aagtttgctc gccatctcgc tctgaatccg
11581 tgcccctctc attgttctct gctctccctt tctctttctg tcactgcgtt gtgttccatt
11641 accacacgtt cctccgtcca tcacatatac atatgaacat atgtgtggtt ggcgtgcgtc
11701 agcgataata atttggaatt tgagttgtcg ttttcgctct gtttcgctct ttctctctgc
11761 gtttctgttt ctcggtttcg tttgagcttc tcgaGTTCTC AGTTCATTCG CGACCTTAAA
11821 GGCGGCCGCA CATGTTGCAC GCTGAGAAAA ACGTACACCA GACCAGACCA GAACACAAAT
11881 AAATAACCCA AATAGACAGT AAAATATTGA AAATCACAAA GATCTCCGCA TTTCTGTTAT
11941 TTTTATTTTT TTTTCGTTTT TGTTTCGTGA GAGTGTGTTT AAATTCGAAT GCTTTTTGTT
12001 GTTTGGCTTT TCTCTATGGT TTTTACGGTC TTAACAAACC GCAGTGCTGG TCTAAATTTA
12061 GCCAGAAAGT CAAAATAGAA CAAATTGGTG TTTGAAAATG CAGCAAAAAC AGCAACAATT
12121 CGTTTAACAA ATCGAAAACA ACCACTAATT TGTTTACTTG ATTTGAATAA TATTAGGCAA
12181 TGTGACTGTG AAGCGCCAAT ACTAAACAAA ATAAAAAACA AAAGTAATCG AATCGAAACT
12241 AAACTAAAAT CAAAAGAAGT GATTTAAAAT ATACCCAAAA CAGAAAAACT GTGCCGCTTT
12301 AGACGCTTTA TCAATTTCAA AGAACCGAAA AGGAAATACT CTAACGCCTA GAGTATTTAA
12361 CAGACCATTA AAAACCTGAT GGCAACAACA ACAACTACGC AGGCAGCAGG AGCTGCACCA
12421 GCTCTCAATT TATTGCCCGC CAGCAATAAC AATATAAATA ATACACTGAT CAACAACAAC
12481 AATAATAATA ATAATACTAG TAACAGTAAT AATAATAATA ACAACGTTAT AAGCCAGCCG
12541 ATTAAAATAC CGCTAACCGA GCGCTTCTCA TCGCAAACAT CGACGGGCTC GGCGGATAGC
12601 GGTGTAATTG TTTCCAGTGC ATCGCAGCAG CAACTGCAGT TGCCACCACC ACGCAGTAGC
12661 AGTGGATCGC TGAGTCTGCC ACAAGCGCCA CCTGGCGGCA AGTGGCGGCA GAAGCAGCAG
12721 CGCCAACAGT TGCTGCTCAG CCAGGACAGC GGCATCGAAA ATGGTGTCAC CACTCGTCCA
12781 TCGAAAGCCA AGGACAACCA GGGTGCGGGA AAAGCCAGTC ACAATGCCAC AAGCTCGAAG
12841 GAGAGCGGCG CGCAGTCGAA CAGCAGCAGC GAGAGCCTGG GCAGCAATTG CTCCGAGGCC
12901 CAGGAGCAGC AGAGAGTAAG AGCCTCCTCC GCTCTGGAGC TCAGCAGCGT GGACACTCCC
12961 GTGATCGTCG GCGGTGTGGT CAGTGGAGGC AACAGCATCT TGCGCAGCCG CATTAAGTAC
13021 AAGAGTACGA ACAGCACCGG AACCCAGGGA TTCGATGTGG AGGATCGCAT CGATGAGGTG
13081 GATATCTGTG ATGATGATGA TGTCGACTGC GATGATCGCG GATCGGAGAT CGAGGAGGAG
13141 GAGGAGGAGG AGGAGGACGA CGGCGTCAAT GTGGACGACG ATGTCGAGGA GGCCGACAAC
13201 CAGTCGGACA ATCAGTCGGG TATTATAATA AACCTCAAGA GCCAAACCGA ACAAGAGGAG
13261 GAGGTCGATG AGGTGGATGC CAAGCCGAAG AACCGACTTT TGCCACCGGA TCAGGCGGAA
13321 CTCACAGTGG CGGCGGCCAT GGCACGTCGA CGCGATGCCA AGAGCCTGGC CACCGACGGT
13381 CACATATATT TCCCACTGCT CAAGATCAGC GAGGATCCGC ACATTGATTC GAAGCTGATC
13441 AATCGCAAGG ATGGCCTCCA GGACACCATG TATTATTTGG ACGAATTCGG CAGTCCAAAG
13501 TTGCGAGAGA AGTTCGCCCG CAAGCAGAAG CAGCTGCTCG CCAAGCAGCA GAAGCAGTTG
13561 ATGAAACGTG AAAGGAGGAG CGAGGAGCAG CGCAAGAAGC GAAACACCAC CGTGGCATCC
13621 AACTTGGCGG CCAGCGGAGC GGTGGTGGAC GACACCAAAG ATGATTACAA ACAACAACCA
13681 CACTGTGATA CTAGCTCTAG GAGCAAAAAT AACTCGGTAC CCAATCCACC CAGCAGCCAT
13741 CTCCATCAGA ACCACAATCA TCTCGTTGTG GATGTGCAAG AGGATGTGGA TGATGTGAAT
13801 GTGGTTGCCA CCAGCGACGT GGACAGTGGT GTCGTCAAGA TGCGCCGCCA TAGCCACGAT
13861 AACCACTACG ACCGAATTCC CCGGAGCAAT GCTGCCACCA TTACCACCCG CCCTCAAATC
13921 GACCAACAGT CGTCGCACCA CCAGAACACC GAGGATGTGG AGCAAGGAGC TGAGCCCCAA
13981 ATCGATGGCG AAGCGGATCT GGATGCGGAT GCGGATGCGG ACAGCGATGG GAGTGGCGAG
14041 AACGTTAAGA CTGCCAAATT GGCCAGAACA CAGTCCTGCG TCAGTTGGAC CAAAGTGGTG
14101 CAAAAGTTCA AGAATATATT AGgtaaaatc tatgccctaa gcttaatctg tgacttaaac
14161 ataagcggaa gttatgtata ggatagtaat tagtacggac tatatagtat atagacttca
14221 aagccgcaga ctttgcccca tcatatagat tttccacaat ggccagtgca ctgggctttt
14281 tcagcaaata gcagaatgaa tatctcttat caattgtttt taaactgtgc ttttctaata
14341 tctttcattt cctttcttct atttgtaggt aagtgtcaat tgaatggaaa cacaaattcc
14401 tcataactcc cgagggttat cgtatcgggt gagattgttc tatgtacaca cacacacaca
14461 catatatata tatatatata tatatatata tatacacgca tagataagta ttctttatgg
14521 gtgcggaaaa ttttcggaga tttaatgtaa agtgataaag gtctggctgc tctctgcttc
14581 ttcttctatt cttagtctta gtgaattagt cagcgaatga aatgttatgc ataatgcaaa
14641 atggttttat gaaacgtact ttgttgcata ttttctatct tgttgaagtt gggtcaataa
14701 cccagtagca acaagtgaat ttttccgata gctttgtctt cagccttctc cgcgctgaga
14761 taaacgtggg tgggatatat atgcctggat aatatatatg atcttattct cccttttatt
14821 gtcatatgta aaatgtaaaa tttgatcttg gctttattgt ggagcgcagt ctcgacggct
14881 ggaacacgat aaggcaaagc ggaatattct cgctcgtctc tacaacaggc catttaactg
14941 ttgttgcatt tgtcgctgat aagaaaatcc gacaatggtt agaggcaggg aggggggggg
15001 gggtagcagc ggcgcggcag gagcagggca acagcactga cttacgccag ctgatgttga
15061 tttcttcgcc aactcatcta catagtacac tttatgactg gcaaactttg gattaagtgt
15121 gtgtataggt tataattata atacgctgct gttctcaacg atagtagaaa taagtgcgca
15181 caaaatgacc gacgagtttt tggactggct tatggataat ataacatttc cgacgagcca
15241 ccgcatccat cgggatccag ttacccagct gcccaccagt gtcatacgat ggggtagtac
15301 ttgcagtcga ttgtttttaa agattccact aggtgtccgt tttagatata cgataaactg
15361 gtggctgtca gttgcatcag ttcggaatct tggccccgtg agcttagcga acgaagcgtt
15421 aacgccgcgg tgcgaaaagt ggcaagtggc tagtggctag tggcacattc cccttgctgc
15481 tgactgctac gattacagcg tatggaaggc actgtgtcac agcgtgcttg tcacctcggt
15541 atcatcatca tatcatcatc agcccaaagc agttcccatc cggcgagtcg tagtggtgat
15601 ttgcagcttt cgcagatgct gactgatagc aggaaggttg gcgaactata tatatatata
15661 tatgttgtag aacttgccta gcttactttt ggtacttcac ttcgcccaac taccgcccac
15721 ccccgatttc tcgctgtgta gaggttaagt caaagctacg acagcctgca tgatcgaaga
15781 tcatcggcag taaagagatc aagaggcacg cggctatatt gcttacagcg ctgattatat
15841 atagccatag gtatatatat atatatgtat atatattccc gtcgaggggg tggcaggaaa
15901 gaggatggga agatggaagg aattggggat tcgggagcag agattacgcc gtgttagctg
15961 ctgttgtttc tttttttttt aacgctcgac aatcacgcgc tcgtttcata aaacggatga
16021 cagagcaaaa aacttttcca aaaatatcaa agcccaacaa catgcatata ttgaacagct
16081 tgtgaaatgt atttttacgt atatgtatat atgtattata tatgtatgta cattcgtata
16141 ttcactaaat ctgtgaccaa acggaggcgt gtctctgtcg tctttgtttt tgttttttgt
16201 tttttatgcc gccctctcac ctcatttcct ttctaccctg ccctgcctca cgttcgattt
16261 ggaatttgaa atggaaaaat atgtttttgt gcttttcact gggagtagta gtagtaggaa
16321 tatttgcaat tcggattatt atttgttgga aaagagcagg gaatccatat aaaacctatt
16381 gatgctgctg accattgaag agtaagagta aatgtgggct tgtattgatg gatattgatg
16441 ctccatattg atctgctaag aacaatatag caatttgtgt atttatattt gcacactgat
16501 agcgatgcaa ccaagtgatt agtgattagt gttgctattt tcgtcactta cagcgatgga
16561 aatgtttttt atttagcatt tgcccagctg tttttccatt gccgttacgc gtctgctttt
16621 atattaattt gctttatttt attttatgtt atcatgtcaa ctcgcattca atactcgtgt
16681 ttgatcagct gtgaaaagag cattttcaat ggattctagt tactagatgc cagatatcag
16741 atacaagtag ttctctttaa aatgcacctc acaaactatt gatttctctt tggcgaaatg
16801 gcttcctctt caccgatttg tttgtgtttt gtatatttgc tgttgctgtt ttctgggcac
16861 tttgaggtct tctaacgatt ggggatgatg gggaggcatt tgggctttac gtagatacaa
16921 aagagaaaaa aatgttgccc gattttattg gtggcattca aaagtatgct atggaaagtt
16981 ccaccgaagc tgcgatagaa gagacgaccg accaccgaca cccgttgtgg tgatgtgata
17041 acgatgcagg tgacaatggc tggcggattc atttctataa ggttgccaag tgctgcagat
17101 tgtggggcat ctcgtgatct ccgcccgcgt gggtcgtcat ctggccatcc cttttccacc
17161 gcctcgctta accgatctct gccctctctt ttgcccaaaa accgtgatcg acagatagat
17221 ttcttagcca ataaaataga ataaaatcaa ttcgtttcaa ttaaatgaag tctatatatg
17281 ctaacactag tctatactgc atatgcatta gcattagtag caaacagaaa aatggcactg
17341 atagagtata ttttttaatt gtcagcacaa cggcaaggtg aagattgcga atcaaatata
17401 taaaaaaata ctctaaaata aaaaaaacaa ggtgcacact ggggctatac atactcgtac
17461 atacatttac caaagaccct agaataacaa gatgcgtaac ggccatacat tggtttggca
17521 ctatgcagcc acttttttgg tgacggccaa aattactctc tttcggctca ctcccgctga
17581 gagcgtaaga aatctaaaaa tataatttgc ttgcttgtgt gagtaaaaac aagagacaag
17641 aacgcgtata agtgtgcgtg ttgtgctaga agacgatttt cgggccgaaa tcaattctga
17701 tcgaagaaac gaatttacat ggtacatatt agggtagttt ttgccaattt cctagcaata
17761 tgataaatta aaaaaaaatt attataattt taaagctttt taaatttgtt tgttaaaatt
17821 gttgctcgaa ttagctaccg tttacacatt tatatttatg tttaattcta atttgtctct
17881 catctgacaa ttttttaaaa gctaaatatt ttttttgaaa cacttttaat gttaatgtta
17941 catcatatta agtcaaatga tttaataaat atactaaata attaaatatg ataactgttt
18001 attgcaaaag taatatcaaa gacactagaa ttattctagt ttctttgctt tggtcatatt
18061 ttgaggcacg aagtgcggac acaagcactc aacaatcatt accttattaa ttattcacac
18121 gccgcaagat gaatactcta atgacaaata ttctaatata aagccatttt tgaaatttat
18181 ttttgtgata atatgtacat agatttggct atttctaacc tattttcaaa taataataac
18241 gttaaggcat gcaaaacaag aatttttcgc atggtgccaa ttgatcaaaa ataatataga
18301 tttaaagtct aagaacttct gaggtgaagg gcatattttg tcaaatttac aatgcatgag
18361 catacgtgtg cacacataca gttgtctgct atcacacttt gtgcgttgaa aagagctgtt
18421 cgctgtagcg ctcttcgctc tctcgctctc taacaaaaat tcgagagagc ctggagccac
18481 ctctagagcc acggccaaaa aattgtgtgc caaaaaatcg tatggcgtta cgcatcttgt
18541 tattctagtg tctttgcatt tacccttcag acgttccagt cttggctaat cttaagtgaa
18601 atccaaggga tacatctaca tctacatcct tgaaataaaa ctagtttgct attgggtaag
18661 ggttttcatt caatttcatt caacttggtt ggggttctga cggatagggc atttattttg
18721 ggcgtggttc aactgaaacc gaagttcgtg cggctaaact gcggcgatga ctccacttcc
18781 acgtccatgc cacacagata ttgaatggtg ggggtgcaaa atgccagttt tcgggttaga
18841 tcccgattat tgtttagtga acgcgctttc cattttcctt tattgctctc catccatcca
18901 tccgtctgtg ttagcttcct ccacctttcg ttccgttccg ttccgaatgg ggttttcgcc
18961 ccctttaatg tggtgatact gcgtcatggc attttgcatg ccaccgcact gcgccgccca
19021 caaacgccca cgcccccttt ttgcggctga gtggtgccaa atgcttgtat atcgattagt
19081 ttccgtcgac ggtggcggca acacaacaaa ttgctttgcc gctcggacag ctccaactgt
19141 tccatcagca gcagcggccc atgtttcacc ttttcgatat agtttctcaa tgtttggtca
19201 ggggggattg tggggggggg gtgaggagtg tgtttaaccc attgataatt tgattgattt
19261 ggtcgcagtc ctgtagaaac tcagttgatt aatgtgagaa tggcagcgga ggcaacaaaa
19321 cataaccgat ttacaattga tgaatcgatc aaatcgataa atgcacatcg atatatgtat
19381 gtatattgat cctttgcgat tctttcgaaa gtgcgaaggt cacattttcg tttaggcaat
19441 aattttaata tcgattgaca aagtatatgg gccaatcaag taaattggtt tattagctag
19501 cacgcaaata ttttattatc aattttatta tcattttttc aactttgagt acttttgcta
19561 acattacgca tatggatctc ttatctcttc gctattgcag atgcaataat gagaaacaat
19621 gtcaatgcta ttgggccttt aattttcatc gtcgcatttg tgattctaat tgattcatta
19681 aaataaaata aatttgattt gcacttgcac ttgcatgtgc tgcgtggttg tatcctttga
19741 ttggtcttct ccgtcgtatt ctcttcgaat tcttctggcg aatcttcgtt ctccgctctg
19801 cctcgttctt ctccgccgcc gcccgccgca ttgaactgtc agttgtctgt cgtcaaaaaa
19861 aaaaaaaaaa aaggcagggg ccataagggg gcgtggctgt cgcatgcttg accacgcttg
19921 gctgccactg tctttcttct gtcgttctgt cttctgtctt ctgcgtctcc gaaccagcgt
19981 gtcttctctg ctttggcctt atcgcaattt tccctcaatt aaaattttcc ccaaaatcgc
20041 aacaaattgt acgcgattat tatgggtatg gtctaagatt agtaggcttt tatattaatt
20101 tatttttttt tttcaagcgc gttttaaacg atcgaagaat tggtgaggat cgcattcgtg
20161 tgtgggtcgc gtgtggaacc catcgatcgc ccttcgtgtg gcattcgagt gcgatctctg
20221 ggtctctcag ctgtcgttcg tcgttcgtgt tgtcctctgg cttcttccat ttttttgtcg
20281 ccaattgtgc aacggtagct cgagcgatcg gatcgttgga tcgaaagcgg aacgaaaagc
20341 ggaacgcatc ggatttgatc tgatcagagg ggaggagcac tcctctttct cttttttcct
20401 cgcttcttct tcttcttgct gcttcttcct aaaaaaaaat aaaaataaaa aaaaatacaa
20461 catccgaata atcgaagatc gtactatttc ggttcgcaat tcttcttttt ttttttactg
20521 tgatattcca ctttgttgtt gttttcggta gtgccgcgtg tttgcatttt tgctatttga
20581 agatcgatcg atcgattgat ctgcacacaa agcgaacgaa acgtcaagtg accgaaaata
20641 aaaactaggc caaagcctgc caccgtacac agagagaaaa agcatttaaa gtttgtatgc
20701 agtaaaccaa accttaacaa aaaaaaaaaa aaaatttttt tttttggcat tcgagactgg
20761 aaaattggac gttagagatt atccattgta aaacacaagc acttgcatat gtgcatatac
20821 attttgtatc tgtgaaatAA TGCTCTTAAG AGTGAGCTTG CGTAATAAgt aggatttaaa
20881 gcaattgctt acgcatgtgt ggtatacact ttgtatattc tccaagatat gttgcttaaa
20941 ttctataatt tataatttgc ctgccttcgg aattttgtgg cattaagttc ttttgtatta
21001 atgatttcca aatatatttc atcccataat gagtttcgaa tataggtctc aactaattta
21061 gttgccgttc tgcgcaagct ttgatttcaa gcattgatac atatatattt tttttttttt
21121 tgccgtgtaa ggcaggcatc actagagcgc tatacccact ctaccgccac tcaaacggtg
21181 gttcagtgct ttagTGATTC AAGAAACCAG TGGCGGCCAG TACTCCACGC TCAAGGTGGA
21241 TAAGTCACAG GTGGTGCCCG TGGCGGTGCC GCGCGGTGTC CGCAAGGTGG TGCGAGTGGT
21301 TCGCAAAAAG AAGCTGGCAC CCGGCAGTGG GTCAGTGAAC GAGGCCAGTG AATCGGATGG
21361 TGCTGGCTCT GGCACCACCA CCTCCGGCAG GCAGAACTCC ATAGACGCCA GCAAACCGCC
21421 CGCCAAGGTC ATCAAGAATA AAAGGGGCTC CCTGGGCGGT GGTGGTGCAG CGCCACCCAT
21481 ACCGTTGGTC ACTAAAAAAA AGAACAAACG TCGATCGTCC AGCGAGGAGG AGGCGAGCAC
21541 CGGCAACGAA AGCCCCATCG AATCCGAGCC GGAATCCGGT TCGGGCAGCA GCTCCTCCAG
21601 CGCAGGCAGC GAATCTGATA CGGATAGCGA AACCACCAGC AGTAGTTCCT CCACCACCGA
21661 GTCCTCCAAT GCGGATCGCA AGTCGAAGGC CAATGGCAAG CACAAACCCC TTTCCGCCGC
21721 GGCCAAGGCG GCCCAGTCCG CCTGCTCATC GTCGTTGGTG GTGGCCGCCG CTCTGAAGAA
21781 AAGCAAACCG CCGAACCGGA GCGGCAGTGG GGCCAGCATC GCCAGTGCCG GCGGAGGAGG
21841 AAAGCAGTAC AAGGACCCAA gtacagagat aggacctaga agaacatccc cgaatatccc
21901 agatcccaaa atgtgatcca cgcacagaga taagagttgt gttaaaaaaa aaaaaacccg
21961 ccagcaagca gcagttgaat cctttatcga tgtagatata aagatataaa gatagaacga
22021 tagaatttgt tatggaaaag cgaaaagtac gagagatctg aagatttgtg accaccaaaa
22081 tcgacagtga tgcgagaggc agtagcagct gccccatcat catcatcatc acatcgtcgt
22141 catcatgaac acatgaacat ttacatcggc atctttggcg ccaatgccaa aattaattag
22201 caacaaaaaa aaaatgcaaa gggctcggtt gcatattgca gagaaaaatc agacacttag
22261 tttagttgct gatcttatgc taaatgtatt caatttaacc taatcggtat gttcgcaagc
22321 gatagttaga gatgcaggtg tgacataact ttaatctctt caattctcaa acgattttat
22381 tttgcacctc gtcgcgcggt gtacaacact atccacatct ttttatctct gtttctcgtg
22441 cttctaacct cctaacttat ttgcctgtcc ctccaatccg aactcttctg tctgtatctc
22501 caactcgtgt ttgtttagtt caacccgtga cttcttctat cttcctatcc ctcgttccgc
22561 tgttttcgtt tcgagattat ccctgagcac agggactgag tgactgactg agatcgttat
22621 cctgacaata tcttgtgtga ttcgtgtctg atttgtgtgc agGCATGAAA CAACTGATCG
22681 GCAAGCTGAA CGACCTGTGG CCCGAGCACA GTGTTGCACT GTCCATTCCA AAGGAGGTCG
22741 ATAGGAGCAA GGAGAAGCTG GAGGGCGCCT GGGAGACGAC AGgtgtataa atcatctaaa
22801 tttatcttta gatatatata tgtgtgtgtg tggtgtgcct aggataagcg gggtgcctac
22861 catgagggaa ccaactacca aataccaaat acttgaacta acaaattatc atttcactct
22921 tgtgccccaa aaaccaaaca gGTCGCGATG GTTCTAAAAT CACAACAGTT GTTGCAACAC
22981 CCGGCCAAGG CACCGATCGC GTACAAGAGG TCTCCTATAC AGACACAAAG GTCATCGGCA
23041 ATGGCAGCTT CGGCGTCGTG TTCCAGGCAA AGCTCTGCGA TACCGGCGAA CTGGTGGCAA
23101 TCAAAAAAGT TTTACAAGAC AGACGATTTA AGgtgggtgc atcaattgaa tctggcgcta
23161 aaagtattaa ttctaactaa caatgacact ttcccttcac agAATCGCGA ATTGCAAATA
23221 ATGCGCAAAT TGGAGCATTG TAATATTGTG AAGCTTTTGT ACTTTTTCTA TTCGAGTGGT
23281 GAAAAGgtaa gaaaaggggg tcaagtagct aatcaagtag agaacccttt ttttttgggg
23341 ggggaggtgg actgctgaag ttgccagcat tggtaattac ttagtacgtt ttcagttgaa
23401 aatccatcaa ccatcagcga atggaattcg taagaaagtg tgtatataaa tgatgagttt
23461 tgagccaatt ttgttgatgg ttaaaaccgt actaagtatt tatttttagc taaactgagc
23521 tttcgcttag cattttatgt gttattgagt ttatgagtta agcgattacc gaactgaaac
23581 gaatctcttt cacttgcata tcgaaactga aactgaaacc gaatccaaaa ccgaatccga
23641 aatcgaaatc ggtcactagc tggccgagtt ccccatcgat ttgggcatat tcgcgagtgt
23701 gctgaagccc caggtatccc agttctcaaa gttatcagtt cttcagctca gatcattcag
23761 cgacttgctc acactcgatc tagatctttc tctatctctc ataaacgaat ccgaaaaaaa
23821 aaccctgcta tctattagag aaatgtacaa tgaaacgtac agaagatcac atcgcatgtg
23881 gtacccgtac aatctaatct cccgtgaatc aatgatatcc tcctgctggt atgctagtta
23941 tgctagactc agtttaaatg tcgccgactg tacaaacccg atcaaataaa acgcatgttg
24001 atagctcgca actgcatcgg tcagcttcct ccaaaatcca gactcacata cgcatccaca
24061 catggatcat ttacgctcaa aaaaaaaaaa aaaactctga tatcgtaacc taatcgattc
24121 ttttttgtgc gcccctccct ccattgcagC GTGATGAAGT ATTTTTGAAT TTAGTCCTCG
24181 AATATATACC AGAAACCGTA TACAAAGTGG CTCGCCAATA TGCCAAAACC AAGCAAACGA
24241 TACCAATCAA CTTTATTCGG gtgagtactg atctgctatc catctttgtg tagtcgacac
24301 taacttgcat cttcctgttg tttcgcccga ttatagCTCT ACATGTATCA ACTGTTCAGA
24361 AGTTTGGCCT ACATCCACTC GCTGGGCATT TGCCATCGTG ATATCAAGCC GCAGAATCTT
24421 CTGCTCGATC CGGAGACGGC TGTGCTGAAG CTCTGTGACT TTGGCAGCGC CAAACAGCTG
24481 CTGCACGGCG AGCCGAATGT ATCGTATATC TGCTCCCGGT ATTACCGCGC CCCCGAGCTC
24541 ATCTTTGGCG CCATCAATTA TACAACAAAG ATCGgtgagt atttcacaca taccatcaca
24601 ttattgcatt caaattccat gtaaattcta atatcctaaa ttgacgctac atttcagATG
24661 TCTGGAGTGC CGGTTGCGTT TTGGCCGAAC TGCTGCTGGG CCAGCCCATC TTCCCTGGCG
24721 ATTCCGGTGT GGATCAGCTC GTCGAGGTCA TCAAGGTCCT GGGCACACCG ACAAGAGAAC
24781 AGATACGCGA AATGAATCCA AACTACACGG AATTCAAGTT CCCTCAGATT AAGAGTCATC
24841 CATGGCAGAA Agtaagtggc tagttgccgc caaaatgtga attggatcga ttatgatttt
24901 cgatattccc cttgcgcttt tcacgcgctc atctgacccc cgcccctcgg cattcttctc
24961 gggatgggga ggaaacttca tttttccccc ggtacaagca ccccccccca ccccccccaa
25021 aaaaaaagca ctcattgtta atttaccaca agtgtttcta accgaaaatt gtgtgtacat
25081 acaaactcgt tgcagTCACT ACTCGAACGC ACCCAATTTC CAAACGCCCT AAACCAGAAA
25141 CAACGATTGC GAgtaagcca agaaacgaat tgaaaacatc aaacaaaaat ttaaaaaaaa
25201 gcgttaagca aaatgaccaa attatataag cacaatgcca atcgcagcca catttgctca
25261 acaaatcatg caggggggtt gaatgcattt gaaatctatg tcgtatcatg actactcaag
25321 tgattgtggg gccatccaga gccagatcca gattcccatt cttgcttgat cttgatcttt
25381 caactgataa gcctggttta ttcgaaaagc tttctgggtt cactcactca gttatgtatt
25441 aagtgtgcga gcctacaaat gcagctgcct caatcaaaca atcaattcgc atttcatgct
25501 caaccacata cataggaatg ggtataagta tccagatatc cagattccga gatgcaattc
25561 catggcttaa gattcatgct tagcttttcg ccagccctcc cagctgtaca aagaagttat
25621 aaattggttt aatttaattg atttattgga tttgatttga tttcgagttg aatttaattg
25681 attgttttgg tttcttgccc tgcctactgc cgacgtttgt tttgatccgt aaaaaaaaaa
25741 aaatgagtgt tcgatcaaat ctgtaaatac gaacgcttcc tctgtgttct cgtttgtgtc
25801 ctgtccatat ctacgatgat tgcctcactg cccgccccgc cccccttttg cccaccagcc
25861 caattctgtc ccgcaagaaa gtcagtgcaa agatgtcaga tggttaaaac gccacttaac
25921 cgaatgctcc tttcacacac agGTTTTCCG TATACGCACT CCTACAGAAG CTATCAACTT
25981 GGTGTCCCTG CTGCTCGAGT ATACGCCCAG TGCCAGGATC ACACCGCTCA AGGCCTGCGC
26041 ACATCCGTTC TTCGATGAGC TACGCATGGA GGGTAATCAC ACCTTGCCCA ACGGTCGCGA
26101 TATGCCGCCG CTGTTCAACT TCACAGAGCA TGgtgagtga gatcagatcg atcagccagg
26161 tggcagaatt tgttgcaaca ctaatgtcgc cttcaatccg cagAGCTCTC AATACAGCCC
26221 AGCCTAGTGC CGCAGTTGTT GCCCAAGCAT CTGCAGAACG CATCCGGACC TGGCGGCAAT
26281 CGACCCTCGG CCGGCGGAGC AGCCTCCATT GCGGCCAGCG GCTCCACCAG CGTCTCGTCA
26341 ACGGGCAGTG GTGCCTCGGT GGAAGGATCC GCCCAGCCAC AGTCGCAGGG TACAGCAGCA
26401 GCTGCGGGAT CCGGATCGGG CGGAGCAACA GCAGGAACCG GCGGAGCGAG TGCCGGTGGA
26461 CCCGGATCTG GTAACAACAG TAGCAGCGGC GGAGCATCGG GAGCGCCGTC CGCTGTGGCT
26521 GCCGGAGGAG CCAATGCCGC CGTCGCTGGC GGTGCTGGTG GTGGTGGCGG AGCCGGTGCG
26581 GCGACCGCAG CTGCAACAGC AACTGGCGCT ATAGGCGCGA CTAATGCCGG CGGCGCCAAT
26641 GTAACAGgtg agtaagcggt tggccatgca gctccatccc cgccttccgt cgcctgctcc
26701 ttttcacctc ctctcctctt tttcccagtc attgtatcat cagtattgta tcgtatcgta
26761 tcgtatcttc agtttctaac ttgtgtgcca tgtgacaaga ggggcactag ttttccaaat
26821 ctagccttcc atatttgcaa taaaaagtag catttaatca cgatacgata ctttgatctt
26881 ggcaattcgc ttgccacaca caatcccata ctttatccat tgcccgatcc ttagCTGGTG
26941 TCCATCTCAT GATGCGGCAA CATCGCAAGT TGCCGTTGTC GGGGAAGCCC TTCGTCCGCT
27001 ATACGGCCAA CATTTGATTG CGATGCTATC CGAGCATCGT TCCTCTATCC ATCTATTATA
27061 AGCAAACGCC TGAAATCCAA GCAAAACCCA GGAAAGATAT ACGAATAGAT CTAGTAAGGA
27121 AGCCGTAGAA GAAGCATGCT ATTTGGGGGC AAAGAGTGGA AATCATGAAT CAAGAATTGA
27181 GACAAGAATT GCGAATCAAG ACATAACATT TTGCAGATTC AAGAAGAAGA TGTTCTCAAA
27241 AACAAATTAT TGCTGTAGTT ATCTTTTCAG TTGAATATAA ATCAGTTATT TCCAAGTCAT
27301 TTaaaaagtt tcagtgaggt tttgtaaatg gttcctaaag tcgagatttt caatagagtt
27361 cttcggagtc tttccaaggt ttgccctaca atctatgcac tgaaaatcct tgtctgcaat
27421 tgtccttttc acgatatttc gaatcaatcg ggatatattg atcgtccatg gctatggaac
27481 acttgataga ttgaattggt tttcattcga tttcatttga tttcattaat tcatgtttac
27541 tttggtattt tggtgtgtca ttagacggct aaagatttat gattttaatt tgtttttttt
27601 ttgtttcctt ttgttttatt tacagATTCA TAGGGGAAAT AGTAACATAC ATACACACAC
27661 TAAATATATA TCCAAGCATA TATATATAGT AATCATTATA TATAACACCT ACACCCACAA
27721 CAACAACAAC AGCAATTATA TATAATAACC ATAAACAAGA ATGGAGAAAG CCAATCCAGC
27781 AATCACAGCA AACTATATAC ACAACAACAA CAATTAAATT AATTAATGCA ATTGATGAAA
27841 GAACAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCATC AACCGCAATT TCAAAAGAAC
27901 TCTAGAAACA GCAAAGGCAT AAAATATAAC AAAAGAAATA TTTTACTTAG GTAAAACATT
27961 AAATTTATTT TAAATCTAAA ATAAACTAAT AAGCATTAAA TAATACATGA TAATGGTAAA
28021 TAAACACACA ATAATTATAA TAGTAGAGCG AGCGCTGATC GATTGTCATT TTATTGCTGC
28081 CGCGCGTGGC GATATATATA TATATATATA TATATATCTT TTAATTAATA TTTTAAGTGA
28141 TCCTCTCCGC AACTCTCTTC GTTAATTAAT GTATCCCTCC TATTTTTTTT GACGCCTTGA
28201 AAAAGAAATG AACCAATGTA TATGTATATT TAAAAGAGTC ACTGCATATT TTTTTTACAA
28261 CACCACCTTG ATTTAGTACG TTTAACTTAT GATAACTGAT GGTAATAGAA TGGCGGACGA
28321 GTTTTGTGTG GTTAGAGCGA GCTAAGATCT AACTAACTAA GAGTTTGCGT AGGTTAAACA
28381 AGGCATGGTC TTGCAACAAC GTGCAGCATG CAGCATGCAA CACACTCACA CACACACACA
28441 CACCACCCTA AGAAAGCAAG AGGAAGGCAG AAGAGGACAG AAGAGGAGGC GAAGATGAAG
28501 TAAAGTGGAA CAGATTGAGA AAGAGAAGGA GAATGAGAAG GAGGAACAGA AACAAAGCAA
28561 AGCCCCGAGC ATAATGTTAA TGTTATGTTA AAACCTAAAT TTAATGCAAA TTATTAACGC
28621 AGAAAAGAAC GAAAGAGAAA AGGAAAAAAC AAAAAAAAAA AAAGCAAAAA AACAAAGCAA
28681 AAGCGAAAGC GAAACTGTTT AAACTATACC AATATAATAT ATAATCATTA TGAATATAAA
28741 CTATATATTT TTTTTTTTTT TCAACACACA CATTATGTAT ACATACATAC TGAATAAATA
28801 AATTATTTTT ATTTTTATTA TATATCGTAT CGTATATCGT ATATATTTTT TTTCGCGCTA
28861 AAATATATAT GTACAGATAC CATTATTTAG CCAGTAGATG AGTTATACGG ACACATACGG
28921 ACATACAGAA CGGAGGAGGA AGAGAGCGCA GGCGTGGAGC TGGCGCCTGA GAACCGGCGG
28981 ATTAGTTGCA ATATGTAGAT AAGGTCCAAA TAACCGGGTT CCCGCCACCG TAGAGCTCCA
29041 TTATTATTAT TCGCATAGAT AGTCAGTGTC GTCCTGCCTC GCCCCCAAAA AGCTCCTTCC
29101 CCGCTCCCCC GCTCGCCACA ATTTCCGCAC AGAGCTGCTA CTGAATATTA TTAACAATTT
29161 CTTGCTCAGA GTGGCAAGGG AAGAAGAAGA AAAGAAAAAA AAGAAAAAAT GAGAAGAACG
29221 AAGCGAATTG CATAAGCGAT ATGATGAAAA ATGATGAGCA AAAAACTTAT ATTTATTTCT
29281 ATAGCTTATT ATAATCGACC TAAAACTAAT TATTATGCTA ATTATAAACG ATTATTGAAT
29341 ACACACACGC AGAATAAAAT ACATTTTCCT AGTAACTTAG GCACACGCGA GTAAAAAAAA
29401 AAAAAAGAAC AACTGGAAAA CCTTGAAAAA AAATGCAAAA AAAAAGAATC ATGAAAAATT
29461 AAACACGTTA GCTTATTTTT AGACTCGCTA ATTACATAAC ACACACACAC ACACTCTATA
29521 CACAGACACG TGCATACACC GACAATTGTA TATGTAATGC TGTAATAATC ATGATAATAT
29581 TTAGATTCGT TGATGATAAT GAGCAAAGAA GCCGTAATGA TAATGATAAT AAATGAATAC
29641 AACAAAATCC AACAAATAAA AAGAGAAACA AATACAATAT TTAAACaaag taaaaccttc
29701 tgttgttgcc tttcctttta accaccgtat ccacagtatc attttgttca gtgtgtgaac
29761 ctttcgtctg catcttcatt tcccctctcc gaagtaattg ggctcggaat gggtgggata
29821 acatcgattc taatcgattt aatcgattcg ccatacttaa tgtttaagta taattatttg
29881 taatttcagt taaacatgcg tttttttttt aaatcatttc aatattattt cttacttaaa
29941 cgatggcttg tgcattctga ttgagcttct aggaaatggt ggcttgcaac tgtgtgaaat
30001 ggctgggtct gctctgtttt gggcctccca agcgaatcac ccacaaaaac aacataaagc
30061 ttaatccata cgaatagcaa tccacatctg tatatctttt tccttctttg gctctgaatc
30121 gttttgtagG CTCGCAATCG AACAGCGCCC TCAATAGCAG CGGAAGTGGC GGAAGCGGAA
30181 ACGGAGAGGC AGCCGGCTCG GGTTCCGGAT CCGGATCGGG CTCAGGAGGC GGGAACGGCG
30241 GGGATAACGA TGCTGGCGAC AGTGGAGCAA TCGCATCTGG AGGCGGAGCA GCAGAAACCG
30301 AGGCAGCGGC GTCGGGTTAG CGCGAGCGAG TAGCTCTTTA AATGTAATGT TATTAGCAGG
30361 TTTTTCGCTC GGCCCGGGGA TTCAGTTAAC CCATGTCGGC CAAGAGCGAG ATGACAACAC
30421 CACACACCAC ACACACACAT ACACACACGC AGCTACACTC GAAATATGAA AGAGATGTCG
30481 GATGTCCCAG CAGAAGCTAT GAGTAAATGA AATGAGACAA AGGAATTCAC TACACAAAAC
30541 GCCCAGTATC CTTACACCCC CACACAACAA TAAACCCATC CACACACACA CACACACACT
30601 AACACAAACA CACACACACA CACATGTATG TATGTATATC TAGCTATATG CATTGGGCGC
30661 AAGCAAATAT TTAGCATAAA ATCGAAATAA AACCAAAAAT CCACTTTAAA CTATGCATAA
30721 ATAATTAAAA TAATTATCTG TACTATTAAA GAGAAAGAGA AATCCCCGAA GGAATGTTGA
30781 GAAATAATCG GAAAACCCCT CGCCCGCGTC CCAAACCTTC AATATAGTAA ATAACACTTA
30841 ACAACCAGAT CGCGGAACGT AATATAAATT AAGTCAAAAA AAAAAACAAA AAGCAGAAGC
30901 AACTTGAATG AAATACTTAG TGAAATAAAT CAAAATTTTT GCCCATTTAA CGTTTATATA
30961 TATGCGCGGT TATACAAATA TATATAACGA TCACAGCAGT TAGCAATCCA TAGTAAAAGT
31021 AAACAATTAA AGGCGGCAAG TAAGAGGAAC TAGCAAAAGG GCGGATACAA CATAAACTAA
Atgc =ekson
Atgc =intron

Analisa potongan DNA di atas dan jawab pertanyaan berikut :


a. Dari organisme apa dan pada kromosom berapa fragmen DNA tersebut
berasal? (5)
Melalui sekuens tersebut, untuk mencari organismenya apa, saya
mencari data tersebut menggunakan NCBI. Kemudian pada web
tersebut, saya memilih BLAST. Selanjutnya menggunakan pilihan
blastn, dan dimasukkan sekuens yang sudah saya dapatkan dan di
RUN BLAST. Hasil yang saya dapatkan yaitu, sekuens ini dimiliki
oleh organisme Drosophila melanogaster. Setelah itu, ketika saya
melihat query cover serta identity yang dimiliki yaitu mencapai
100% pada Drosophila melanogaster yang memiliki kromosom X.
Sehingga pada kromosom 10 fragmen DNA itu berasal.

b. Ada berapa prediksi open reading frame (ORF)? Berapa exon


diprediksi dari setiap ORF? Tandai prediksi daerah promoter, start
dan stop codon, exon dan intron dan daerah terminator
(penambahan polyA). (35)

Untuk mengetahui prediksi open reading frame (ORF). Seperti yang


kita ketahui pencarian menggunakan ORF dari NCBI hanya
memberikan alternatif mengenai start dan stop kodonnya saja.
Karena sebenarnya ORF dari NCBI lebih cocok untuk prokariot
daripada eukariot, pada kasus ini Drosophila melanogaster adalah
eukariotik. Sehingga, apabila mencari ORF menggunakan ORF
Finder dari NCBI ditemukan hasil seperti berikut:
Sehingga digunakan GENSCAN, untuk mengetahui exon, intron,
start, stop kodon, hingga daerah terminator (penambahan poly A).
Sehingga melalui GENSCAN, kita dapat mengetahui coding ORFnya.
ORF
1

ORF
2

Pada gambar diatas. Dapat diketahui yaitu ada dua prediksi ORF.
Dilihat dari ORF 1: diprediksi terdapat 5 ekson. Dimana Init
menandakan adanya ekson pertama yang terdapat start kodon,
sedangkan term yaituekson terakhir dan terdapat stop kodon.
Sedangkan, pada prediksi ORF 2 terdapat 14 ekson.

Jawaban untuk menandai, lihat sekuens di atas.

c. Apakah prediksi urutan asam aminonya dari setiap ORF dan apakah
fungsinya? Berikan data dukung (45)

PREDIKSI URUTAN ASAM AMINO ORF MENURUT GENSCAN


Prediksi urutan asam amino dari ORF 1.

Prediksi urutan asam amino dari ORF 2.

Tidak adanya fungsi yang ditemukan dari urutan asam amino ORF 1.
Sedangkan urutan asam amino ORF 2 ditemukan data sebagai berikut:
Penjelasan yang didapatkan dari protein katalitik domain (accnumber:
cd14137) yaitu sebagai berikut:
Pada gambar di atas juga menunjukkan bahwa protein tersebut
memiliki ATP binding site dengan 18 residu. Kemudian terdapat axin
binding site dengan 13 residu, dimer interface dengan 25 residu,
polypeptide substrate binding site dengan 12 residu, active site
yaitu 13 residu, dan activation loop (Aloop) dengan 22 residu.

Namun, apabila kita mencari fungsi dari urutan asam amino ORF
yang didapatkan dari CDS di GenBank dari Drosophila
melanogaster, hasil yang ditemukan sebagai berikut:
Urutan asam aminonya yaitu:

Dan apabila menurut salah satu referensi dari Pubmed yaitu:


Menurut data BLAST, urutan asam amino dari ORF mengkode
protein glikogen synthase kinase/ Shaggy. Sehingga sama dengan
functional domain yang ditunjukkan oleh Prosite maupun conserved
domain dari NCBI yaitu menunjukkan urutan asam amino tersebut
mengkode protein kinase serine/thereonin, ataupun glikogen
synthase kinase 3,walaupun memiliki urutan asam amino yang
berbeda.

d. Prediksi daerah functional domainnya dan berikan penjelasan (15)

Untuk memprediksi daerah functional domain, maka menggunakan


program Prosite, prediksi urutan asam amino ORF 1 pada GENSCAN,
tidak memunculkan hasil apapun (no hit) sehingga dapat diartikan
bahwa urutan asam amino tersebut tidak terdapat pada database.

Sedangkan untuk urutan asam amino pada ORF 2 GENSCAN, hasil yang
diperoleh yaitu sebagai berikut:
Daerah functional domain yang didapatkan yaitu mengkodekan protein
kinase dom yaitu pada asam amino 1061 hingga 1345.

Juga diprediksi terdapat functional domain lainnya yang ditemukan pada prosite yaitu
Protein kinase ATP(Protein kinases ATP-binding region signature ) (aa1067-aa1091), dan
Protein Kinase ST (Serine/Threonine protein kinases active-site signature) (aa1182-
aa1194).

Umumnya protein ini memberikan deskripsi yaitu:


Eukaryotic protein kinases are enzymes that belong to a very extensive family of proteins
which share a conserved catalytic core common to both serine/threonine and tyrosine
protein kinases. There are a number of conserved regions in the catalytic domain of
protein kinases. We have selected two of these regions to build signature patterns. The
first region, which is located in the N-terminal extremity of the catalytic domain, is a
glycine-rich stretch of residues in the vicinity of a lysine residue, which has been shown
to be involved in ATP binding. The second region, which is located in the central part of
the catalytic domain, contains a conserved aspartic acid residue which is important for the
catalytic activity of the enzyme; we have derived two signature patterns for that region:
one specific for serine/ threonine kinases and the other for tyrosine kinases. We also
developed a profile which is based on the alignment in and covers the entire catalytic
domain.

Das könnte Ihnen auch gefallen