Beruflich Dokumente
Kultur Dokumente
5'-CGCAAGAGTACTGGGGGTAAAGCCCCCCGAAAGCAGTTGGCGACCAAGGCTGCCCGAAAG
AGTGCCCCGGCTACCGGCGGCGTGAAGAAACCCCACCGTTACAGGCCTGGTACCGTGGCA
CTGCGGGAGATTCGCCGTTACCAGAAAAGCACAGAGCTGCTTATCCGAAAACTCCCTTTC
CAGAGACTGGTCCGTGAAATAGCGCAAGACTTCAAGACCGAACTCCGATTCCAGAGCTCA
GCAGTGATGGCGCTTCAAGAAGCGAGCGAGGCATATCTCGTGGGACTCTTCGAAGACACA
AATCTGTGTGCCATCCACGCCAAGCGGGTG-3'
Counting the numbers of the various base types and the total number of bases is simple but
tedious. It turns out that there are 330 bases in the above sequence: 94 are G, 86 are A, 54 are T
and 96 are C. IMPORTANT NOTE: BECAUSE THESE NUMBERS COME FROM A
SINGLE STRAND, THERE IS NO NECESSITY FOR THE MOLAR AMOUNTS TO
FOLLOW CHARGAFF’S RULES. If we express these abundances as percentages, we get the
following table [rounded to nearest 1%]:
g 94 28.48 a 86 26.06 t 54 16.36 c 96 29.09
%G %A %T %C
given strand 94/330=28% 86/330=26% 54/330=17% 96/330=29%
complement 29% 17% 26% 28%
1. Using your understanding of the Watson-Crick base-pair rules, fill in the base frequencies in
the above table for the complement [as percentages].
3. What is the %GC for the complement of the given strand? _____57%_________
4. Is it always true that the %GC for a given strand and its complement will be equal?
YES or NO [circle one]
If you finish early: sequences like this histone gene that have %GC>50 are described as GC-rich,
or equivalently as AT-poor.
195
Appendix VII: Additional base composition problems
The base-composition rules that Erwin Chargaff determined for double-stranded DNA:
[A] = [T]; [C] = [G]; and [purines] = [pyrimidines], where [ ] = molar amount
place strong constraints (limitations) on the values that these numbers can take. Even without knowing
Chargaff’s rules, because [A]+[T]+[C]+[G] = 100%, there are only 3 independent numbers, when molar
amounts of the four nucleotide types are expressed as percentages of the total. Chargaff’s equalities above
place additional limitations on the values that the molar amounts can take.
Note: Questions 1-3 below deal with three different double-stranded DNA molecules, and can be answered
using only simple arithmetic and your knowledge of the Watson-Crick pairing rules. Some numbers
cannot be determined from the available information; in those cases, write in “ND.”
1. In particular, if we know [A] = 30% for a DNA molecule, we can determine which of the following
numbers? [write in all values that can be determined]
2. Also, if we know %GC = 60% for a DNA molecule, we can determine which of the following
numbers? [write in all values that can be determined]
3. Finally, if we know [A]+[C]= 50% for a DNA molecule, we can determine which of the following
numbers? [write in all values that can be determined] Note: A wide range of base composition values
are compatible with [A]+[C]=50%, including the sequences from question 1 and 2 above, so the
molar amounts cannot be determined.
%C = __ND___ %T = __ND___ %G =____ND__ %A = __ND___ %GC = __ND___
4 In the laboratory, it is possible to synthesize short, synthetic, single-stranded DNA sequences. These
synthetic DNA molecules, commonly called oligos, can either have a defined [known] sequence, or
else one or more positions within the sequence can vary randomly. Suppose that 6-base oligos are
synthesized that have a random 50%/50% mixture of Gs and Cs at each position. What is the
probability that a particular synthetic molecule would have the sequence 5'GGGCCC3'?
[½ ]6 = 1/64 - note that the arithmetic needed for this question is exactly the same as calculating
the sequence of any particular series of 6 coin tosses, like H H H T T T.
5. Suppose that 4-base oligos are synthesized with the following sequence:
5' C N A N 3'
where the position identified as N can be A, C, T or G with 25% probability each. What are the
probabilities that a particular synthetic molecule would fall into each of the following three sequence
groups:
5' C N A Y 3' 5' C R A N 3' 5' C R A Y 3'
1. What is the base sequence of the DNA strand that would be comp lementary to the following single-stranded
DNA molecules:
a. 5’ AGTTACCTGATCGTA 3’
3’ TCAATGGACTAGCAT 5’
b. 5’ TTCTCAAGAATTCCA 3’
3’ AAGAGTTCTTAAGGT 5’
2. Two double-stranded 25 base-pair DNA fragments are heated in solution. Fragment A has 60 percent GC,
and fragment B has 40 percent GC. Which fragment has higher density? Which fragment is more
thermostabile?
Fragment A, with higher %GC, has higher density and is more thermostabile than fragment B.
3. The percent cytosine in a double-stranded DNA is 17. What is the percent adenine in that DNA?
In double-stranded DNA, if (C) = 17 percent, then (G) = 17 percent. This means that the DNA has 34
percent G-C, and 66 percent A-T base pairs. The DNA will have 33 percent A.
Each base pair has 2 nucleotides, so the molecule has 200,000 nucleotides.
There are 10 base pairs per complete 360° turn, so that there will be 100,000/10 = 10,000 complete turns
in the molecule.
5. Assume that the two strands of one very long dsDNA molecule are separated from each other and purified.
One purified strand is analyzed and has an [A+T]/[G+C] ratio of .66 and an [A+G]/[T+C] ratio of 2.33. HINT:
because these ratios are calculated for single strands, there is no necessary requirement that [purines] =
[pyrimidines].
a. The [A+T]/[G+C] ratio of the other purified (comp lementary) strand is 2/3
6. Assume that all four nucleotide types [A or C or T or G] are equally frequent in a long single-
stranded DNA sequence [called the ‘given sequence’]. What is the probability that a
randomly-drawn tetranucleotide sequence from the given sequence would have at least one A?
7. For questions 7-8, suppose for the given DNA sequence in question #6 that the nucleotide
types are not equally frequent, but have the following relative frequencies:
What are the nucleotide frequencies for the opposite strand to the given DNA sequence?
8. For the given DNA sequence in question #7 with unequal nucleotide frequencies, what is the
probability that a randomly-drawn trinucleotide sequence from the given sequence would
have at least one C?
9. If a 60-bp double-stranded DNA molecule has a %GC = 33.33% = 1/3, how many total
hydrogen bonds would be present in this molecule?
1/3 of base pairs are G-C pairs with 3 H bonds and 2/3 are A-T pairs with 2 H bonds