Fern life cycle
[Figure: fern life cycle diagram; labels: haploid spores (n), egg (n), sperm (n), syngamy]
Fern genetics
• Recessive alleles are not masked in haploid
gametophytes
• Gametophytes and sporophytes can be
vegetatively propagated
• Controlled crosses and doubled haploid lines
(selfed individuals are homozygous at all loci)
• Gametic-phase segregation and recombination
can be directly observed from gametophytes
• Genome function can be examined in haploid
and diploid phases independently
Challenges in fern genetics
• Large genome sizes (avg. 10 Gb;
humans = 3.2 Gb; Arabidopsis = 0.157 Gb)
derived from DNA extracted in CTAB and purified on a CsCl gradient
• Combination of 454 Standard FLX and Titanium
[Figure: histogram; y-axis: Number of reads]
# contigs: 76,866
Total bases = 37.37 Mb
(trained on Arabidopsis)
• 7.27 Mb are putative exons (19.46%)
[Figure: histogram of exon length; x-axis: Exon length (0 to 1500), y-axis: Frequency]
[Figure: pie chart; Exon 19.46%, Noncoding 80.54%]
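The exon share quoted above can be checked from the two reported totals. This is a minimal sketch that assumes the percentage is simply putative exon bases divided by total assembled bases; the constant and function names are illustrative and do not come from the slides.

```python
# Figures as reported on the slide, in megabases.
TOTAL_ASSEMBLED_MB = 37.37
PUTATIVE_EXON_MB = 7.27


def percent_of_total(part_mb, total_mb):
    """Return part_mb as a percentage of total_mb."""
    return 100.0 * part_mb / total_mb


# Assumption: the slide's percentage is exon bases over total assembled bases.
exon_pct = percent_of_total(PUTATIVE_EXON_MB, TOTAL_ASSEMBLED_MB)
noncoding_pct = 100.0 - exon_pct
print(f"exon: {exon_pct:.2f}%  noncoding: {noncoding_pct:.2f}%")
```

Because the inputs are rounded to two decimals, the result can differ from the slide's 19.46% in the last digit.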
Genome: microsatellites
Repeat motif length    Count
dinucleotide           5564
trinucleotide          470
tetranucleotide        15
pentanucleotide        78
hexanucleotide         1
[Figure: histogram of microsatellite repeats; x-axis: 0 to 100, y-axis: Frequency]
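Counts by motif length like those above could be produced by scanning contigs for perfect tandem repeats. The following is a minimal sketch, not the method used for the slides: the function names, the `min_repeats=5` threshold, and the toy sequence are all assumptions made for illustration.

```python
import re
from collections import Counter


def is_primitive(motif):
    """True if motif is not itself a repeat of a shorter unit (e.g. 'ATAT')."""
    return motif not in (motif + motif)[1:-1]


def count_microsatellites(seq, min_repeats=5, motif_lengths=range(2, 7)):
    """Count perfect tandem repeats in seq, keyed by motif length.

    min_repeats is a hypothetical threshold; the slides do not state one.
    """
    seq = seq.upper()
    counts = Counter()
    for k in motif_lengths:
        # A k-mer followed by at least (min_repeats - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for match in pattern.finditer(seq):
            # Skip motifs such as 'ATAT' that are really shorter repeats.
            if is_primitive(match.group(1)):
                counts[k] += 1
    return counts


# Toy sequence (made up): one dinucleotide run and one trinucleotide run.
toy = "GG" + "AT" * 6 + "CCG" + "CAG" * 5 + "TT"
counts = count_microsatellites(toy)
print(dict(counts))
```

The sketch only detects perfect repeats. Dedicated SSR finders also handle imperfect and compound repeats, so their counts on real data would differ.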
gametophyte total RNA
Reads were vector screened and quality trimmed:
• 681,722 reads
# 2º contigs: 0 5,905
Mean length = 685.76 bp
largest unigene size: 4,489 bp 4,897 bp
total consensus: 32.30 Mb 26.67 Mb
[Figure: histogram of unigene length; x-axis: Unigene length (0 to 2500), y-axis: Number of sequences; largest transcript = 4897 bp]
Transcriptome: BLAST
• 17,788 unigenes had no match in the database
[Figure: E-value distribution; x-axis: E-value (1e-X), 0 to 175]
[Figure: Sequence similarity distribution; x-axis: #positives/alignment-length (0 to 100), y-axis: HITs]
[Figure: pie chart of BLAST outcomes; categories: No BLAST result, No BLAST hit, Positive BLAST hit; labeled shares: 0%, 46%, 54%]
Transcriptome: BLAST
[Figure: HSP/HIT coverage distribution; x-axis: HSP/HIT coverage in % (0 to 100), y-axis: HITs]
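The three BLAST charts plot quantities that are simple transforms of per-hit values. The sketch below shows one plausible way to compute them. It assumes that the "1e-X" axis is the negative base-10 logarithm of the E-value and that HSP/HIT coverage means HSP length relative to the full hit length. The function names and the `floor` clamp for E-values reported as 0 are hypothetical.

```python
import math


def evalue_exponent(evalue, floor=1e-180):
    """Return X such that evalue is about 1e-X.

    E-values reported as 0.0 are clamped to `floor` (a hypothetical choice)
    so that the logarithm is defined.
    """
    return -math.log10(max(evalue, floor))


def similarity_percent(positives, alignment_length):
    """Positives over alignment length, as a percentage."""
    return 100.0 * positives / alignment_length


def hsp_hit_coverage_percent(hsp_length, hit_length):
    """HSP length over hit length, as a percentage (assumed definition)."""
    return 100.0 * hsp_length / hit_length


# Made-up example hit, for illustration only.
print(evalue_exponent(1e-50))
print(similarity_percent(positives=90, alignment_length=120))
print(hsp_hit_coverage_percent(hsp_length=150, hit_length=200))
```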
[Figure: GO annotation, Biological Process. Categories shown: P:nucleobas..., P:generation ..., P:carbohydra..., P:catabolic process, P:cellular amino ac..., P:signal transduction, P:lipid metabol..., P:photosynthesis, P:biological_process, P:response to abiot...]
Transcriptome: GO annotation
[Figure: Direct GO Count, Cellular Component; x-axis: #Seqs (0 to 3,500), y-axis: #GO. Categories shown: C:plastid, C:membrane, C:mitochondrion, C:cytoplasm, C:nucleus, C:plasma membrane, C:intracellular, C:thylakoid, C:ribosome, C:cytosol, C:extracellular region, C:endoplasm..., C:cell wall, C:vacuole, C:cell, C:cytoskeleton, C:Golgi apparatus, C:nucleolus, C:peroxisome, C:cellular_component]
Transcriptome: GO annotation
[Figure: Direct GO Count, Molecular Function; x-axis: #Seqs (0 to 3,500), y-axis: #GO. Categories shown: F:binding, F:catalytic activity, F:nucleotide binding, F:hydrolase activity, F:protein binding, F:transferase activity, F:kinase activity, F:transporter activity, F:DNA binding, F:structural molecu...]
Transcriptome: paleopolyploidy
• Evaluate the distribution of
synonymous substitution rates
(Ks) for duplicate gene pairs
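Once Ks has been estimated for each duplicate pair, evaluating the distribution amounts to binning the values and looking at the histogram. The sketch below covers only the binning step. The function name, bin width, cutoff, and toy Ks values are assumptions for illustration and are not data from the slides. Estimating Ks itself requires a codon-aware method and is not shown.

```python
from collections import Counter


def ks_histogram(ks_values, bin_width=0.25, max_ks=2.0):
    """Count Ks values per bin; values outside [0, max_ks) are dropped.

    bin_width and max_ks are hypothetical defaults, not taken from the slides.
    """
    counts = Counter()
    for ks in ks_values:
        if 0.0 <= ks < max_ks:
            counts[int(ks / bin_width)] += 1
    n_bins = round(max_ks / bin_width)
    return [counts[i] for i in range(n_bins)]


# Made-up Ks values: several young duplicates plus a cluster near Ks = 1.1.
toy_ks = [0.05, 0.1, 0.2, 0.3, 0.6, 1.1, 1.15, 1.2, 3.0]
hist = ks_histogram(toy_ks)
print(hist)
```

In analyses of this kind, a secondary peak above the background of young duplicates is commonly read as a candidate signature of an ancient genome duplication.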
exponential decrease in