Beruflich Dokumente
Kultur Dokumente
Submited to Submited By
Varshapriya J.N
Anshul Vyas
(Computer & IT department)
(M.Tech 2nd Year)
Target
Next Target
Challenging keywords
Rna sequence
Allignment
Transcript/Transcriptome
Splice junction
Intron/Extron
Rna Sequence
Spliced transcripts,
Post-transcriptional modifications,
Gene fusion,
Allignment
Transcriptome
Splice Junction
Splicing
Current limitations:
Study TOPHAT
It is an open source.
FASTQ Format
@Read_id_1
CTGATGTGCCGCCTCACTTCGGTGGT
+
@@@DDDDDH8<BAHG@BHGIHIII>(
@Read_id_2
TGATGTGCCGCCTCACTACGGTGGTG
+
FHHHHHJIJIJIJIIIJJIIJGIGII
@Read_id_3
...
The name/ID of the read, preceded by a "@". For read pairs, there will be two entries with that name, either in the same or a
second FASTQ file.
A "+" sign. In very old FASTQ files, this is followed by the read name from the first line. Today, this line is present for historical
reasons backwards compatibility only.
The quality scores of the bases from line 2. The scores are generated by the sequencing machine, and encoded as ASCII
(33+score) characters. The line should have the same length as line 2, as there is one quality score per base.
BlackBox Architecture
FASTQ-2
FASTQ-1
?
Mapped
Sequence
Unmapped
Sequence
Next tasks
Next tasks
Thank You