Genome project
by
N. AdithiSridhar,
Bioinformatics.
Before the HGP
The Human Genome
• Address the ethical, legal, and social issues (ELSI) that may arise
from the project
During the HGP
Comparing the Human Genome
with other Genomes
➲ Gene numbers of different species
➲ Humans: 31,000
➲ Thale cress: 26,000
➲ Nematode worm: 18,000
➲ Fruit fly: 13,000
➲ Yeast: 6,000
➲ Tuberculosis microbe: 4,000
Comparing the Human Genome
with that of Mus musculus (mouse)
• The human genome has about 400 million more nucleotides than
the mouse.
Since each restriction enzyme cuts the DNA at slightly different points, the
genome is broken up in such a way that there is a degree of overlap
between adjacent DNA segments – a fundamental requirement for
determining the complete sequence.
The subsequent incorporation of the DNA fragments into living cells, such as
bacteria or yeast, stores the DNA fragments and enables more copies to be
made as the cells reproduce.
Organizing mapped large clone contigs
Contig: Joined overlapping collection of
clones or sequences
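As a rough illustration of how overlapping fragments can be joined into a contig, here is a minimal greedy-merge sketch. The function names, the `min_len` threshold, and the example fragments are assumptions made for illustration. Real assemblers also deal with sequencing errors, repeats, and reverse-complement reads, all of which this sketch ignores.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of a that equals a prefix of b.

    Returns 0 when no overlap of at least min_len bases exists.
    """
    max_len = min(len(a), len(b))
    for k in range(max_len, min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(fragments: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of fragments with the largest overlap."""
    contigs = list(dict.fromkeys(fragments))  # drop exact duplicates, keep order
    while True:
        best_len, best_i, best_j = 0, -1, -1
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_len:
                        best_len, best_i, best_j = k, i, j
        if best_len == 0:
            return contigs  # nothing left to merge
        merged = contigs[best_i] + contigs[best_j][best_len:]
        contigs = [c for idx, c in enumerate(contigs) if idx not in (best_i, best_j)]
        contigs.append(merged)


# Hypothetical fragments that overlap by four bases each.
print(greedy_assemble(["ATGCGTAC", "GTACCTGA", "CTGAAGT"]))
# ['ATGCGTACCTGAAGT']
```

The greedy choice is only a heuristic: it can produce a wrong join when the same sequence occurs in more than one place in the genome.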
Shotgun clone
Sequencing
Assembly
Gene mapping
GENETIC LINKAGE
• Law of independent assortment:
Genes are transmitted from parents to offspring independently of
one another.
• Genes that are located on the same chromosome are described
as linked genes.
C locus: C C C c c c
D locus: D D D d d d
E locus: E E E e e e
[Figure: alternative arrangements of the C/c, D/d, and E/e alleles]
One centimorgan
• The distance between two genes between which recombination occurs
with a frequency of 1%.
[Figure: linkage map with genes in the order A, C, B; labeled distances 0.6 and 0.4, summing to 1.0]
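The centimorgan definition can be turned into a small calculation. The sketch below converts a recombination frequency into map units and uses the additivity of distances to decide which of three linked genes lies in the middle. The function names are hypothetical. Pairing 0.6 with A and C and 0.4 with C and B is an assumed reading of the map shown above. The simple percentage conversion is only a reasonable approximation for short distances.

```python
def map_distance_cm(recombinant_offspring: int, total_offspring: int) -> float:
    """Map distance in centimorgans: 1 cM corresponds to 1% recombination."""
    return 100.0 * recombinant_offspring / total_offspring


def middle_gene(distances: dict[frozenset, float]) -> str:
    """Given the three pairwise distances among three linked genes,
    return the gene that lies between the other two.

    The two genes that are farthest apart are the outer pair, so the
    remaining gene must be the middle one.
    """
    outer_pair = max(distances, key=distances.get)
    genes = set().union(*distances)
    (middle,) = genes - outer_pair
    return middle


# 10 recombinant offspring out of 1000 gives a distance of 1 cM.
print(map_distance_cm(10, 1000))  # 1.0

# Distances as assumed from the map: A to C 0.6, C to B 0.4, A to B 1.0.
pairwise = {
    frozenset("AC"): 0.6,
    frozenset("CB"): 0.4,
    frozenset("AB"): 1.0,
}
print(middle_gene(pairwise))  # C
```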
Molecular hybridization
1. Single-stranded DNA is generated
Sanger’s method
Collins vs. Venter
Collins Venter
IHGSC and Celera
Hidden Markov model
• A Hidden Markov Model (HMM) system segments
uncharacterized human genomic DNA into exons, introns, and
intergenic regions.
• Three separate models were designed, one for each of the three types
of human DNA (exons, introns, and intergenic regions).
• These models are tied together using biological knowledge about
splice junctions.
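To make the segmentation idea concrete, here is a toy Viterbi decoder over three states. It is a deliberately simplified sketch, not the system described above: each region type is collapsed into a single state rather than a separate model, and every probability is an invented placeholder rather than a trained value. The only structure taken from the slide is that the regions are linked through exon boundaries, which is approximated here by forbidding direct intergenic-to-intron and intron-to-intergenic transitions.

```python
import math

STATES = ("intergenic", "exon", "intron")

# All probabilities below are illustrative assumptions, not trained values.
START = {"intergenic": 0.8, "exon": 0.1, "intron": 0.1}
TRANS = {
    "intergenic": {"intergenic": 0.9, "exon": 0.1, "intron": 0.0},
    "exon": {"intergenic": 0.05, "exon": 0.85, "intron": 0.1},
    "intron": {"intergenic": 0.0, "exon": 0.1, "intron": 0.9},
}
EMIT = {
    "intergenic": {"A": 0.3, "C": 0.2, "G": 0.2, "T": 0.3},
    "exon": {"A": 0.2, "C": 0.3, "G": 0.3, "T": 0.2},
    "intron": {"A": 0.25, "C": 0.15, "G": 0.15, "T": 0.45},
}


def _log(p: float) -> float:
    """Natural log that maps probability 0 to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi(seq: str) -> list[str]:
    """Most probable state path for a non-empty sequence over A, C, G, T."""
    # score[s] is the best log-probability of any path ending in state s.
    score = {s: _log(START[s]) + _log(EMIT[s][seq[0]]) for s in STATES}
    backpointers = []
    for base in seq[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            prev = max(STATES, key=lambda p: score[p] + _log(TRANS[p][s]))
            new_score[s] = score[prev] + _log(TRANS[prev][s]) + _log(EMIT[s][base])
            pointers[s] = prev
        score = new_score
        backpointers.append(pointers)
    # Trace back from the best final state.
    state = max(STATES, key=score.get)
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    return path[::-1]


labels = viterbi("ATTTAGCGCGGCCGTTTTTATTTGCGCC")
print(list(zip("ATTTAGCGCGGCCGTTTTTATTTGCGCC", labels)))
```

Working in log space avoids numerical underflow on long sequences, which matters for genomic inputs far longer than this example.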
Expressed sequence tags
• ESTs are DNA sequences read from both ends of expressed gene
fragments
• The Merck-WashU EST Project and several other public EST projects
are under way to rapidly discover the complement of human
genes and make them easily accessible.
• These ESTs are widely used to discover novel members of gene
families
Genome Assembly and
Annotation Process
• The primary data produced by genome sequencing projects are
often highly fragmented and sparsely annotated
The data are “frozen” at the start of the build process by making
a copy of all of the data available for use at that time
Freezing the data provides a stable set of inputs for the remainder
of the build process
• The Build Cycle
A build begins with a freeze of the input data and ends with the
public release of an annotated assembly of genomic sequences
• Our bodies cannot synthesize phytanic acid; we have to obtain all of it
from our food.
• There are now more than 1,000 genetic tests for human
conditions.
• Comparative genomics
Genomics
Proteomics
Gene Therapy
Risk assessment
Agriculture, Livestock breeding and Bioprocessing
DNA forensics
• identify potential suspects at crime
scenes
• This means that the person undergoing the test should only do so
on a voluntary basis and with a full understanding of all the
implications.
Goals
• identify the protein machines that carry out critical life functions
• characterize the gene regulatory networks that control these machines
• characterize the functional repertoire of complex microbial communities in their
natural environments
• develop the computational capabilities to integrate and understand these data
and begin to model complex biological systems
GTL Applications in
Energy Security and Global Climate
Change
The International HapMap Project
• Although the DNA sequence of any two people is 99.9% identical,
the variations crucially affect an individual’s disease risk
• The points where the sequence differs at a single DNA base are
called single nucleotide polymorphisms (SNPs).
• Sets of SNPs on the same chromosome are inherited in blocks called
haplotypes.
➲ The purpose is to enable the study of genetic associations with disease
ATGCATGCAT ATGCAAGCAT
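The two ten-base sequences above differ at a single base, which is what a SNP looks like in aligned sequence. A minimal sketch of locating such differences follows. The function name is hypothetical, and the code assumes the two sequences are already aligned and of equal length.

```python
def snp_positions(seq_a: str, seq_b: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, base in seq_a, base in seq_b) for every
    site where two aligned, equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (i + 1, a, b)
        for i, (a, b) in enumerate(zip(seq_a, seq_b))
        if a != b
    ]


print(snp_positions("ATGCATGCAT", "ATGCAAGCAT"))
# [(6, 'T', 'A')]
```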
• The project was launched in 2002 with $100 million
• Samples from people in Nigeria, Japan, and China and from those
with northern and western European ancestry living in the US.
• conclusions from this pilot project were published in June 2007 in Nature
&
Russia
• In 1988, the USSR Council of Ministers adopted a resolution
on the creation of a Human Genome Project
India
• India plays a very significant role because of its special social structure