Beruflich Dokumente
Kultur Dokumente
1 2004
Editors-in-Chief
Kelvin Lam – Pfizer, Inc., USA
Henk Timmerman – Vrije Universiteit, The Netherlands
DRUG DISCOVERY
TODAY
TECHNOLOGIES Target identification
titative, and differential detection remains the major chal- ation of precursor ions. MALDI–TOF/TOF allows highly au-
lenge. tomated and reliable identification of thousands of proteins
The current situation in proteomics appears to be charac- per week with sensitivities around 1–20 fmol. The LCQ–FT
terized by differing opinions and strategic uncertainty in a instrument is a hybrid MS system with high mass accuracy
climate requiring imperative large investment, whereas the and resolution (approximately 1 ppm). However, regardless
mainstream strategic thinking of investors is still completely of the ever increasing sophistication of mass spectrometry,
dominated by daunting experiences from the genomics era. the first step of a successful proteomics study is still the
Given the inherent enormous complexity and dynamics of effective separation and detection of proteins [4–8].
protein expression, current high-throughput screening (HTS)
methods bear the risk of producing poorly controlled data Resolution
morasses. We anticipate a phase of high intensity experimen- There are essentially two separation technologies for complex
tal efforts ahead, requiring the application of rigorous differ- samples: 2D-PAGE and multidimensional LC [9] (see Glos-
ential, quantitative and statistical procedures to provide the sary). 2D-PAGE remains the most powerful tool for compar-
necessary high quality data content (post-translational mod- ative protein profiling studies. Modern large zoom gels can
ifications of key biomarker proteins) to permit the next gen- reproducibly and reliably resolve tens of thousands of differ-
eration of relevant HTS processes for proteomic research in ent proteins from complex mixtures, which is superior to all
drug discovery and development. other methods [10,11]. LC methods resolve some hundreds
of peaks, at best a few thousand. A high-throughput variant
Key technologies for proteomics of the LC approach is provided by chromatographic surface
There are two main requirements for a meaningful pro- arrays, interfaced directly to tandem-MS (SELDI, 12; Cipher-
teomics strategy. (a) Going broad: pattern control, as com- gen, http://www.ciphergen.com; see Glossary). This method
prehensive as possible, to exactly find the spot, where it is has a low resolution and is not very sensitive for identifica-
worthwhile to drill deeper. (b) Going deep at the right coor- tion. It has been covered in recent reviews [4,12].
dinate: once a suitable biomarker has been found with suffi- LC and 2D-PAGE have complementary advantages and
cient statistical significance, it is necessary to characterize its disadvantages, as discussed below. In terms of resolution,
molecular features and interactions in more depth. gels are far superior, with patterns confined to a molecu-
This review attempts to cover the fast developing field lar weight range of 10–250 kDa. Chromatographic methods
of proteomics in terms of five key technologies. Separation, have much poorer resolution, but have advantages at lower
detection, identification of proteins and subsequent data molecular weights (1–20 kDa), and on-line-MS capability.
mining, data management, and interpretation, would each Moreover, 2D-PAGE has a single important advantage, which
require separate discussions, with some of the major chal- most LC-based methods do not have, and which is rarely
lenges (all worthy of particular considerations) at the inter- discussed: a new gel is used for each experiment; they cannot
faces of various platforms. This review, therefore, is focussed be used again, in contrast to HPLC columns or chromato-
on differential and quantitative pattern control in pro- graphic arrays. Anyone taking the pains to investigate, for
teomics. This implies coverage of three labeling techniques example, reversed phase or ion exchange resins will discover
(fluorescent cyanine dyes, radioactive, and stable isotopes), that after a while they residually accumulate certain pep-
the detection of labeled proteins in 2D gels and liquid chro- tides, gradually changing adsorptive properties. The use of a
matography (LC; see Glossary) and, most importantly, the new gel overcomes these problems.
quantification of identified differential proteins. The corre-
sponding pattern analysis and interpretation will be treated as Dynamic range and sensitivity of detection
well. Standard protein staining in gels (silver, colloidal Coomassie
Progress in identification of proteins by mass spectrometry or fluorescent dyes) has moderate sensitivities (approxi-
has been covered recently in related reviews [4–8], and will mately 1 fmol) and clear disadvantages in terms of the dy-
be mentioned here only in the context of differential pro- namic range of protein concentrations. The two to three or-
tein quantification by stable isotope techniques. Briefly, the ders of magnitude of these procedures do not match the pro-
most recent developments involve tandem mass spectrom- tein reality (8–15 orders of magnitude) and thus, a consider-
eters with advanced MS/MS capabilities, like MALDI–TOF/ able amount of information is lost. The same is true for detec-
TOF–MS (matrix-assisted laser desorption ionization/time tion of protein profiles from LC or chromatographic arrays,
of flight; Bruker, Applied Biosciences http://www.bdal.com; which is usually achieved directly in the mass spectrometer,
http://www.appliedbiosystems.com, see Glossary) and LCQ– with also moderate dynamic ranges of approximately three
FT–MS (http://www.thermofinniganmat.de). The ‘‘TOF/TOF’’ orders of magnitude and moderate sensitivities in the range
technique provides the possibility to sequence selected pep- of 5 fmol. New methods in MS, like Fourier transform–ion
tides, by analyzing fragment ions generated by the dissoci- cyclotron resonance (FT–ICR; see Glossary), are potentially
2 www.drugdiscoverytoday.com
Vol. 1, No. 1 2004 Drug Discovery Today: Technologies Target identification
much more sensitive, but still rather with peptide standards an art in itself, but eventually leads to detection sensitivities
than with real protein mixtures from biological samples. at attomol levels, with a dynamic range of over six orders of
These dynamic range and sensitivity problems, as well as magnitude and statistically significant quantification even of
the reliable quantification of small differences, can be solved changes in the 15–20% range [13]. In our opinion there is no
using radioactive isotopes. Appropriate labeling of proteins is alternative to using a small fraction of a sample for radioac-
Figure 1. Quantitative differential pattern control of neuronal differentiation with undifferentiated murine stem cells, early and late neurons. (a)
Approximately 1g protein from three different stages of neuronal differentiation of murine embryonic stem cells (S1–S3) were jointly analysed in one 2D gel,
after separate labelling with three different radioactive iodine isotopes (123 I, 125 I and 131 I). The upper right panel shows a surrogate composite color image, the
lower right panel gives an impression of dynamic range; on the left the three isotopic images obtained from this single gel are shown. The actual quantification
is performed using the radioactive data set (matrix with spot coordinates and quantitative values for each isotope, not shown here) and enables an extremely
reliable and exact quantification and differential detection of even subtle changes of protein expression. (b) This type of analysis of expression levels of
differential proteins is shown for 20 proteins. Standard statistical procedures (ANOVA) ensure the significance of respective quantitative differences: the blue
bars are associated with abundance of the particular protein in the early S1 stage, purple bars with intermediate S2 and green bars with the late S3 stage.
www.drugdiscoverytoday.com 3
Drug Discovery Today: Technologies Target identification Vol. 1, No. 1 2004
4 www.drugdiscoverytoday.com
Vol. 1, No. 1 2004 Drug Discovery Today: Technologies Target identification
www.drugdiscoverytoday.com 5
Drug Discovery Today: Technologies Target identification Vol. 1, No. 1 2004
biophysical from separation, mass spectra, and literature), indsay, M.A. (2003) Target discovery. Nat. Rev. Drug Discov. 10, 831–838
with the aim to generate hypotheses. At this point, tran- Boguski, M.S. McIntosh, M.W. (2003) Biomedical informatics for
proteomics. Nature 422, 233–237
scriptomic and genomic data should be integrated [18].
Nebert, D.W. (2003) Pharmacogenomics and “individualized drug
The fast and high-throughput identification of proteins by therapy”: high expectations and disappointing achievements. Am. J.
MALDI only uses a fraction of the information available Pharmacogenomics 3, 361–370
in the spectra (unmodified peptides), and hypotheses can
be falsified or verified using the whole detected peptide
pool of a given data set, by educated sequencing of selected dimension, but instead uses two different ionic detergents,
peptides. 16-BAC, benzyldimethyl-n-hexadecylammonium chloride,
This usually is also the stage where, guided by initial pro- and SDS], or 1D-MS/MS for membrane proteins [19,20].
tein lists, more specific affinity tools, like antibodies (for 2D Compared to LC-based methods 2D-PAGE is slower and
western blots or enrichment, see Glossary) or immobilized more laborious (despite considerable progress in automa-
ligands for certain targets (‘‘Chemical proteomics’’) can be tion), but it does provide superior information. In principle,
used to address more focussed questions, relating to im- 2D-PAGE can analyze 70–80% of the proteome of a given
mediate functional interactions. Chemical proteomics has sample and, moreover, the residual proteome can be salvaged
the potential to directly identifying targets, and some com- and treated with alternative approaches if appropriate. Mul-
panies offer related technologies and platforms (Serenex, tidimensional LC–MS methods have high-throughput and
http://www.serenex.com/; Cellzome, http://www.cellzome. clear advantages for small proteins and peptides (1–20 kDa),
com/; and ProteoSys http://www.proteosys.com/). It remains but lose a lot of relevant information because of their con-
to be seen whether these approaches can be translated di- finement to cysteine-containing peptides (in a total tryptic
rectly to relevant HTS systems in drug discovery or if they will digest of human peptides only 3% of the peptides con-
still need to be individually adjusted in specifically designed tain cysteine; Fig. 3), moreover, the inherent complexity of
experimental paradigms. samples might remain invisible because of limited pattern
resolution. SELDI, with chromatographic arrays, has similar
Comparison of technologies pros and cons. Resolution is inferior; in typical examples
The technologies compared here include 2D-PAGE with cya- only a few dozen peaks were detected [21,22]. Differential
nine dyes (DIGE), radioactively labeled proteins and stable analysis is also essentially static matching the resolution
isotope-tagged proteins, on the one hand, and LC-based of multiple experiments. Nevertheless, even on this level
multidimensional protein separation technologies, with on- (a few percent of a given proteome) interesting results can
line MS or SELDI, on the other hand. All these methods have be found.
advantages and disadvantages and should be regarded as
complementary. 2D-PAGE is absolutely superior for pattern
Conclusions
analysis of complex samples; a comprehensive control of 2D
Protein biomarkers enable the stratification of patient popu-
patterns requires real differential display strategies, which
lations and promote the quantification of new drug effects
are not available with standard staining procedures. The
in terms of molecular targets, altered pathophysiological
dynamic range and sensitivities of DIGE and stable isotope
approaches with 2D gels are moderate (approximately three
orders of magnitude, >10 fmol per spot); radioactive displays
are far superior for differential detection and in particular Links
quantification (approximately eight orders of magnitude, r Human Proteome Organization: http://www.hupo.org
>10 amol per spot), but are more laborious to employ. Gen- r Institute for Systems Biology: http://www.systemsbiology.org/
r Swiss Institute of Bioinformatics: http://us.expasy.org/
erally, 2D gels are said to be difficult to use with certain
r Nature, Supplement Proteomics:
proteins, like membrane proteins and basic proteins, but
http://www.nature.com/nature/insights/6928.html
for these subproteomes there are always excellent alterna- r National Center for Biotechnology Information,
tives, for example, 16-BAC 2D-PAGE [this method avoids http://www.ncbi.nlm.nih.gov/
chaotropic reagents and isoelectric focussing for the first
6 www.drugdiscoverytoday.com
Vol. 1, No. 1 2004 Drug Discovery Today: Technologies Target identification
www.drugdiscoverytoday.com 7
Drug Discovery Today: Technologies Target identification Vol. 1, No. 1 2004
5 Tyers, M. and Mann, M. (2003) From genomics to proteomics. Nature matrix-assisted laser desorption/ionisation time-of-flight mass
422, 193–197 spectrometry, and analysis of relative isotopologue abundances of
6 Fusaro, V.A. and Stone, J.H. (2003) Mass spectrometry-based proteomics peptides. Rapid Commun Mass. Spectrom. 17, 1273–1282
and analyses of serum: a primer for the clinical investigator. Clin. 15 Vogt, J. et al. (2003) Quantitative protein expression analysis and
Exp. Rheumatol. 21, S3–14 determination of amino acid precursor pool enrichment in stem cells
7 Purcell, A.W. and Gorman, J.J., (2004) Immunoproteomics: Mass performed with stable isotope amino acid in vivo labelling and
spectrometry based methods to study the targets of the immune MALDI-TOF-MS. Mol. Cell. Proteomics 2 (9), 795
response. J. Proteome. Res. 3(3), 572–581. 16 Washburn, M.P. et al. (2001) Large-scale analysis of the yeast proteome by
8 Aebersold, R. and Mann, M. (2003) Mass spectrometry-based proteomics. multidimensional protein identification technology. Nat. Biotechnol. 19,
Nature 422, 198–207 242–247
9 Wang, H. and Hanash, S. (2003) Multi-dimensional liquid phase based 17 Sommer et al. Molecular analysis of homocysteic acid-induced neuronal
separations in proteomics. J. Chromatogr. B Analyt. Technol. Biomed. Life stress. J. Proteome Res. 3, [in press]
Sci. 787, 11–18 18 Boguski, M.S. and McIntosh, M.W. (2003) Biomedical informatics for
10 Poland, J. et al. (2003) Isoelectric focusing in long immobilized pH proteomics. Nature 42, 233–237
gradient gels to improve protein separation in proteomic analysis. 19 Marvin, L. et al. (2000) Identification of proteins from one-dimensional
Electrophoresis 24, 1271–1275 sodium dodecyl sulfate-polyacrylamide gel electrophoresis using
11 Ravichandran, V. et al. (2004) Ongoing development of two-dimensional electrospray quadrupole-time-of-flight tandem mass spectrometry. Rapid
polyacrylamide gel electrophoresis data standards. Electrophoresis 25, Commun. Mass Spectrom. 14, 1287–1292
297–308 20 Hartinger, J. et al. (1996) 16-BAC/SDS–PAGE: a two-dimensional gel
12 Tang, N.P. et al. (2004) Current developments in SELDI affinity electrophoresis system suitable for the separation of integral membrane
technology. Mass Spectrom. Rev. 23, 34–44 proteins. Anal. Biochem. 240, 126–133
13 Cahill, M.A. et al. (2003) Analysis of relative isotopologue abundances for 21 Petricoin, E.F. et al. (2002) Use of proteomic patterns in serum to identify
quantitative profiling of complex protein mixtures labelled with the ovarian cancer. Lancet 359, 572–577
acrylamide/D3-acrylamide alkylation tag system. Rapid Commun. Mass 22 Xu, Q. Lam, K.S. (2003) Protein and chemical microarrays-powerful tools
Spectrom. 17, 1283–1290 for proteomics. J. Biomed. Biotechnol. 5, 257–266
14 Vogt, J.A. et al. (2003) Protein abundance quantification in embryonic 23 Somiari, R.I. et al. (2003) High-throughput proteomic analysis of human
stem cells using incomplete metabolic labelling with 15 N amino acids, infiltrating ductal carcinoma of the breast. Proteomics 10, 1863–1873
8 www.drugdiscoverytoday.com