Beruflich Dokumente
Kultur Dokumente
About me
B.Sc Computer Science Ph.D. Chemoinformatics (Willett Lab) Worked for 5 years in Scientific Computing leadership at Pfizer, responsible for the development of computational tools for scientists Now run a consulting firm based in Ann Arbor, Mich., and am also an Adjunct Professor at the University of Michigan.
VIRTUAL SCREENING
Isolate protein Using a computer to predict activity
COMBINATORIAL CHEMISTRY
Rapidly producing vast numbers of compounds Find drug
MOLECULAR MODELING
Computer graphics & models help improve activity
Preclinical testing
Analyzing risk
Series Followup Produce 2-3 lead compounds ADME study Are compounds safe?
Summary
The pharmaceutical industry is a high-risk industry with very long development times and short product lifespans There has been a lot of investment in new technologies for early stage drug discovery, but so far these are not resulting in more drug candidates (or profits) Companies are looking at ways to address this problem including managing attrition, risk analysis and metrics.
Making information available at the right time and the right place Needs to be integrated into processes
Screening perhaps millions of compounds in a corporate collection to see if any show activity against a certain disease protein
High-Throughput Screening
Drug companies now have millions of samples of chemical compounds High-throughput screening can test 100,000 compounds a day for activity against a protein target Maybe tens of thousands of these compounds will show some activity for the protein The chemist needs to intelligently select the 2 - 3 classes of compounds that show the most promise for being drugs to follow-up
Informatics Implications
Need to be able to store chemical structure and biological data for millions of datapoints
Computational representation of 2D structure
Need to learn as much information as possible from the data (data mining)
Apply statistical methods to the structures and related information
BioReason ClassPharmer
Clusters actives into groups representing series Attempts to find a scaffold using MCS algorithm Recovers inactives back into series Presents series as rows in a spreadsheet view Gives other statistics on series, such as activity distribution http://www.bioreason.com
BioReason Classpharmer
www.bioreason.com
BioReason Classpharmer
www.bioreason.com
Missed opportunities
Not a specific breakdown, but if the right piece of information had been available at the right time, better decisions could have been made E.g.
A group of compounds is being followed up as potential drugs, but a rival company just applied for a patent on the compounds A large amount of money is being spent developing an HTS assay for a target, but marketing research shows any drug is unlikely to be a success A group of compounds is selected from an HTS as good candidates for follow up, but 20 years ago they were followed up for a similar project and had severe solubility problems
What do we do?
No large company has really solved the problem But ongoing attempts include:
Defining information produced and needed at each stage of the discovery process Improving processes to be more consistent, especially across different sites Improving information flow between departments and sites Harmonizing terminology across disciplines and sites Defining needed management information as well as raw data Looking for quick win opportunities
This will presumably impact what is stored in databases and what software is used
Oracle Chemistry Cartridges help
Understanding the link between diseases, genetic makeup and expression of proteins
Genomics
Genomics is fast-forwarding our understanding of how DNA, genes, proteins and protein function are related, in both normal and disease conditions Human genome project has mapped the genes in human DNA Hope is that this understanding will provide many more potential protein targets Allows potential personalization of therapies
ATACGGAT TATGCCTA
functions
Gene Chips
Gene chips allow us to look for changes in protein expression for different people with a variety of conditions, and to see if the presence of drugs changes compounds administered that expression Makes possible the design of drugs to target different phenotypes
expression profile (screen for 35,000 genes) people / conditions e.g. obese, cancer, caucasian
Virtual Screening
Build a computational model of activity for a particular target Use model to score compounds from virtual or real libraries Use scores to decide which to make, or pass through a real screen
Scoring methods
Profile compounds based on properties related to target
Fast Docking
Rapidly dock 3D representations of molecules into 3D representations of proteins, and score according to how well they bind
Model will come out with with either prediction of how well each molecule will bind, or a score for each molecule
Combinatorial Chemistry
By combining molecular building blocks, we can create very large numbers of different molecules very quickly. Usually involves a scaffold molecule, and sets of compounds which can be reacted with the scaffold to place different structures on attachment points.
Examples
OH NH
R1
O OH C
OH
R2 R3
OH NH
NH
OH
CF3 O CH3 O C NH OH
Software Relevance
To be able to make software relevant requires the software designer to understand:
the science, i.e. the domain the scientific computing techniques that are used in the domain the possibilites and limitations of software development.
Even with this, its hard to match the things we can do with the things that people want or need to do Techniques like personas and contextual inquiry simply help us understand the people who use the software, their goals, and tasks they want to do
goals: e.g. produce compounds that have high biological activity tasks: work out a chemical synthesis
tools
chemoinformatics
science
Software Usability
Tend to focus on the method and the science, but not how easy it is for people to get their job done using the software Programmers tend to make software intuitive for them, but not necessarily the people it is designed for A usability lab and other techniques can make a HUGE difference to the satisfaction of users and programmers alike!
Software Management
Disparate set of tools & platforms Disparate programming styles, languages A variety of people tend to be writing software
Trained software developers Enthusiastic scientists Scientific computing specialists
Focus on the science tends to mean software management is neglected Everyone hates traditional software management rules But there are ways of making everything work better and having more fun doing it! Have a recommended basic setup that should help a lot
Foundation reading
The Inmates Are Running the Asylum by Alan Cooper Contextual Design by Hugh Beyer and Karen Holtzblatt Usability Engineering by Jakob Nielsen The Visual Display of Quantitative Information by Edward Tufte Dont Make Me Think! by Steve Krug
Summary
R&D in the pharmaceutical industry is undergoing a lot of technological changes, and there is pressure to make the investment pay off There is a big need to sensibly use the large amounts of chemical and biological-related information produced in the process Thoughtful use of chemoinformatics methods and software is becoming crucial to the success of drug discovery