Sie sind auf Seite 1von 101

PCMODEL V 8.

Molecular Modeling Software for Windows Operating System Apple Macintosh OS Linux and Unix

Serena Software Box 3076 Bloomington, IN 47402-3076 (812)-333-0823

1993-2002 Serena Software. All rights reserved. First Edition June 1998 Second Edition June 1999 Third Edition January 2001 Fourth Edition April 2002

Trademarks IBM and IBM PC are registered trademarks of International Business Machines Inc. Windows is a registered trademark of the Microsoft Corporation. Alchemy and Sybyl are registered trademarks of Tripos Associates, Inc. Chem-3D is a registered trademark of Cambridge Scientific Computing, Inc. ISIS is a trademark of MDL Information Systems, Inc.

License Agreement Serena Software provides these programs and licenses their use in your teaching and research. You assume responsibility for the use of the program to achieve your intended results. You may: 1. Use the programs on any machine or machines in your possession. 2. Copy the programs and source code into any machine readable form for backup or modification purposes in support of your use of the programs. 3. Modify the programs and source code or merge them into another program for your use. (Any portion of these programs or source code or merged programs resulting will continue to be subject to the terms and conditions of this agreement.) You may not: 1. Give copies of these programs to anyone else, unless you destroy all copies you have in any form. 2. Sell copies of these programs. 3. Use the programs on a network, unless others are prevented from accessing the programs. This license is effective until terminated. You may terminate it at any time by destroying the programs and source code together with all copies, modifications and merged portions in any form. These programs are provided "AS IS" and without warranty of any kind, either expressed or implied. The entire risk as to the quality and performance of the programs is with the user. Serena Software does not warrant the operation of the programs will be uninterrupted or error free. If Serena Software is unable to deliver a copy of the programs that is satisfactory, you may obtain a full refund by returning all the program materials and documentation within 30 days of date of delivery. These programs are intended for use in chemical research and teaching. Serena Software will not be liable in any event to you for any damages, including any lost profits, lost savings or other incidental or consequential damages arising out of the use or inability to use these programs even if Serena Software has been advised of the possibility of such damages, or for claim by any other party.

Table of Contents
Table of Contents ..................................................................................................................................... 4 Chapter 1. Overview ............................................................................................................................... 8 1.1 Introduction................................................................................................................................... 8 1.2 Changes in Version 7 ..................................................................................................................... 8 1.3 Changes in Version 7.5 .................................................................................................................. 8 1.4 Changes in Version 8.0 .................................................................................................................. 8 1.5 History ........................................................................................................................................... 9 1.6 The PCMODEL Window............................................................................................................. 10 1.7 Definitions .................................................................................................................................. 10 1.8 Closing Notes ............................................................................................................................... 11 Chapter 2. Installation........................................................................................................................... 12 2.1 Windows Installation.................................................................................................................... 12 2.2 Macintosh Installation .................................................................................................................. 13 2.3 Linux Installation ......................................................................................................................... 13 Chapter 3. Quick Summary of PCMODEL Menu Bar Functions ............................................................ 15 Chapter 4. Tutorials .............................................................................................................................. 19 4.1 Methylcyclohexane ....................................................................................................................... 19 4.1.1 Drawing the Structure............................................................................................................ 19 4.1.2 Building the Structure............................................................................................................ 21 4.1.3 Minimizing and Adjusting the Structure ................................................................................ 24 4.2 Trans-Decalin.............................................................................................................................. 25 4.3 Special Options ............................................................................................................................ 26 4.3.1 Compare ................................................................................................................................ 26 4.3.2 Rotational Energy Barriers..................................................................................................... 26 4.4 Benzene and a Pi Calculation ....................................................................................................... 27 4.5 Biphenyl and a Pi Calculation ...................................................................................................... 27 4.6 Ferrocene - Using Metals and Coordination.................................................................................. 28 4.7 The Methanol dimer - Hydrogen Bonding and Docking................................................................ 28 4.8 Substructures - Creation and Manipulation ................................................................................... 28 4.8.1 Building Polystryene using Dummy Atoms ............................................................................ 29 4.9 Diels-Alder Transition State ......................................................................................................... 30 Chapter 5. Drawing Tools ..................................................................................................................... 31 5.1.1 Select Atom ........................................................................................................................... 31 5.1.2 Select Bond............................................................................................................................ 31 5.2 Draw ............................................................................................................................................ 31 5.3 Build ............................................................................................................................................ 31 5.4 Update.......................................................................................................................................... 31 5.5 H/AD ........................................................................................................................................... 31 5.6 Add_B.......................................................................................................................................... 32 5.7 In ................................................................................................................................................. 32 5.8 Out............................................................................................................................................... 32 5.9 Del ............................................................................................................................................... 32 5.10 Move .......................................................................................................................................... 32 5.11 Rotate Bond................................................................................................................................ 32 5.12 Query ......................................................................................................................................... 32 5.13 PT .............................................................................................................................................. 33 5.14 Metals ........................................................................................................................................ 33 5.15 Rings.......................................................................................................................................... 34 5.16 AA ............................................................................................................................................. 34 5.17 Su............................................................................................................................................... 34 5.18 Nu .............................................................................................................................................. 35

5.18 Organo Metallics Templates ....................................................................................................... 35 5.20 Transition State Templates ......................................................................................................... 36 Chapter 6. File Menu ............................................................................................................................. 37 6.1 Open ............................................................................................................................................ 37 6.2 Save ............................................................................................................................................. 39 6.3 Save Graphic ................................................................................................................................ 43 6.4 Print ............................................................................................................................................. 43 6.5 Read pcmod.bak ........................................................................................................................... 43 6.6 Exit .............................................................................................................................................. 43 Chapter 7 Edit Menu.............................................................................................................................. 44 Draw .................................................................................................................................................. 44 Erase.................................................................................................................................................. 44 Structure name ................................................................................................................................... 44 Hide Hydrogens.................................................................................................................................. 44 Epimer ............................................................................................................................................... 45 Enatiomer .......................................................................................................................................... 45 Remove LP......................................................................................................................................... 45 Retype ................................................................................................................................................ 45 Copy_To_Clipboard ........................................................................................................................... 45 Orient_XY Plane................................................................................................................................ 45 Build Solvent Box/Solvate.................................................................................................................. 45 Build Periodic Box ............................................................................................................................. 46 Chapter 8 View Menu ............................................................................................................................ 47 Control Panel ..................................................................................................................................... 47 Labels ................................................................................................................................................ 47 Mono/Stereo....................................................................................................................................... 48 Stick Figure........................................................................................................................................ 48 Ball and Stick..................................................................................................................................... 48 Pluto .................................................................................................................................................. 49 Tubes ................................................................................................................................................. 49 CPK_Surface...................................................................................................................................... 49 Dot_Surface ....................................................................................................................................... 49 Ribbon ............................................................................................................................................... 49 Red/Green Stereo ............................................................................................................................... 49 Top View ........................................................................................................................................... 49 Dipole Vector..................................................................................................................................... 49 Chapter 9 Compute and Force Field Menus........................................................................................... 50 Minimize ........................................................................................................................................... 50 Single Point........................................................................................................................................ 51 Use External Charges ......................................................................................................................... 51 Normal Vibrational Modes ................................................................................................................. 51 Mopac ................................................................................................................................................ 52 Ampac ............................................................................................................................................... 52 Gaussian ............................................................................................................................................ 52 Orbitals .............................................................................................................................................. 52 Vibrations .......................................................................................................................................... 52 GMMX .............................................................................................................................................. 53 Metropolis-MC................................................................................................................................... 53 Vibrational Mode Search.................................................................................................................... 53 Auto Dock.......................................................................................................................................... 53 Manual Dock...................................................................................................................................... 54 Batch.................................................................................................................................................. 54 Rot_E................................................................................................................................................. 55 Dihedral Driver .................................................................................................................................. 55

Chapter 10 Analyze Menu...................................................................................................................... 57 Surface Area ...................................................................................................................................... 57 Volume .............................................................................................................................................. 57 Compare ............................................................................................................................................ 57 Dihedral Map ..................................................................................................................................... 58 Movie................................................................................................................................................. 58 Dot Map............................................................................................................................................. 59 MultiStructure.................................................................................................................................... 59 Assign Symmetry ............................................................................................................................... 60 Chapter 11 Substr Menu......................................................................................................................... 61 Read................................................................................................................................................... 61 Create ................................................................................................................................................ 61 Move.................................................................................................................................................. 61 Connect.............................................................................................................................................. 62 Erase.................................................................................................................................................. 62 Show Dummy..................................................................................................................................... 62 Don't Minimize .................................................................................................................................. 62 Chapter 12 Mark Menu .......................................................................................................................... 63 H_Bonds ............................................................................................................................................ 63 Pi Atoms ............................................................................................................................................ 63 Metal Coordination ............................................................................................................................ 64 TS_Bond Orders ................................................................................................................................ 64 Fix_Distances..................................................................................................................................... 64 Fix Angle ........................................................................................................................................... 64 Fix_Torsions ...................................................................................................................................... 64 Reset .................................................................................................................................................. 65 Chapter 13 Options Menu ...................................................................................................................... 66 Printout .............................................................................................................................................. 66 Dielc .................................................................................................................................................. 66 DPDP................................................................................................................................................. 66 Minimizer .......................................................................................................................................... 66 MMX_PI Calculation Options ............................................................................................................ 67 Added Constants ................................................................................................................................ 67 Standard Constants ............................................................................................................................ 67 Stereo................................................................................................................................................. 67 Pluto .................................................................................................................................................. 67 Vdw Surface....................................................................................................................................... 68 Dot Surface ........................................................................................................................................ 68 Chapter 14. Conformational Searching .................................................................................................. 70 Setup Rings........................................................................................................................................ 71 Setup Bonds ....................................................................................................................................... 71 Setup Query ....................................................................................................................................... 72 Comparison Method........................................................................................................................... 72 Options .............................................................................................................................................. 73 Run GMMX....................................................................................................................................... 74 Run 2nd Cycle ..................................................................................................................................... 74 Save Job ............................................................................................................................................. 74 Read Job............................................................................................................................................. 74 15 File Formats ...................................................................................................................................... 75 15.1 PCM File format......................................................................................................................... 75 15.2 MMX INPUT FILES .................................................................................................................. 79 15.3 Mopac Files................................................................................................................................ 92 15.4 X-RAY Files .............................................................................................................................. 93 16 The MMX Force Field....................................................................................................................... 94

16.1 The MMX Force Field................................................................................................................ 94 16.2 Minimization with PCMODEL (hints)........................................................................................ 95 16.3 TRANSITION STATE ATOMS................................................................................................ 96 16.4 PI CALCULATIONS ................................................................................................................. 97 16.4.2 HEAT of FORMATION of PI SYSTEMS ............................................................................ 98 16.5 TRANSITION METALS............................................................................................................ 98

Chapter 1. Overview

1.1 Introduction
PCMODEL is an interactive molecular modeling program for the study of organic and inorganic molecules. PCMODEL is simple to use and has a number of structure creation, display and energy minimization options. In this manual, we will discuss installation of the program, the features of the various menus, the different force fields supported by PCMODEL, and the formats of the various files that can be read and written. A short tutorial on structure building and manipulation is also included. A complete discussion of the methods of molecular mechanics is beyond the scope of this manual. General information about molecular mechanics can be found in the book "Molecular Mechanics", U. Burkert and N. L. Allinger, ACS Monograph Series, Number 175 and in "A Handbook of Computational Chemistry: A Practical Guide to Chemical Structure and Energy Calculations", by Tim Clark, J. Wiley and Sons, 1985. More current information can be found in "Journal of Computational Chemistry", edited by N. L. Allinger and the book series "Reviews in Computational Chemistry", edited by K. Lipkowitz and D. B. Boyd.

1.2 Changes in Version 7


Version 7 of PCMODEL introduces several major changes; the most important of which is the addition of support for other force fields. The currently release supports the MM3 and MMFF94force fields in addition to the MMX force field that was in earlier versions of PCMODEL. Planned for future releases is support for the Amber, Charmm and OPLS force fields. In order to support additional force fields we have rewritten the entire force field section of the code, added new minimizers and changed the format of the parameter files. Version 7 includes both a first derivative minimizer and also a full-matrix second-derivative minimizer.

1.3 Changes in Version 7.5


Version 7.5 of PCMODEL introduces several additional changes. The currently release now supports the AMBER force field. Additional items have been added to the Menu Bar. Some items under the View and Compute options have been moved to a new Analyze menu item. The View options now include the ability to change the screen background color, to view structures in red and green stereo, to have a small top view of the structure and to display a dipole vector. The Analyze menu options have added the ability to display the molecular symmetry, and view/analyze multiple structure files. With the multiple structure option one may view in sequential or random order a multiple structure file. One may also do a query on a set of structures, extract structures which match a query or do a global comparison of all structures in a file. The Compute menu option has added the ability to do normal vibration mode calculations and the ability to do conformational searches using low-mode vibrations. The use of solvents has also been improved.

1.4 Changes in Version 8.0


Version 8.0 of PCMODEL includes support for the Oplsaa force field, reading Gaussian Formatted checkpoint files, reading and writing Hondo files. In the Compute Menu there are options to

use charges read from a structure file or Gasteiger charges rather than the default force field charges. There is a Manual Docking option, and there are more options for running Batch calculations. The ability to change the atom colors and the background colors has been added to the Options Menu

1.5 History
PCMODEL is derived from several sources. The graphical user interface originally came from MODEL, written by C. Still. It has been extensively modified for PCMODEL for WindowsTM. MODEL was ported to the IBM-PC and renamed PCMODEL by M. Mark Midland, University of California, Riverside, J.J. Gajewski of Indiana University, Kosta Steliou of Boston University and K.E. Gilbert of Serena Software. K. E. Gilbert, J. J. Gajewski and T.W. Kreek of Indiana University then ported PCMODEL to the Macintosh, Silicon Graphics and Sun workstations. K.E. Gilbert wrote the current version of PCMODEL for Windows, the Apple Macintosh, Unix and Linux. The default force field used in PCMODEL is called MMX and is derived from MM2 (QCPE-395, 1977) force field of N. L. Allinger, with the pi-VESCF routines taken from MMP1 (QCPE-318), also by N. L. Allinger. J. McKelvey of Kodak modified the pi-VESCF routines for open shell species, while J. J. Gajewski did improvements to the heat of formation calculations. MMX increased the number of atom types for the MM2 force field, added the ability to handle transition metals and transition states, and increased the number of parameters included in the database. J. J. Gajewski and K. E. Gilbert made these changes.

1.6 The PCMODEL Window


PCMODEL uses a standard window in all versions of the program. The main window in the WindowsTM version is shown below.

A menu bar appears at the top of the screen, with 10 options - FILE, EDIT, VIEW, COMPUTE, ANALYZE SUBSTR, TEMPLATES, MARK, OPTIONS, FORCE FIELD and HELP. The commands under each of these headings are described in detail in later chapters. A second bar of drawing tools appears along the right hand edge of the screen (this is referred to as the TOOLS menu or Drawtools menu) The Drawtools menu may be moved. At the bottom of the screen is a status bar, which indicates the status of various options.

1.7 Definitions
Within this manual, the terms window and screen will be used interchangeably to refer to the main graphics window in which PCMODEL is running.

10

The term attached atom refers to an atom with only one connection, such as hydrogen attached to a carbon. A connected atom is one that is connected to the given atom and may include an attached atom. Pointing, clicking, hitting or pressing a button all refer to a single click of the left mouse button over the word, button, atom or bond. A substructure is a collection of atoms that may or may not be connected, that have a common value among their substructure numbers and an associated name. Substructures can be moved, rotated, connected, hidden, minimized independently, and docked to one another or to a main structure. When written with a PCM file format, substructure information is maintained, so substructures can be stored. A single atom may belong to more than one substructure. When a substructure is connected to another structure, a bond is formed but the substructure information is maintained. Thus, a substructure can be part of a contiguous structure, and when the structure is moved or rotated the joining bond can be stretched abnormally. Remember that, when connecting substructures, the substructure that is chosen many not be the entire connected structure, thus leading to the stretch of one bond in order to form the connected bond.

1.8 Closing Notes


This manual describes only a few of the possible chemical structures that we can study with PCMODEL. We have tried to do the very best job of representing chemistry with a computer model. No doubt there are many problems still remaining and many faults with our models. If you find problems, or think the calculations are incorrect, please let us know. We are always trying to improve the chemistry of PCMODEL and since we are few and the problems are many, your help is always appreciated. We can be reached at: Phone FAX Email WWW Mail (812)-333-0823 (812)-332-0877 gilbert@serenasoft.com http://www.serenasoft.com/ Serena Software Box 3076 Bloomington, IN 47402-3076

11

Chapter 2. Installation
2.1 Windows Installation
This manual assumes a basic working knowledge of the operation of your computer system, and the WindowsTM software. If there are things mentioned which are not clear, consult your computer's user or system administration help procedures or manuals. PCMODEL for Windows is shipped either as a compressed file on a floppy disk in PKZIP format, or as a series of files on a CD. Installation from a Floppy Disk You should first create a directory on your hard disk to contain the PCMODEL program. Then you will need to change to that directory, copy and decompress the PCMODEL files. To do this, type the following at the DOS command window prompt (this example assumes your hard drive is "c", you are using floppy drive "a", and you wish to call the PCMODEL directory "pcmod"): c:> md \pcmod c:> cd pcmod c:\pcmod> copy a:p*.* c:\pcmod> pkunzip pcm7exe Alternatively you can create a new folder in the File menu and then use WinZipTM to extract the files to the pcmod directory. The following files will be generated: Pcm8.exe mmxconst.prm mm3.prm mmff94.prm amber.prm oplsaa.prm aa.sst nu.sst su.sst rings.sst templt.sst fg.sst orgmet.sst executable copy of PCMODEL text file containing specific constants used by PCMODEL file containing MM3 constants used by PCMODEL file containing MMFF94 parameters file containing AMBER parameters file containing Oplsaa parameters Substructure template files " " " " " "

Installation from a CD You should first create a directory on your hard disk to contain the PCMODEL program. Then you will need to copy all the files from the CD into the new folder on the hard disk. The list of files given above will now be on the hard disk and ready to use. You may now return to Windows and use the Taskbar setup from the Setting option in the Start Menu to install the PCMODEL icon. In the Taskbar properties dialog box choose the Start Menu Programs and then choose Add to add a new item to the Task Bar. Use the Browse option to search for pcm8.exe, then select the program group for the icon or create a new program group.

12

2.2 Macintosh Installation


PCMODEL for the Macintosh is shipped either on a floppy disk as a self-extracting archive or as the fully expanded program on a CD. If you have the floppy disk version drag the PCMODEL_v8.0 Installer file into a folder on the hard drive of the Macintosh. Double click on the Installer icon and the program will automatically extract the files necessary to run PCMODEL. The files included are: PCMODEL_8.0_PPC xx.prm xx.sst Power Macintosh version of PCMODEL text file containing specific constants used by PCMODEL Substructure template files (same as PC)

There is only a Power Macintosh version of PCMODEL v8.0 on the floppy disk and this version will run on Mac OS 9.x and earlier. The CD contains two folders, one for Mac OS 9 and one for Mac OS X. Drag the appropriate folder to the hard drive of the Macintosh. The constants files, *.prm, contain the parameters used in the calculations. These are text files that can be read and edited with a word processor to make parameter editing and addition easier. Please be certain to save these files as text only files when editing them with a word processor like Microsoft Word. If you save them as a standard Word document they will have formatting information embedded in the file and will not be read properly by PCMODEL. Using a simple, non-formatting text editor, such as Teach Text, is recommended.

2.3 Linux Installation


PCMODEL for Linux or Unix is shipped in serveral files, pcm8_XXX.tar.gz and pcm8c.tar.gz. The files must be copied to the hard drive on your computer, unzipped using gunzip and then untarred. The possible files are: Pcm8_24.tar.gz Pcm8_16.tar.gz Pcm8c.tar.gz RedHat and SuSE executable for 24 bit X server RedHat and SuSE executable for 16 bit X server Source code for Linux/Unix version of PCMODEL

PCMODEL uses X and OpenMotif on Linux, however, the executables are statically linked and you will need to have Motif or Lesstif installed only if you will be recompiling the source code. The latest version of Lesstif can be obtained from www.lesstif.org and OpenMotif from www.openmotif.org . Two executable versions of pcm8 are included, pcm8_24 for 32 or 24 bit graphics X servers and pcm8_16 for 16 bit X servers. The source code is included with instructions for generating an 24 bit version of pcm8. Please consult your Linux or X11R6 documentation for setting up your X server. The Linux version of PCMODEL has been tested with both RedHat Linux (up to version 7) and SuSE Linux (v 7). Other versions of Linux and other flavors of Unix can be supported by recompiling the source code. We have recompiled the current source code on SGI and HPUX computers with only one modification of the source. The file graphics.h must be edited and the lines defining the bit depth of the X server must be changed appropriately. For SGI, HPUX and other Unix systems EIGHBIT must be defined and TWENTYFOUR bit undefined (the default settings for Linux are shown below). #undef EIGHTBIT #define TWENTYFOURBIT You will also need to edit the Makefile included with the source code to use these systems as appropriate for your system.. You are now ready to run PCMODEL by double clicking on the PCMODEL icon. PCMODEL normally only uses sixteen colors for drawing structures, however if you want good CPK surfaces you

13

should set your monitors control to Millions of colors (if possible), otherwise use 256 colors. This will have no effect on normal operation of PCMODEL. There are some fundamental hardware differences between a PC, a Macintosh and Linux, such as the Macintosh having only a one button mouse while the PC normally uses a three button mouse. Where this is important we will make a comment in the documentation, but we have tried to make the three versions as similar as we can so functions should look and feel the same in the Windows, Macintosh and Linux versions.

14

Chapter 3. Quick Summary of PCMODEL Menu Bar Functions


File Menu Open Save SaveGraphic Print Read pcmod.bak Exit Edit Menu Draw Erase Structure Name Hide Hydrogens Epimer Enantiomer Remove LP Retype Copy_To_Clipboard Orient_XY Orient_XZ Orient_YZ Build Solvent Box/Solvate Build Periodic Box View Menu Control Panel Labels Mono/Stereo Stick Figure Ball and Stick Pluto Tubular Bonds CPK Surface Dot Surface Ribbon Wht/Black Background RedGreen Stereo Top View Dipole Vector

Reads a file from disk. Saves a file to disk Saves a graphics image to a disk file Prints the current structure and energy windows Read pcmod.bak file Exits PCMODEL

Recovers the TOOLS menu Erases all structures Edit or enter name of current structure Make the hydrogens in a structure invisible but still present Epimerizes the selected atom by interchanging the two selected attachments. Reflects the structure through a specified axis Remove lone pairs from a structure. Forces PCMODEL to reassign atom types. Copies an image of the structure window to the clipboard. Orients the selected three atoms in the XY Plane with the first atom at the origin the second atom along the axis and the third atom in the plane. Build a box of solvent molecules or solvate the current structure. Build a box of the current structure.

Brings up a dialog box to scale, translate and rotate structures in the structure window. Brings up a dialog box to change the structure labels Toggles between mono and stereo display Display all structures as stick figures. Display all structures as ball and stick. Display all structures as Pluto type ball and stick. Display all structures as Tubes. Display all or selected structures with CPK surface. Display all or selected structures with dot surface. Display a Ribbon diagram of peptide or nucleic acid backbone Sets background color Sets view to red/green stereo Turns on top view of current structure Draws a dipole moment vector on current structure.

15

Compute Menu Minimize Single Point Use External Charges Use Gasteiger Charges Normal Vibrational Modes Mopac Ampac Gaussian Orbitals Vibrations GMMX Metropolis MC Vibrational Mode Search Dynam Auto Dock Manual Dock Batch Rot_E Dihedral Driver Analyze Menu Surface Area Volume Compare Dihedral Map Movie

Minimize the current structure(s). Calculate the energy of the current structure(s). Use charges read from Mopac or Ampac file Use charges generated by Gasteiger sigma model Compute normal vibrational modes Runs the quantum chemistry program Mopac. Runs the quantum chemistry program Ampac Runs the ab initio quantum chemistry program Gaussian. Runs the Orbdraw program to view molecular orbitals and electron densities Runs the Vibrate program to view normal vibrational modes Runs the conformational searching program on the current structure. Runs conformational search using Monte Carlo algorithm Runs conformational searching using low frequency vibrational modes as search directions. Molecular Dynamics simulation. Simulated annealing search of lowest energy interaction between two structures. Manually move two structures to dock them Read a multiple structure file and minimize all structures. Rigid rotor evaluation of rotational energy barrier. Grid search with minimization evaluation of rotational energy barrier. Calculate Surface area of molecule. Calculate Volume of molecule. Compare different structures. Display the results of dihedral driver calculations. Replay the structures contained in a multiple structure file. Currently supports PCM, MMX, Gaussian and Cambridge Structural Database file formats. Generate a dot map from a multi conformation file. Analyze a file containing multiple structures. Assign point group symmetry to current structure.

Dot Map Multi Structure File Assign Sym Substr Menu Read Create Move Connect Erase Show Dummy Don't Minimize Templates Menu Rings Amino Acids Sugar Nucleosides OrganoMetallics Transition State

Read a structure file and label the structure as a substructure. Label the selected structure as a substructure Move the selected substructure. Connect two structures. Erase the selected substructure Displays dummy atoms in current structure. Mark the selected substructure so that it will not be minimized in a molecular mechanics calculation. Dialog box to read in selected ring systems. Dialog box to read in and connect Amino Acids. Dialog box to read in and connect Sugars. Dialog box to read in Nucleosides. Dialog box to read in selected organometallic structures. Dialog box to read in selected transition state models.

16

Functional Groups

Dialog box to read in selected functional groups for connection to larger structure.

Mark Menu H Bonds Pi Atoms Metal Coord TS_BondOrders Fix Distance Fix Angle Fix Torsions Equivalent Pi Atoms Reset

Hydrogen Bonds Pi atoms. Set electron count, charge and coordination of metal atoms. Set the bond orders for transition state bonds. Fix a distance between any two atoms-up to 10 pairs Fix an angle. Fix a dihedral angle Mark pi atoms to be made equivalent in MMX calculation. Reset any of the above and also substructure membership.

Options Menu Printout Dielc DPDP Minimizer MMX_Pi Calc Added Constants Standard Constants Stereo Pluto VDW Surface DOT Surface Force Field MMX MM3* MMFF94 Amber Oplsaa Help Menu About

Sets the amount of output in the output file, PCMOD.OUT. Resets the dielectric constant. Change between dipole-dipole and electrostatic calculations. Set options for the minimizers. Set options for Pi calculation Allows user defined constants Rereads the default MMX datafile Changes the direction of rotation of stereo display. Sets options for the Pluto Display. Sets the options for the VDW display. Sets the options for the DOT surface display.

Use the MMX atom types, parameters and force field. Use the MM3 atom types, parameters and force field Uses the MMFF94 atom types, parameters and force field. Uses the Amber 95 atom types, parameters and force field. Uses the Oplsaa atom types, parameter and force field.

Current version of PCMODEL.

17

Quick Summary of TOOLS Menu: Select Draw Build Select atoms. Draw a carbon skeleton in either the plane of the screen or the plane of the first selected atom. Generate 3-D structure by replacing selected hydrogen with methyl group. Starting from ethane, any hydrogen selected will be replaced by methyl. Update the structure and redraw the screen. Add and delete hydrogens and lone pairs. Increment the bond order of the selected bond. Push an atom into plane of screen. Pull atom out of plane of the screen. Delete an atom or bond. Move an atom. Select Move, then atom, then new position. Rotate about the bond selected by four connect atoms. Query selected distance, angle or dihedral. Works by first selecting Query, then selecting two, three or four atoms followed by blank space in drawing window. Queries removed by selecting Update button twice. Dialog box to choose atom type. Dialog box to choose metal atom type. Dialog box to read in selected ring systems. Dialog box to read in (and connect) Amino Acids. Dialog box to read in (and connect) Sugars. Dialog box to read in Nucleosides. Dialog box to read in OrganoMetallic structures Dialog box to read in transition state models.

Update H/AD Add_B In Out Del Move Rot-B Query

PT Metals Rings AA Su Nu OrgMet TransSt

18

Chapter 4. Tutorials
PCMODEL is designed to be easy to use. In this chapter the basics of using the program, along with many of its options, will be covered through example. Before beginning the specific examples, there are some general points about structure entry to be considered. The screen is two dimensional. Initial input occurs at 0 on the z-axis. The screen itself is the xyplane. Thus, a hexagon drawn to represent cyclohexane using the DRAW Command is initially flat. To obtain three-dimensional structures you need to move atoms behind or in front of the z plane with the IN and OUT commands on the TOOLS menu. Once some three-dimensional information is available, further structural input can be done by rotating the structure. If you pick an atom and draw a bond to a new atom, the new atom will lie parallel to the picked atom along the coordinate that is perpendicular to the screen. Complex structures are easier to build using the substructure ADD, ROTATE and MOVE commands. You can use the built-in substructure templates, or build a library of your own by creating structures and saving the minimized coordinates in an MMX or PCM file. Options are selected by pointing to the desired option and clicking (move the cursor until it is over the desired option and click the left mouse button). If the wrong option is chosen, simply choose the correct one and continue. Likewise, atoms are input or selected by pointing to the appropriate spot in the drawing box. When the cursor is over an atom the atom will be highlighted (a box will be drawn around the atom) so you may tell which atom is closest to the cursor. On the PC the right mouse button may always be used to rotate the current structure. Holding the right mouse button down and dragging the mouse will cause the structure to rotate. Moving the mouse to the left or right will rotate about the y-axis, while up and down movement rotates about the x-axis. Holding down the shift key while dragging the mouse will rotate the structure about the Z-axis. On the Macintosh simply hold down the mouse button and drag. More complete control of rotation, translation and scaling is had using the Control panel in the View Menu. The following examples will illustrate a few of the options available for structural input, minimization and display using PCMODEL. A few mistakes will be purposely made along the way to illustrate how they may be corrected.

4.1 Methylcyclohexane
Structures are drawn on the screen in essentially the same way as they would be represented as 3D structures on paper. For example, a hexagon could be drawn. Initially this would be in the XY plane with all Z coordinates equal to zero. Three-dimensional information could be entered by using IN to push one atom in to the screen and OUT to place the opposite atom in front of the screen. Alternatively, OUT could be used to place 3 alternating carbons in front of the screen to provide a chair shape. In this tutorial, however, we will begin by drawing a side-on view of the chair form.

4.1.1 Drawing the Structure


Begin by selecting the DRAW button on the TOOLS menu with the mouse. A dotted outline will indicate that you are in Draw mode. Move the cursor off the TOOLS menu and into the drawing box, somewhere in the upper right hand corner. (We will be drawing cyclohexane as in Figure 4.1.) Click the mouse button once then move the cursor down ~ 3 cm and left another 3 cm, and click again. A line indicating a bond between the first atom and the one just placed will be drawn. Continue drawing in this manner, placing atoms 3, 4 and 5.

19

1 2

4
Figure 4.1 Completed cyclohexane structure with atom numbers displayed. The atom numbers can be displayed on the screen by choosing LABELS from the VIEW menu, then clicking in the circle next to "atom numbers" and clicking on "okay". They can be removed by again choosing LABELS from the VIEW menu, then clicking in the circle next to "Hydrogens and Lone Pairs" At this point, we will purposely make a mistake, by connecting atom 2 to atom 5. To get back into the drawing mode, select DRAW from the TOOLS menu. Then, click on atom 5, then on atom 2. This will create a bond between atoms 2 and 5. (If the cursor is within about 1/4 inch or 0.5 cm or an atom, then that atom will be picked by the program instead of creating a new one.) To correct the mistake, move the cursor to the DEL button on the TOOLS menu and click on it. Now click on the middle of the bond between atoms 2 and 5 (the incorrect bond). The structure will be redrawn with the deleted bond removed. To continue drawing, again click on the DRAW button on the TOOLS menu, then click on atom 5 and continue to draw atom 6 and the bond back to atom 1. At this point the structure should resemble a chair cyclohexane as in Figure 4.1. However, remember that all the atoms are still in the plane of the screen, that is, no 3-D information has been included. To fix the structure, choose OUT from the TOOLS menu, and then click on atom 2 twice. Each click moves the atom 0.33 out of the plane of the screen. Next, point to atom 3 and click 2 times. Now select IN from the TOOLS menu and click on atom 5 twice and atom 6 once (another mistake which we will shortly correct). Rotation of the structure by dragging the mouse will reveal the changes in the structure. The initial view of the molecule is in the XY plane. The Z+ axis comes out towards the user. To view the structure from a different orientation, select CONTROL_PANEL from the VIEW menu. A new window entitled "Dials" will appear. Within this window will be continuous sliders that allow you to change the orientation of the structure. By clicking on the arrow at either end of a slider once, the view will move slightly. Holding down on one of the arrows will provide continuous movement in that direction. Alternatively, you may click on the box in the middle of the slider bar, and drag it while holding down with the mouse button. This will also provide continuous movement of the view until the mouse button is released. The RESET_VIEW button will return the structure to its original orientation. If you rotate the structure in the Y direction, you can see how atoms 2 and 3 line up, but atoms 5 and 6 do not (because atom 6 was purposely moved only once). This could be corrected by returning to the original view, then using IN on atom 6 one more time. However, we will correct this using the MOVE command from the TOOLS menu. After selecting MOVE from the TOOLS menu, click on atom 6, then near atom 5 (the position to which atom 6 should be moved). The structure will be redrawn with the correction. Hydrogens may be automatically added by selecting the H/AD command from the TOOLS menu. However, once you have done this, there is no free valence at which to add the methyl group. This can be corrected in one of two ways. The first alternative is to delete the hydrogens by clicking on H/AD again, then select DRAW and click on atom 1 then on a blank space to the right of atom 1 to add the extra

20

methyl group. (If the structure is too large, there will not be room on the screen to add the methyl group. If this is the case, use the SCALE option, accessed through the CONTROL_PANEL option of the VIEW menu, to shrink the structure to a more manageable size.) The structure would them be completed by reselecting H/AD to add the hydrogens. A second alternative is to change the equatorial hydrogen into a carbon. Select PT (Periodic Table) from the Tools menu. Select C from the list of available atom types, and click on the equatorial hydrogen. Select H/AD to delete hydrogens (but the equatorial carbon will stay), then H/AD again to add hydrogens. The PERIODIC_TABLE window may be removed be selecting CANCEL (the last option in this window). This method of replacing specific hydrogens is very useful for controlling the stereochemistry when building large and complicated structures. A third alternative is to use the Build command from the Tools. In Build mode clicking on a hydrogen replaces that hydrogen with a carbon and then does a hydrogen delete/add sequence so the methyl group is added with only one click of the mouse. It is not important if the structure does not look perfect at this point. Only a crude approximation of the structure is needed for minimization. However, the closer to correct the structure is, the faster the minimization will proceed. Methyl groups pointing in to the middle of a cyclohexane ring will obviously cause problems. Also, if three carbons are attached to a central carbon such that all 4 are in a plane, the program will have difficulty in adding a hydrogen in the correct position. You can check for such errors by using the rotation and translation sliders in the DIALS box to get different views of the molecule. Use IN, OUT and MOVE from the TOOLS menu to correct these problems. Before starting a minimization, it is a good idea to store the structure with a descriptive filename. Select SAVE from the FILE menu. Select the appropriate file type from the list provided by clicking on it with the left mouse button. (The default file type of PCM is preferable.) The "File Name:" box will change to indicate the default extension for that type of file. The "*" in the file name should be changed to a descriptive name for this structure, for example, etcyclo. The path indicating where the file is to be saved may also be changed at this time. When the correct path and filename have been entered, click on "OK" or press return on the keyboard to save the structure in the indicated location. At the beginning, every five iterations during, and at the end of the minimization process a backup file of the structure, called PCMOD.BAK will be written (the format is PCM). If anything should go wrong or should you need to stop the program, you may restart the minimization by reading in this file and continuing from the last saved point.

4.1.2 Building the Structure


A structure may also be constructed by using the BUILD command of the DRAW TOOLS. Begin by selecting BUILD. An ethane will appear (Figure 4.2). Click on the upper right hydrogen to produce propane as shown in Figure 4.3. Next click on hydrogen 10 to provide butane as shown in Figure 4.4. It may be easier to see the 3-dimensional nature of the structure if you turn on Tubular Bonds from the View Menu as in Figure 4.5. However it is usually easier to click on atoms using the stick figure. You can rotate the structure at any time by holding down the mouse button (right button on the PC) and dragging.

21

Figure 4.2.

Figure 4.3

Figure 4.4

Continue the build by clicking on the next hydrogen in the sequence to obtain Figure 4.6 and then to obtain Figure 4.7. Again it may be helpful to turn on Tubular Bonds to see the 3-d structure.

22

Figure 4.5

Figure 4.6

Figure 4.7

Complete the ring by removing the hydrogens (H/AD) and then use Draw to create a bond between carbon 1 and 6. It does not matter if you draw the bond first or remove hydrogens first. You will arrive at Figure 8 either way.

23

Figure 4.8

Figure 4.9.

Figure 4.10

Complete the structure by adding the hydrogens (H/AD) to give Figure 4.9 and then using Build to add the methyl group (Figure 4.10). The easiest method for constructing methylcyclohexane is to read C6 from the RINGS template of the Draw Tools or under Template on the Menu Bar then add the methyl group using BUILD.

4.1.3 Minimizing and Adjusting the Structure


The structure may now be minimized. Select MINIMIZE from the ANALYZE menu, and the calculation will automatically start. Once the minimization process has started, pointing to the Stop Job button may stop it. The cursor will change into an hourglass shape while the minimization is in progress, and will return to the arrow shape when it is completed. The final energy should be around 6.89 kcal. Save this structure, as it will be used later.

24

We now want to compare the energy of the equatorial isomer to that of the axial isomer. We could start from scratch by clearing the current structure by selecting ERASE from the DRAW menu, and drawing in a new chair cyclohexane with an axial methyl group. However, it is usually simpler to modify the present structure. This can be done in several ways. Some choices are: Use MOVE: Select H/AD to remove the hydrogens. Then select MOVE from the TOOLS menu, and click on the equatorial carbon. Next point to a new position above atom 1 where the axial group is to be located. Use H/AD again to add the hydrogens back on to the structure. Check that you have in fact obtained the axial isomer. If not, remove the hydrogens again and move the carbon atom further. Use DEL and the Periodic Table: Bring up the Periodic Table by selecting PT from the TOOLS menu. Choose the C atom, and click on the axial hydrogen to change it into a carbon. Then select DEL from the TOOLS menu and click on the carbon of the equatorial methyl group. It, along with the attached hydrogens, will be automatically deleted. Then click on H/AD twice to remove then replace the hydrogen atoms. Use DEL and BUILD: Select DEL from the TOOLS menu and click on the carbon of the equatorial methyl group. It, along with the attached hydrogens, will be automatically deleted. Then select BUILD and click on the axial hydrogen. Use EPIMER: Click on Sel-Atm on the TOOLS menu. Then click on carbon 1, then to the axial hydrogen and equatorial carbon. Small filled circles will indicate that the atoms have indeed been selected. If hydrogens obscure any of the atoms, use the DIALS box to rotate the structure to a better orientation, or SCALE it up to zoom in on the area of interest. Then select EPIMER from the EDIT menu, and the two attached groups will be switched. Now select MINIMIZE from the ANALYZE menu to find the energy of this structure. It should be about 8.67 kcal or 1.78 kcal higher than the equatorial isomer. Save this file with a different filename from that used for the equatorial isomer.

4.2 Trans-Decalin
We will next illustrate how to build up a simple structure by converting axial methyl cyclohexane into 1-methyl- trans-decalin. Begin by reading in the minimized axial methyl cyclohexane structure previously saved by choosing OPEN from the FILE menu. Use H/AD to remove the hydrogens. Turn on atom numbering (LABELS under VIEW menu), bring up the DIALS box by choosing CONTROL_PANEL from the VIEW menu, and rotate the structure along the X-axis until it has an orientation similar to that in Figure 4.11. Using DRAW from the TOOLS menu, click on atom 1, then continue to place atoms 8, 9, 10 and 11, and the final bond back to atom 2.

25

10 9 11 8 7 2 3 1 6 4 5
Figure 4.11 Completed 1-methyl-trans-decalin structure with atom numbers displayed. Carbons 8 through 11 will have the same Z coordinate as atom 1, the last atom selected prior to drawing them. If we had chosen atom 2 as the starting point, then carbons 8 through 11 would have the same Z coordinate as atom 2. Use OUT to push atom 9 out 2 clicks, and IN to move atoms 10 and 11 each in 4 clicks. Use the rotation and translation sliders to check placement of the atoms and MOVE to correct any mistakes. Add the hydrogens back with H/AD and proceed with the minimization.

4.3 Special Options 4.3.1 Compare


The COMPARE option is used to calculate the differences in two structures and to produce an overlaid view of two structures. Begin with axial methyl cyclohexane as the active structure. Select COMPARE from the VIEW menu. Select NEXT_STRUCTURE from the COMPARE menu, and enter the name of the equatorial cyclohexane structure. We will want to see how the axial and equatorial atoms change positions if the remainder of the ring is held constant. Use the Sel-Atm button from the TOOLS menu to mark carbons 3,4 and 5 on the left (axial) isomer, then carbons 3,4 and 5 on the right (equatorial) isomer. The comparison will pair the first atom selected on the first structure with the first atom selected in the second structure, etcetera, so it is important to select the atoms in the same order on the two structures. Once the atoms have been selected, choose CALCULATE from the COMPARE window. PCMODEL will now find the best least squares fit for these three atoms, and will give the rms differences between all corresponding atoms in the Compare Output window, and the structures will be redrawn overlapped, with red lines connecting the atoms that are being compared. If the structures overlap well, the red lines may not be visible. The structures may be rotated in mono or stereo to get a better view of the comparison. Select MOVE from the SUBSTR menu, and choose the substructure "untitled". Use the DIALS box to translate this substructure away from the other one. A new set of atoms for comparison can now be marked on each substructure (using the Sel-Atm option), and the calculation re-done. Click on CANCEL to dismiss the COMPARE window.

4.3.2 Rotational Energy Barriers


PCMODEL can calculate rotational barriers using the DIHEDRAL_DRIVER option in the ANALYZE menu. However, there is a quick way to explore rotational minima using ROT_E from the

26

ANALYZE option. Read in the equatorial methyl cyclohexane structure and convert one of the methyl hydrogens to a carbon using the BUILD option from the Draw Tools menu. After minimization, we will explore the rotational minima of the ethyl group. We will need to Sel-Atm carbon 1 and the first carbon of the attached methyl group (carbon 7). If hydrogens obscure these, rotate the structure until they are not hidden. Choose Sel-Atm from the tools menu, and click on the carbon of the ring where the ethyl group is attached and the first connected carbon atom. One may also use Sel-Bnd and click on the bond to be rotated. Then select ROT_E from the ANALYZE menu. Enter an increment of 10 and an extent of 360. Click on okay (or hit return). The program will then produce a plot and of energy vs. angle (this may take a few seconds to appear) similar to that shown. The energy scale is relative to the energy of the starting conformation and the angles are in degrees. The plot can be printed. To return to the structure window: In the Windows version pull down the File menu and choose Exit (the plot window has it's own menu bar and under the File menu there will be two choices, Print and Exit); In the Macintosh version choose Stick Figure from the View Menu.

4.4 Benzene and a Pi Calculation


Select DRAW from the TOOLS menu and draw a hexagon. Next select ADD_B from the TOOLS menu to change every other bond in the ring to a double bond, by clicking in the middle of the bonds. Finally, use H/AD to add the hydrogens to create benzene. Although the structure now looks like benzene, it is not ready to be minimized because PCMODEL does not have any information about the pi system. Choose PIATOMS from the MARK menu, and the structure will be redrawn with small ~ symbols near each of the pi atoms. (PCMODEL automatically determines which atoms are pi atoms, once it has been told to look for them.) Minimize the molecule using MINIMIZE from the ANALYZE menu, then save it to disk using the SAVE command from the FILE menu.

4.5 Biphenyl and a Pi Calculation


Begin with the benzene structure studied above as the active structure. Select READ from the SUBSTR menu, and read the benzene file back into the program. There will now be 2 benzene rings on the screen, side by side and several inches apart. Sel-Atm one hydrogen from each substructure, then choose CONNECT from the SUBSTR menu. The screen will be redrawn with the two hydrogens removed and a carbon-carbon bond in their place. Use Sel-Atm from the TOOLS menu to highlight 4 carbons involved in the biphenyl dihedral, then select ROTATE_BOND from the TOOLS menu. Change the dihedral angle to plus or minus 5 degrees, then click on EXIT to return to the main window. Finally, select PIATOMS from the MARK menu since the atoms in the substructure may not be labeled as pi atoms at this time. Now minimize the system by choosing MINIMIZE from the ANALYZE menu. This time all the pi atoms will not lie in one plane. The structure should minimize to a dihedral angle of about 39 degrees with a shallow potential energy surface.

27

4.6 Ferrocene - Using Metals and Coordination


The sandwich complex of iron and two cyclopentadiene rings can easily be modeled with PCMODEL. There are two ways to represent the cyclopentadiene rings: as aromatic rings with two double bonds and a carbon anion, or as a ring of aromatic carbons. Since the first option would require a pi calculation, the second option will be used. This atom type is designed to reproduce the bond lengths and angles of aromatic rings without requiring a special pi calculation. To begin, draw a five-membered ring. Double bonds may be added, but this is not necessary when the aromatic carbon type is used. Use the PT (Periodic Table) option from the TOOLS menu and replace all the ring atoms with atom type Ca (aromatic Carbon). Use H/AD to add hydrogens, then rotate the ring onto its side using the DIALS box (accessed via the CONTROL_PANEL option on the VIEW menu). Save the ring to a file, then read it back in as a new substructure using the READ option in the SUBSTR menu. If the two rings are too close together, Sel-Atm an atom in one and use SUBSTR MOVE to separate them. Select DRAW and place an atom between the two rings. To make this the iron atom select METALS, click on Fe and click on the isolated atom. Select UPDATE to clean up the structure. There should only be one hydrogen on each of the aromatic carbons, whether or not you have drawn in the double bonds in the cyclopentadiene rings. At this time PCMODEL does not know anything about the interaction of the cyclopentadiene rings with the iron atom. Use Sel-Atm to mark the iron atom and all 10 of the Ca atoms, then select METAL_COORD from the MARK menu. A dialog box will appear asking for information about the metal. Since this is an 18 electron complex select "saturated_18_e" for the electron count. Clicking on OK causes the screen to be redrawn with the metal coordination bonds shown in red. The structure can now be minimized. If the structure had been minimized without coordinating the iron to the rings, the iron atom would not have been bonded to anything and would have wandered out of the picture.

4.7 The Methanol dimer - Hydrogen Bonding and Docking


This calculation will demonstrate the hydrogen bonding and docking capabilities of PCMODEL. To begin, draw and minimizing methanol. Select DRAW from the TOOLS menu and draw a 3-carbon skeleton. Next select PT and change the central atom to an oxygen (O) and one of the terminal carbons to a hydrogen (H). Now use H/AD to add the remaining hydrogens and lone pairs. If you repeat the H/AD command you will note that the hydroxyl hydrogen is not removed. Hydrogens attached to heteroatoms are not removed by H/AD and must be removed explicitly using the DEL command. (This is to ensure that particular hydrogen bonding orientations drawn by the user will not be disturbed by the normal H/AD function.) Minimize this structure (MINIMIZE on the ANALYZE menu) and then save it in to file (SAVE under the FILE menu). Since there are no other heteroatoms present, hydrogen bonding was not important during this step. Next, read the methanol structure back in as a substructure using the READ command under the SUBST menu. A red line will appear between one lone pair and one hydroxyl hydrogen, indicating a potential hydrogen bond. Use the MOVE command under SUBST to move the second methanol molecule around so that the O-H-O system is linear, and is within bonding distance. (By default Hydrogen bonding is now turned on, use the REST option in the Mark Menu to turn it off). MINIMIZE (ANALYZE menu) the dimer. If the linear dimer is obtained, the energy should be -3 to -4 kcal. Next select DOCK from the ANALYZE menu to begin a simulated annealing calculation. The default settings are adequate for this system. Click on OK, and then enter a filename to store the results of the calculation. If the calculation is successful, the dimer energy will be between -4 and -5 kcal, and a linear hydrogen bond will be present. The calculation may be stopped at any point by hitting the ESC key.

4.8 Substructures - Creation and Manipulation


In this example, we will create, move, rotate, hide and reset substructures. First, sketch two separate structures - a hexane and a pentane. To make the pentane into a substructure, choose Sel-Atm from the TOOLS menu and click on any atom in pentane. Select CREATE from the SUBST menu. A

28

dialog box will appear asking for the name of this substructure, type "pentane" and select OK or hit return. Select LABELS from the VIEW menu and click on "Substructure Numbers". The atoms will now be labeled with numbers, according to the substructure to which they belong. Select MOVE from the SUBST menu, and the DIALS box will appear. Since an atom in the pentane substructure was still selected when SUBSTR MOVE was selected, the DIALS box will only move the pentane substructure. Click on EXIT to dismiss the DIALS box, then click on Sel-Atm on the TOOLS menu to de-select all atoms. Now select MOVE from the SUBSTR menu, and a dialog box will appear with a list of all currently defined substructures. Select a substructure from this list, and a new DIALS box will appear which will move only that substructure. To move a different substructure select click on one atom in a different substructure. As long as no other Draw Tools options have been used, the Sel-Atm option will still be active. This substructure now becomes the active structure and will be translated or rotated by the DIALS box. Now click on H/AD to add the hydrogen atoms. Use Sel-Atm to mark one hydrogen in each substructure, and select CONNECT from the SUBSTR menu. A bond connecting the two substructures will replace the hydrogens. Select MOVE from the SUBSTR menu, translate one structure, and notice that the central bond of the hexane is stretched. This will be the case whenever two separate substructures, which are directly bonded, are moved. Choose HIDE from the SUBST menu, if no atoms are selected a substructure list window will appear again. However, each substructure will have a lowercase "v" next to it, indicating that it is currently visible on the screen. Select pentane and say okay. The pentane structure will disappear from the display. Select HIDE from the SUBST menu again, and this time pentane will have an "i" next to it, indicating that it is invisible. Select pentane again and click on OK, and pentane will re-appear on the screen. Select ERASE from the SUBSTR menu, and select "Str 0" from the list. Note that the entire complex is erased. This is because pentane is still part of "Str 0", the initial structure, as well as being its own substructure ("pentane").

4.8.1 Building Polystryene using Dummy Atoms


Read in benzene from the Rings Templates and then select BUILD from the DrawTools. Select one of the hydrogen atoms on the ring to replace it with a methyl group. Next select one of the hydrogens of the methyl group and you should now have ethyl benzene. Minimize the structure to obtain a reasonable geometry. Bring up the Periodic Table from the DrawTools menu and select the dummy atom type, DU, from the dialog box. Next point to one of the hydrogens of the CH2 group and then the hydrogen anti to the first hydrogen on the CH3 group. Do an HAD sequence and notice that the hydrogens that were converted to dummy atoms are not removed by the hydrogen delete. Once the hydrogens are readded do a Minimization. Next , using SUBSTR READ, read in the pcmod.bak file. Highlight the Dummy atoms by using the Show Dummy command from the SUBSTR menu. There should be four dummy atoms showing on the screen. Next select the Connect command from the SUBSTR menu. This will bring up the Connect dialog box with the dummy atoms boxes filled. Selecting Connect will connect the two structures from the second dummy atom of structure 1 to the first dummy atom of structure 2. Selecting Next Structure brings up a File Open dialog box and a new structure can be read. Reread pcmod.bak and note that the Substructure Connect dialog box will be updated with new atom numbers for the dummy atoms. Selecting Connect adds the new structure to the old structure. Repetition of this procedure makes it easy to build oliogmers. You are not limited to single monomers since any structure can be read

29

using the Next Structure button. As long as the new structure has two atoms marked as Dummy Atoms, the substructure connect function will work.

4.9 Diels-Alder Transition State


To model the Diels-Alder transition state for the reaction of butadiene and ethylene, begin by drawing a hexagon of atoms (a flat cyclohexane). Use PT from the TOOLS menu to replace atoms 1 and 2 by C* and atoms 3 and 4 by C# and atoms 5 and 6 by C Use H/AD to add hydrogens and select . MINIMIZE from the ANALYZE menu to optimize the structure. A dialog box will appear inquiring about what bond orders to use. The C*-C* bond represents a sigma bond being formed between one terminus of butadiene and one terminus of ethylene. Enter 0.3 for this bond order. The C#-C# bond represents the other sigma bond being formed. Enter 0.3 for this bond also, and then click on OK. This will start the minimization. The resulting structure should be very similar to that calculated by Houk et al. (Houk, K. N., Paddon-Row, M. N., Rondan, N. G., Wu, Y. D., Brown, F. K., Spellmaeyer, D. C., Metz, J. T., Li, Y., and Loncharich, R. J. Science, 1986, 231, 1108 and references therein.) It should be noted that the bond length between the two C carbons is 1.4 . (You can check this by selecting QUERY from the TOOLS menu, then clicking on each of the C atoms, then on a blank space on the screen. The distance between the two atoms will be displayed on the screen. To remove the distance from the screen, click on UPDATE twice.) If a normal double bond were drawn between these two atoms, the resulting minimized structure would have a 1.34 bond between these two carbons, which will ultimately give a double bond in the product. If other bond orders are chosen, the distances between the C* or C# carbon pairs will respond, as will the 1,3 angles at C* and C#. The calculated MMX energy is the potential energy of the system relative to the potential functions for the transition state, NOT relative to the ground state. As usual, the potential energy of a minimized structure can only be compared to a conformational isomer or a diastereomer. The potential energy of a transition state with one set of bond orders might be compared to the potential energy of another transition state with different bond orders, but only the difference in steric energy can be determined, not the actual free energy difference between the two structures. That is, the electronic energy difference is not considered in the MMX calculation. See the chapter on the MMX force field for more information.

30

Chapter 5. Drawing Tools


The palette of drawing tools appears at the left-hand side of the drawing window when PCMODEL program is started. The palette can be moved anywhere on the screen by dragging on the "Tools" title. The window can be closed by clicking on the CANCEL button at the very bottom, and recalled by selecting DRAW from the EDIT menu.

5.1.1 Select Atom


This option allows certain atoms to be selected for action by future commands. As many atoms as desired can be selected. To clear all selected atoms, click again on the SELATM button

5.1.2 Select Bond


This option allows bonds to be selected for action by future commands. As many bonds as desired can be selected. To clear all selected bonds, click again on the SELBND button

5.2 Draw
This command enables the drawing mode. To create a new structure, select DRAW then move the cursor to the desired location and click the left mouse button. If the cursor is then moved to a new location and the left mouse button is clicked again, a line will be drawn, indicating a bond between the previous atom and the new one just placed. If the cursor is moved to a new location and the left button is clicked again, a third atom will be created and bonded to the second atom. Note that all atoms are added in the plane of the screen. To close a ring, place the cursor on an existing atom and click the button. The cursor must be within about 1/4 inch of an atom to draw a bond to that atom instead of creating a new one. A small box will appear around the atom when the cursor is close enough. To start drawing from an existing atom without adding a new bond to that atom, click on DRAW again, place the cursor on the old atom, click and continue drawing as before. New atoms will be added in the same plane as the last atom picked. To start drawing from a new location, click on the DRAW button again then move to the new location and click the mouse. A small dot will appear indicating a new atom. Clicking on DRAW, then selecting first one end of the double bond, then selecting the other end may create double and triple bonds. Conventional double lines will be used when the structure is redrawn. Alternatively, the ADD_B option can be used (see below). If there are many multiple bonds, this is the easier method to use.

5.3 Build
This option allows for rapid building of 3-D structures. After selecting BUILD, click on any hydrogen and it will be replaced with a methyl group. This option allows PCMODEL to provide the 3-D information required to build the structure, as opposed to the DRAW option, where all atoms are added in the plane of the atom to which they are connected. (Or, in the plane of the screen, in the case of unattached atoms.) The structure may be rotated and viewed with tubular bonds to get a better 3-D view of the molecule. The Stick Figure view is best for selecting atoms.

5.4 Update
This option redraws the screen, and updates the structure with any changes since it was last drawn. It clears any Query information.

5.5 H/AD

31

This command will automatically add or delete hydrogens and lone pairs. The structure will be redrawn with the appropriate structure. Hydrogens and lone pairs attached directly to metals will not be removed. Hydrogens attached to oxygen and nitrogen are not removed to preserve the orientation of hydrogen bonds. Note that removal of hydrogens followed by re-addition will not necessarily return you to the original structure. Hydrogens are added according to a geometric prescription. To remove hydrogens for viewing or comparison purposes only, use the H/OO command (see below).

5.6 Add_B
This command increments the value of the selected bond by one (single to double to triple). After selecting ADD_B, click in the center of the bond to be incremented. An X will appear over the bond when you are close enough. The structure will be redrawn with the conventional double or triple line representation.

5.7 In
This command pushes selected atoms behind the plane of the screen, by approximately 0.33 . Each time an atom is clicked on it will be pushed in an addition increment.

5.8 Out
This command pulls selected atoms out of the plane of the screen, by approximately 0.33 . Each time an atom is clicked on it will be pulled out an addition increment.

5.9 Del
This command DELetes an atom or bond. After selecting DEL, click once on each atom or bond to be deleted and the structure will be redrawn with the atom or bond deleted If an atom is deleted, all bonds to that atom are deleted, as well as any attached atoms. Thus, you only need to click on the carbon to delete an entire methyl group. It is important to do an H/AD command before writing a file if any atoms or bonds were deleted. This ensures that any "holes" created but not removed from the arrays will be removed during this process and that the heavy atoms will be in the proper sequence.

5.10 Move
This command moves an atom into a new position. After selecting MOVE click on the atom to be moved, then click in the new position. The structure is redrawn the atom will be moved. Multiple MOVEs may be done at one time, until another option is chosen.

5.11 Rotate Bond


This command rotates the central bond of the four selected atoms. Use SELATM to first select the four atoms. The selected atoms will be marked by small gray dots.

5.12 Query
This command is used to measure atom positions and charges, bond lengths, angles and dihedrals. To determine the position and charge of an atom select QUERY click on an atom and then a blank space on the screen. To measure a distance, select QUERY, click on the two atoms whose distance is to be measured, then click on a blank area of the screen. The distance between the two atoms will be displayed on the screen at the location of the last click. To measure an angle, click on the three atoms forming the angle of interest, then a blank area of the screen. The numerical value of the angle will be displayed. Similarly, to measure a torsion angle click on the 4 atoms of the torsion then a blank area. If you select two vicinal hydrogens or if you select four atoms of a dihedral and the first and last atoms are hydrogens the PMR coupling constant will be calculated. The PMR coupling constants are corrected for electronegativity according to the method of Haasnoot, DeLeeuw and Altona, Tetrahedron 1980, 36, 2783.

32

Any number and combination of distances, angles and torsion angles may be measured, until another option is chosen. The queries can be printed and will remain on the screen during minimization and dynamics calculations so that changes in the structure can be monitored. To clear the values from the screen, click on UPDATE twice.

5.13 PT

This command calls up a window containing a list of atom types. An atom may be changed to a different type by selecting the new type from the Periodic Table window, then clicking on the atom to be changed. Multiple atoms may be changed to the same type without re-selecting the type. A complete listing of atom types is given in the MMX Force Field chapter. Clicking on CANCEL closes this window.

5.14 Metals
This command calls up a window containing a list of metal atom types. An atom may be changed to a particular metal by selecting the appropriate metal from the list, then clicking on the atom to be changed. The electron count (saturated, unsaturated, high spin, low spin or square planar), charge and coordination of a metal to a pi system are set with the Metal Coord menu item in the Mark Menu. Clicking on CANCEL closes this window.

33

5.15 Rings
This command calls up a window containing a list of ring templates. A complete listing of the rings is given in Appendix B. Selecting a ring from the list causes that structure to be placed in the structure window as a new substructure. Clicking on CANCEL closes this window.

5.16 AA
This command opens a dialog box containing 24 standard amino acid residues. Selecting one of the amino acids causes that structure to be drawn in the structure window. If the CONNECT box is selected, the secondary structure of the connection (alpha helix, beta sheet, beta turn type I or beta turn type II) must be specified prior to selecting the amino acid sequence. Clicking on CANCEL dismisses this window. A complete list of all the amino acids is given in Appendix B.

5.17 Su

34

This command opens a dialog box containing 25 standard sugar residues. Selecting one of the names causes the corresponding structure to be drawn in the structure window. If the CONNECT box is checked, the structure of the connection (alpha or beta) must be specified prior to selecting the sequence. Clicking on CANCEL dismisses this window. A complete listing of all the sugars is given in Appendix B.

5.18 Nu
This command opens a dialog box containing 5 nucleosides. Selecting one of the names causes the corresponding structure to be drawn in the structure window. The default is to draw DNA, but clicking on the RNA button instead may draw RNA. If the CONNECT box is checked then a letter symbol representing the Nucleoside chosen will be written in the edit box. When all the nucleosides are entered selecting Build will generate a single strand of DNA or RNA. Selecting Build Double will build a double strand. Clicking on CANCEL dismisses this window. A complete listing of all the nucleosides is given in Appendix B.

5.18 Organo Metallics Templates


This command calls up a window containing a list of organometallic templates. These are structures that have been built and saved in a file. Selecting a structure from the list causes that structure to be placed in the structure window as a new substructure, which can then serve as the starting point for building new structures. Clicking on CANCEL closes this window.

35

5.20 Transition State Templates


This command calls up a window containing a list of transition state structures. These are structures that have been built and saved in a file. Selecting a structure from the list causes that structure to be placed in the structure window as a new substructure, which can then serve as the starting point for building new structures Clicking on CANCEL closes this window.

36

Chapter 6. File Menu


This menu contains the following commands: Open Save Save Graphic Print Read pcmod.bak Exit Reads a file containing a molecular system Writes a molecular system to a file Writes a file containing a picture of the screen Print the current molecular system Read the structure backup file Exit PCMODEL

6.1 Open
The OPEN option in the FILE menu is used to read a new structure file into PCMODEL. Any structure on the screen will be erased.

Macintosh: Selecting OPEN immediately brings up a standard Macintosh file open dialog box with the different file formats listed in a drop down window at the bottom of the dialog box. Windows and Linux: Selecting OPEN immediately brings up a standard Windows Open File dialog box which gives the current path, an edit window for the filename, a drop down box containing the various file types supported by PCMODEL, a window listing directories and disk drives, and a scrollable window listing the files in the current path (if a wildcard expression is used for the filename). When the dialog box first appears, the PCMODEL file type is selected and all files with the .pcm file extension will be shown. A filename may be entered by typing the name in the filename edit box or by selecting (with the mouse) a file from the scrollable file list box. To see files available in the current path, use a wildcard expression such as *.PCM to see all files with the PCM extension, or *.* to see all files.

37

PCMODEL supports several different types of files. Clicking on the combo box at the bottom of the dialog box will show all the types of files that can be read in PCMODEL. The default file type is PCMODEL. File Types Read by PCMODEL PCMODEL files contain information about all the atoms and bonds in a molecule, including information about hydrogen bonding, pi atoms, metal coordination and substructure membership. It is the most general file type for molecular mechanics information, is a free format ASCII text file, and can be read or edited with any word processor. A PCMODEL file can contain either one structure or multiple structures. PCMODEL files can be read by Gaussian 92 and can serve as input to this ab initio quantum chemistry program. (A full description of this file type is given in chapter 6.) MMX files are essentially MMP2 formatted files (see QCPE program number 395 for a detailed description). These files contain information about atoms and bonds in a molecule, including information about pi atoms, hydrogen bonding and metal coordination, but not substructure membership. An MMX file can contain either one substructure or multiple substructures. It is an ASCII text file with a fixed, Fortran format, and can be read or edited with any word processor. The fact that it is fixed format means that data must be entered in specific columns to be read correctly, which makes editing difficult. (A full description of this file type is given in chapter 6.) MM3 files contain information about all the atoms and bonds in a molecule written in the MM3 format and using MM3 atom type rules. MM3 files are text files and use a Fortran type fixed format. The files may be edited with a text processor, but the fixed format requires that data be entered in specific columns to be read correctly. PDB files are structure files using the Protein Data Bank format developed at Brookhaven National Laboratory. The PDB format can be used to describe any molecule and is not restricted to proteins or nucleic acids. The PDB format contains information about the atom type, the residue name if part of a protein or nucleic acid, and the atomic coordinates of the atom. Bonding information may be included in the file but it is not required. PCMODEL reads PDB files and tries to determine the bonding arrangement based on internuclear distances. While this method is good it is not perfect and the bonds shown when a PDB file is read should be looked at carefully before further calculations are attempted. MOPAC INP and ARChive files are used by the semi-empirical quantum chemistry programs, AMPAC and MOPAC (QCPE program number 514). These files contain a list of atom symbols, bond lengths, angles and dihedral angles, but no information about bonding. MOPAC files may be used for input to the AMPAC, MOPAC and Gaussian programs since all of these programs can read and interpret internal coordinates. The INP file is an input file to AMPAC or MOPAC in Z matrix format and the ARChive file is the archive output file written by Mopac (normally FOR012) and contains the optimized geometry in Z matrix form and the atomic charges calculated by Mopac. Both the structure and charges can be read into PCMODEL. Alchemy and Sybyl molecular modeling programs marketed by Tripos Associates (St. Louis, MO) use ALCHEMY and SYBYL files. Alchemy file format is also used by Chemical Abstracts Service for 3Dstructure information. MACROMODEL files are molecular mechanics type files generated by the Macromodel program. This large molecule modeling program, developed by C. Still, Columbia University, runs on workstations and provides many facilities for handling large molecules. Macromodel files contain information about atoms and bonds, but no information about pi atoms, hydrogen bonding or transition metals.

38

X-RAY files are free format structure files in a format that has been developed for reading x-ray crystal data. X-ray structure files only contain information about atom types and positions. (A full description of this file type is given in chapter 6.) CHEM-3D is a molecular modeling program developed by Cambridge Scientific Software for use on a Macintosh computer. PCMODEL reads and writes the Cartesian type II structures. Chem-3D can be used to generate publication ball and stick pictures. TINKER is a molecular modeling program developed by Professor Jay Ponder of Washington University (ponder@dasher.wustl.edu). PCMODEL reads and writes the xyz cartesian filetype. MDL_MOL files are structure information files in the molfile format developed by MDL Information Systems Inc. Molfiles can be generated by programs such as Isis Draw, or as the result of a search of the REACCS or MACCS databases. Molfiles contain 3-D structure and bonding information. Corina and Concord are structure files generated by the structure building programs Corina and Concord. Gaussian files are the output files (text) written by the Gaussian Quantum Chemistry programs running under Windows, on a workstation or supercomputer. Since PCMODEL reads the text file, the output file is portable across all these platforms. Gaussian ChkPt files are the formatted checkpoint files generated by the Formchk utility of the Gaussian program. Jaguar Input and Log files are input and output files written by the Jaguar quantum chemistry program from Schrodinger, Inc. CSSTR files are Cambridge Structural Database files. Gamess files are the output files (text) written by the Gamess Quantum Chemistry programs running under Windows, Macintosh, on a workstation or supercomputer. Since PCMODEL reads the text file, the output file is portable across all these platforms. Hondo files are the output files (text) written by the Hondo Quantum Chemistry program. Since PCMODEL reads the text file, the output file is portable across all platforms.

6.2 Save
The SAVE option is used to write the current molecular system to a disk file. Macintosh: Selecting Save brings up a dialog box listing the current types of files PCMODEL can write. Select the appropriate file format and then OK. The standard Macintosh file save dialog box will then be presented. Windows and Linux: Selecting SAVE brings up a standard Windows File Save dialog box which gives the current path, an edit window for the filename, a list box containing the various file types supported by PCMODEL, a window listing directories and disk drives, and a scrollable window listing the files in the current path (if a wildcard expression is used for the filename). All files with the default extension will be shown. The last file name used will be retained. If this file is used again, you will be asked if you want to replace the file or append the structure onto the existing file.

39

When the dialog box first appears, a wildcard expression is displayed in the filename edit box, and the PCMODEL file type is selected. All files that match the wildcard expression will be listed in the file list box. A filename may be entered by typing the name in the filename edit box or by selecting (with the mouse) a file from the scrollable file list box. To see files available in the current path, use a wildcard expression such as *.PCM to see all files with the PCM extension, or *.* to see all files. If the filename that is entered already exists, a dialog box will appear asking if the new structure is to be appended to the existing file, or if the existing file is to be replaced. Choosing APPEND to generate multiple structure files or libraries of related structures. The current directory is given in the current path box. The directory list box displays all directories below the current directory, the directory one level up from the current directory ([..]), and all other disk drives on the computer. The current path may be changed by selecting either another directory or disk drive in the directory list box. PCMODEL supports several different types of files. Clicking on the combo box at the bottom of the dialog box leads to a drop down menu of all the files written by PCMODEL. The default file type is PCMODEL. MMX atom types are assumed for all files except MM3 files where the MM3 atom types are assumed. File Types Written by PCMODEL PCMODEL files contain information about all the atoms and bonds in a molecule, including information about hydrogen bonding, pi atoms, metal coordination and substructure membership. It is the most general file type for molecular mechanics information, is a free format ASCII text file, and can be read or edited with any word processor. A PCMODEL file can contain either one structure or multiple structures. PCMODEL files can be read by Gaussian 92 and can serve as input to this ab initio quantum chemistry program. (A full description of this file type is given in chapter 6.) MMX files are essentially MMP2 formatted files (see QCPE program number 395 for a detailed description). These files contain information about atoms and bonds in a molecule, including information about pi atoms, hydrogen bonding and metal coordination, but not substructure membership. An MMX file can contain either one substructure or multiple substructures. It is an ASCII text file with a fixed, Fortran format, and can be read or edited with any word processor. The fact that it is fixed format means that data must be entered in specific columns to be read correctly, which makes editing difficult. (A full description of this file type is given in chapter 6.)

40

MM3 files contain information about all the atoms and bonds in the molecule. The MM3 atom types are used and pi atom marking is supported. MOPAC and AMPAC files are used by the semi-empirical quantum chemistry programs, AMPAC and MOPAC (QCPE program number 514). These files contain a list of atom symbols, bond lengths, angles and dihedral angles, but no information about bonding. MOPAC files may be used for input to the AMPAC, MOPAC and Gaussian programs. The dialog box will facilitate writing the file.

ALCHEMY and SYBYL files are used by the Alchemy and Sybyl molecular modeling programs, marketed by Tripos Associates (St. Louis, MO). Alchemy format is also used by Chemical Abstracts Service for 3D structure information. MACROMODEL files are molecular mechanics type files generated by the Macromodel program. This large molecule modeling program, developed by C. Still, Columbia University, runs on workstations and provides many facilities for handling large molecules. Macromodel files contain information about atoms and bonds, but no information about pi atoms, hydrogen bonding or transition metals. CHEM-3D is a molecular modeling program developed by Cambridge Scientific Software for use on a Macintosh computer. Chem-3D can be used to generate publication ball and stick pictures. Tinker is a molecular modeling program developed by Professor Jay Ponder of Washington University in St Louis. PCMODEL generates an xyz cartesian file that can be read by Tinker. PDB files contain the Cartesian coordinates of the current structure written in PDB format. Jaguar Input files are written for the Jaguar quantum chemistry program. They include the Cartesian coordinates of the structure and the atomic symbol.

41

Gaussian Job files are input files for the Gaussian quantum chemistry program. Structural information can be output in either Cartesian or Z matrix format and many options may be set using the dialog box. The Keywords section is updated each time an option is selected. If you wish to add additional keywords not available through the options, they should be added just before OK is selected. Gamess input files will bring up a dialog box similar to the Gaussian box. Hondo input files will bring up a dialog box similar to the Gaussian box. Corina and Concord are structure files generated by the structure building programs Corina and Concord.

42

6.3 Save Graphic


This option saves the screen in one of the following file formats; GIF, BMP, HPGL, PostScript or POVRAY. The file may then be imported into other programs for editing. The POVRAY option will write a file for input into the rendering program POVRAY. This program is freely available for a number of platforms (PC, Mac, UNIX) at http://www.povray.org. It will provide a ray-traced picture of the molecule as shown to the left for potassium 18-crown-6.

6.4 Print
Print sends the current structure and related information to the default Windows printer (which may be a file). The Print Manager handles printer selection in WindowsTM or by the Chooser on the Macintosh. When PRINT is selected, a dialog box appears containing the currently selected printer. If the current structure is drawn as a stick figure then a dialog box will be shown with options for printing the current structure and compare or energy windows. If the current structure is drawn in Ball and Stick, Tubes or CPK modes then the second dialog box will not be shown and the output and compare windows will not be printed. Any information in the structure window (such are queries) will be sent to the printer along with the structure.

6.5 Read pcmod.bak


Every time a minimization is started and upon its completion a structure file, pcmod.bak, is written to the current working directory. This option will read in this file.

6.6 Exit
This option leaves PCMODEL and returns to the operating system. A dialog box asks for confirmation before exiting. Note that any unsaved work will be lost when PCMODEL is exited.

43

Chapter 7 Edit Menu


The EDIT menu contains the following commands: Draw Erase Structure Name Hide Hydrogens Recovers the TOOLS menu Erases all structures Edit or enter name of current structure This option toggles display of hydrogen atoms on and off. The hydrogens are not removed from the structure, merely un-displayed. This allows for a much clearer drawing without changing the geometry as with H/AD. Epimerizes the selected atom by interchanging the two selected attachments. Reflects the structure through a specified axis. Remove lone pairs from heteroatoms. Force PCMODEL to retype the current structure. Copies an image of the structure window to the clipboard. Orient the molecule in a specified plane, placing the first selected atom at the origin, the second atom along the axis and the third atom in the plane. Useful when generating structures for MO calculations Build a box of solvent molecules or solvate the current structure. Build a periodic box from the current structure.

Epimer Enantiomer Remove LP Retype Copy__To_Clipboard Orient_XY Plane Orient_XZ Plane Orient_YZ Plane

Build Solvent Box/Solvate Build Periodic Box

Draw
The DRAW option brings back the Drawing Tools buttons if they are not visible, or makes them the active window if they are visible. Otherwise, this command does nothing. The commands in this Tool bar are described in detail in chapter 5 of this manual.

Erase
This command erases the current structure and resets the default conditions for modeling. Since any unsaved structures will be lost, a dialog box appears and confirmation is required before the structure is actually erased.

Structure name
This option brings up a dialog box for entering or editing the name of the current structure. The default structure name is "untitled". If a file containing a named structure is read, then the current structure name is replaced by the name from the file. The structure name is displayed in the title bar of the PCMODEL window, just to the right of the word "PCMODEL". The structure name is limited to 60 characters and will be written into the structure file whenever the structure is written to disk (if the filetype selected supports structure names).

Hide Hydrogens
This option toggles the display of hydrogens attached to carbon. The atoms are still present in the structure database they are just not displayed. This will often make the display less cluttered, especially if atom numbers or types are being displayed.

44

Epimer
This option allows for inversion of the stereochemistry at a particular chiral center. Before selecting EPIMER, the central atom and the two attachments to be switched must have been selected with the SELECT command from the TOOLS menu. Once the three involved atoms have been SELECTed, the EPIMER option can be chosen and the position of the two attachments (and all atoms attached to them) will be exchanged. This option will not work if the two attachments are connected to each other in a chain.

Enatiomer
This option allows for enatiomerization of the structure by reflecting through an axis selected by the user.

Remove LP
This option forces immediate removal of lone pairs from heteroatoms. Only the MMX force field uses lone pairs and lone pairs are normally removed automatically when calculations are done with any other force field. However, you may force removal of lone pairs with this option.

Retype
This option forces PCMODEL to retype the current structure.

Copy_To_Clipboard
Selecting this option copies the current structure window to the clipboard with all color information intact. The clipboard can then be copied into a paint program for editing, or it can be printed when a color printer is available.

Orient_XY Plane
Orient_XZ Plane Orient_YZ Plane These commands will orient the molecule in the selected plane. Select three atoms using the Select command from the DrawTools menu, then select one of these options. The first selected atom will be placed at the origin, the second atom will be placed along the axis and the third atom will be rotated to lie in the plane specified. This option is useful for orienting a structure before writing an input file for a molecular orbital calculation.

Build Solvent Box/Solvate


This option is used to build a box of solvent molecules or to solvate the current molecule. If there is molecule in the structure window when this dialog is called the dimensions of the smallest box enclosing the current structure will be calculated and displayed in the Mol Dimensions edit boxes. The default solvent is set to water but may be changed to chloroform, octane or to the current structure. The number of molecules desired must be entered and the density may be changed at this time. The cell size that is required to contain the current structure and the number of solvent molecules at the entered density is computed and displayed in the Cell Size edit boxes. Hitting the OK button will causes the solvent molecules to be added and the dialog box dismissed. If there is a current structure on the screen it will be moved to the center of the box and the remainder of the

45

molecules will be added using the skew start method of Refson (K. Refson, Moldy v 2.9 ..). The skew start method works well for small, compact molecules whose dimensions are nearly spherical. If you want to build boxes of odd shaped molecules, those whose dimensions are elongated in one dimension, then you should use the Build Periodic Box command discussed below. Upon exiting this dialog the force field calculations are set to use Periodic Boundary Conditions for minimizations and dynamics calculations. We currently use a minimimum image convention, a spherical cutoff for dispersion interactions and a particle mesh ewald method for long range electrostatic interactions. Due to the organization of the code in PCMODEL these calculations are currently very slow and PCMODEL is probably not the program of choice for production dynamics calculations. However for short runs, initial testing and visualization PCMODEL works fine.

Build Periodic Box

This menu options is useful for building periodic boxes of oddly shaped molecules. There must be a molecule currently on the screen for this dialog to function. The molecular dimensions of the current structure are computed and displayed in the Mol Dimensions edit boxes. The number of molecules to include in the box and the desired density must be input. Calc Cell Size computes the cubic cell required to hold the number of molecules at the current density. Build Cell adds the appropriate number of molecules to the cell using the skew start method (see above). The resulting cell can then be visually examined for bad non-bonded interactions and a single point energy calculation done to assess the extent of bad non-bonded interactions. If there are severe molecular overlaps the added molecules can be erased using the Reset Cell button which returns the cell to its initial state. The density can then be changed and the process of building the cell can be repeated. By this interactive method a reasonable starting cell can be obtained without severe non-bonded interactions. Energy minimization followed by dynamics runs with increasing densities can be used to build a relaxed cell at the density desired for study.

46

Chapter 8 View Menu


This menu contains the following commands: Control Panel Labels Mono/Stereo Stick Figure Ball and Stick Pluto Tubular Bonds CPK Surface Dot Surface Ribbon RedGreen Stereo Top View Dipole Moment Brings up a dialog box to scale, translate and rotate structures in the structure window. Brings up a dialog box to change the structure labels Toggles between mono and stereo display Display all structures as stick figures. Display all structures as Shaded ball and stick figures. Display all structures as Pluto type ball and stick figures. Display all structures with tubular bonds. Display all or selected structures with CPK surface. Display all or selected structures with dot surface. Display a Ribbon diagram of peptide or nucleic acid backbone Toggles red/green stereo display. Turns on/off a top view of the current structure Draws a dipole moment along with structure.

Control Panel
Selecting CONTROL_PANEL brings up a dialog box containing seven controls which allow scaling, translation and rotation of the current molecular system.

The SCALE control allows the structure to be reduced or enlarged. Enlarging the structure is useful when trying to select atoms in congested systems, and reducing the structure can be useful when visualizing large structures. The three TRANSLATE controls move the molecular system in the X, Y, or Z directions. The X direction is horizontal on the screen, the Y direction is vertical, and the Z direction is perpendicular to the screen. The three ROTATE controls rotate the system around the respective axes. RESET_VIEW returns the molecular system to the coordinates it had when the CONTROL_PANEL option was selected. All translations and rotations are reset, but changes in scale remain in effect. RESET_SCALE resets the scale size to the default value. It has no effect on rotations or translations. EXIT dismisses the dialog box. To move one substructure relative to other substructures, use the MOVE command under the SUBSTR menu.

Labels
Selecting LABELS brings up a dialog box through which the currently displayed labels can be changed. The default is to only show bonds. Clicking on "OK" will close this window, with the selected display type enabled, and this display type will be remembered and made the default when you exit PCMODEL.. The options are:

47

Hydrogrens and lone pairs - places the letter "H" at each hydrogen position, and displays lone pairs as a pair of dots. This does not add hydrogens or lone pairs, it only displays them if they exist. Bonds only (default) - removes all atom symbols from the display, but atoms and bonds are shown in their normal colors. This is especially useful with large structures which quickly become cluttered when atom symbols are displayed. Atom numbers - labels each atom with its individual atom number. This is useful for setting up files for GMMX, the global searching program, which works on atom numbers. It is also useful when trying to interpret output given in terms of atom numbers. MMX Atom Types - labels each atom with its MMX atom type number. This is useful for examining parameters used in the MMX calculations, since these are given by atom type. (see Chapter 7 for a complete list of atom types and the corresponding numbers). MM3 Atom Types - labels each atom with its MM3 atom type number. This is useful for examining parameters used in the MM3 calculations, since these are given by atom type in the file MM3.PRM. MMFF94 Atom Types - labels each atom with its MMFF94 atom type number. This is useful for examining parameters used in the MMFF94 calculations, since these are given by atom type in the file MMFF94.PRM. AMBER Atom Types - labels each atom with its Amber atom type number. This is useful for examining parameters used in the Amber calculations, since these are given by atom type in the file AMBER.PRM. Oplsaa Atom Types - labels each atom with its Oplsaa atom type number. This is useful for examining parameters used in the Oplsaa calculations, since these are given by atom type in the file OPLSAA.PRM. Atomic Charge - labels each atom with its current atomic charge. This will be zero (0.0) until a calculation is done, and the charges will be specific to the force field used in the calculation. Substructure number - labels each atom with a number representing its substructure membership. This number is a value form 0 to 255 and is the sum of 2substructure number for all substructures for which the atom is a member. Color by substructure - colors the bonds by the lowest substructure number at each atom. Color by Strain - During a calculation the energy for each interaction is partitioned by atom. The individual atomic energy components are then normalized and colors are assigned to each atom based on the magnitude of the energy with Red being the most energetic, followed by rose, yellow, green, cyan, magenta, blue and gray.

Mono/Stereo
This option changes the default display between one structure (Mono) and two structures displayed side by side in stereo. The stereo display can be rotated inward or outward, though the use of STEREO option in the OPTIONS menu. In stereo mode the two structures are rotated + and - three degrees from the mono structure to simulate a stereographic display.

Stick Figure
Selecting this option returns the current drawing mode to the default stick figure display of all atoms.

Ball and Stick 48

Selecting this option displays the current structure with small balls representing the atoms and tubular bonds connecting the atoms.

Pluto
Selecting this option displays the current structure as a Pluto type ball and stick figure with shading.

Tubes
Selecting this option displays the current structure with thick bonds.

CPK_Surface
This option displays the current molecular system as a CPK model

Dot_Surface
This option displays the current molecular system with a dot surface around it.

Ribbon
This option draws a ribbon for the backbone of a peptide or nucleic acid chain. The display can be of the ribbon only, or the ribbon and a stick representation of the structure itself. The type of molecule (acyclic protein, cyclic protein or DNA/RNA) must be indicated. The dialog box offers a choice of Oxygen-Oxygen (default) or swapped orientation of the ribbon. Oxygen-oxygen produces smooth ribbons for alpha helices, while swapped produces smooth ribbons for sheets. The algorithm searches for the first N-terminus and draws the ribbon from there. Once it cannot find another amide linkage in the chain, it stops there and draws the ribbon. The ribbon display remains in effect until either the STICK_FIGURE option is selected or the Ribbon dialog box is brought up and canceled. The ribbon display can be rotated and translated, and can be printed.

Red/Green Stereo
Redraws the current structure in stick figure form using red and green images to create a stereo image when viewed through red/green glasses.

Top View
Draws a second view of the current structure rotated by 90 degrees. The top view is smaller and placed in the upper or lower corner, but this allows two views of the same structure simultaneously.

Dipole Vector
Draws a vector representing the computed dipole moment of the current structure. The dipole vector will be rotated with the current structure.

49

Chapter 9 Compute and Force Field Menus


This menu contains the following options: Minimize Single Point User External Charges Use Gasteiger Charges Normal Vibrational Modes Mopac Ampac Gaussian Orbitals Vibrations GMMX Metropolis MC Vibrational Mode Search Dynam Auto Dock Manual Dock Batch Rot_E Dihedral Driver Minimize the current structure(s). Calculate the energy of the current structure(s). Use charges read from external file, usually Mopac/Ampac Assign charges based on Gasteiger sigma method s Compute normal vibrational modes Run the semi-empirical quantum program Mopac Run the semi-empirical quantum program Ampac Run the quantum chemistry program Gaussian View the molecular orbital or electron densities calculated in Mopac, Ampac, Gaussian, Jaguar, or Gamess View the normal vibrational modes calculated in Mopac, Ampac, Gaussian, Jaguar, Gamess or PCMODEL Run the stochastic conformational search. Run the Monte Carlo conformational search. Run search using low frequency vibrational modes Molecular Dynamics simulation. Simulated annealing search of lowest energy interaction between two structures. Move the current substructure about the main structure and compute the energy at each step Read a multiple structure file and minimize all structures or compute properties. Rigid rotor evaluation of rotational energy barrier. Grid search with minimization evaluation of the rotational energy barrier.

Force Field Menu


MMX MM3 MMFF94 AMBER Oplsaa Use the MMX force field. Use the MM3 force field Use the MMFF94 force field Use the Amber 95 force field. Use the Oplsaa force field.

Minimize
This option optimizes the geometry of the current molecular system using the currently selected force field. The current molecular structure is analyzed for atom types, bonds, angles and dihedral angles and the appropriate constants are extracted from the database of parameters. These parameters are found in the text files mmxconst.prm, mm3.prm, mmff94.prm amber.prm or oplsaa.prm. The energy of the molecular system is evaluated and written to the screen in the Output window, and then the minimizer is called. The default action of the minimizer is to first do a first derivative minimization until the gradient falls below 1, then a second derivative minimization until the gradient falls below 0.0001. The minimizer attempts to lower the energy of the system by moving the atoms. Every 5 iterations during the first derivative minimization, and then ever 2 iterations during the second derivative minimization the Structure window is updated with the current geometry and the Output window is updated with the iteration number, the atomic movement (in 10-5 units) and the energy (in kcal/mol). If the energy goes

50

up, or if the energy difference between two calculations falls within the convergence criterion the calculation is stopped, the final energy and its components, the heat of formation (for MMX and MM3 force fields) and the SE (strain energy) are written to the Output window, and the Structure window is updated. In general, the energy minimum found will be a local energy minimum. A calculation may be stopped at any time by clicking the Stop Job dialog box. The results, in abbreviated form, are written to the file pcmod.out. This file is a record of all the minimizations done during the current session and can get quite large. The default is to generate a minimum output, but this may be changed with the Printout command in the Options Menu. Pcmod.out is overwritten every time you start PCMODEL, so the file must be renamed to preserve a record of a particular set of calculations. A structure file, in PCM format, is automatically saved upon starting the minimization and after every five iterations so that if the program crashes or is stopped the last structure can be recovered. The filename is pcmod.bak and it is written in the current directory. This file is overwritten every time a new minimization is begun, so to preserve a structure it must explicitly saved with the Save command. The PCM file format is the most concise way to save data since the energy data can be regenerated by repeating the energy minization with the final optimized geometry. A major problem in minimizing structures drawn without a template is that hydrogens and lone pairs are often forced into inappropriate positions resulting in high energy structures. It is a good idea to do an H-delete-H-add sequence (select H-A/D in the TOOLS menu twice) then reminimize. For large structures, reminimize to be sure the structure is fully converged. Since minimization is quite timeconsuming with large molecules, it is often economical to minimize the structure without added hydrogens to get the three-dimensional structure approximately right prior to H-A/D and final minimization. In the MMX force field PCMODEL has access to a number of generalized force field parameters ( i.e., torsional potentials dependent only on the two central atoms) to which the program defaults if no standard MMX parameters are available. These parameters are not optimal and energies produced with them should be considered accordingly (however, geometries should not be too far off). The use of generalized parameters is always noted in the output file PCMOD.OUT. It is important to note that the MMX energy resulting from a minimization with a pi calculation does not contain the potential energy of the pi system. So a comparison of MMX energies for isomeric pi systems is invalid, just as a comparison of MMX energies is invalid for structural isomers. A comparison of the heats of formation, on the other hand, is appropriate. The only circumstance where MMX energies may be compared directly is with conformers or diastereomers.

Single Point
SINGLE POINT does a single calculation of the energy of the current structure(s). The calculation is begun just as for the MINIMIZE option, however the calculation is stopped before the minimizer is called. The heat of formation is not calculated during a single point calculation.

Use External Charges


This option forces PCMODEL to use any charges that were read into the program from an external file. This option is often used with Mopac or Ampac files where the charge is computed and included in the archive file. At this time charges are not read for Gaussian, Gamess, Hondo or Turbomol files.

Use Gasteiger Charges


This option forces PCMODEL to use charges computed by Gasteiger sigma model (Gasteiger s and Marsili, Tetrahedron, 36:3219-3288, 1980) .

Normal Vibrational Modes


Since PCMODEL now has a full second derivative minimizer included the normal vibrational modes of a molecule may be computed. You will first be prompted for the name of a file to write the

51

normal vibrational mode information, then a mass weighted hessian matrix will be generated, diagonalized and the vibrational modes will be written to the file. This file can be displayed using the Vibrate program (see below). MM3 does a reasonable job with vibrational frequencies while MMX does not give reasonable frequencies.

Mopac
This command attempts to execute the program Mopac using as input the current structure. PCMODEL first attempts to find Mopac.exe in the current directory. If Mopac.exe is not located in the current directory and PCMODEL does not have any previous information about the location of Mopac.exe you will be prompted to provide a path to the executable file. Use the browse dialog box to locate and select the program. You should only need to do this the first time you run Mopac since the path information will be saved when you exit PCMODEL. Your version of Mopac must be capable of reading a filename from the command line since we will be passing the filename to Mopac to execute. Serena Software markets a version of Mopac that does this. If you have obtained a copy of Mopac from other sources or you have generated it yourself you should check that this version can read the command line. After Mopac.exe is located, you will first be prompted for a name for a Mopac input file that will be generated, and then a dialog box for creating a Mopac format file will be presented. Either select from the standard options or enter the keywords you wish to use on the Keyword line. Mopac.exe will then be called with the filename a DOS type command window will be displayed and the calculations will be run. When the calculation is complete the DOS window will disappear and a new prompt will be displayed asking if you want to read the geometric output of the Mopac calculation. If you chose Yes, then a file dialog box will be presented for reading a Mopac Archive file. The filename will be filename.arc, that is the name you first created with the three letter extension ARC appended. Selecting this file will read in the new structure. If this file does not exist then the Mopac calculation failed and you should look at the file filename.out (or FOR006) to discover the source of the problem.

Ampac
This command attempts to execute the program Ampac using as input the current structure. See the description given above for Mopac for instructions on running Ampac.

Gaussian
PCMODEL can directly run Gaussian 98. The executable program to be run is g98.exe (not g98w.exe !!!!!). See the description for running Mopac for more details on finding the Gaussian executable and generating filenames. The dialog box for generating a Gaussian job file provides most of the standard options (see File Menu save section for more information about the dialog box), however additional options can be added by typing them on the keywords line.

Orbitals
This command executes the program Orbdraw which will display the molecular orbitals and electron densities that can be computed by the quantum chemistry programs Ampac, Mopac, Gaussian, Gamess and Jaguar. After the data file is selected the file is read and the number of atoms, orbitals and electrons are displayed in a dialog box. Next the orbital or density to display is selected from the View menu. The structure and selected orbital or density will be drawn. The entire image may be rotated by dragging the mouse with the left button depressed. The initial resolution for the display is 25x25x25 which gives a very crude picture but can be generated rapidly. A more detailed picture can be generated by increasing the resolution and threshold value.

Vibrations
This command executes the program Vibrate which will display the normal vibrational modes that can be computed by the quantum chemistry programs Ampac, Mopac, Gaussian, Jaguar and PCMODEL. After the data file is selected and read a dialog box giving the number of atoms and vibrations of the structure will be presented then a stick figure of the vibrational spectrum will be displayed. A particular vibration can be displayed by either pointing to the line in the spectrum and

52

clicking the mouse, or be selecting the vibration from the list of vibrations in the menu. Up to four vibrations may be displayed at once and the structures can be rotated.

GMMX
This command initiates a conformational search on the current structure and up to four rings and fifty rotatable bonds can be searched at one time. There are many options available to control the search and they are discussed in the chapter on conformational searching.

Metropolis-MC
This command initiates a Monte Carlo conformational search in internal coordinates. All bond lengths, angles and dihedrals are included in the search.

Vibrational Mode Search


This command runs a conformational search that uses the low frequency vibrational modes as search directions based on the method of Kolossvary and Guida (Kolossvary and Guida, Journal of the American Chemical Society, 118:5011-5019).

Dynam
DYNAM performs a molecular dynamics simulation using the leapfrog Verlet algorithm. Selecting this option brings up a Setup dialog box, through which several parameters of the simulation may be set. The time step for numerical integration (default 1 femtosecond) should never be larger than 5 fs (and the program may hang with this value). The next options are the initial temperature in K of the sample molecule and the surrounding bath (default for each is 300 K). The next option is the viscosity of the medium in cp (default of 0). The next parameter is the time constant for heat transfer (default value is 1, which results in transfer of energy from the bath to the sample). The final parameter is the equilibration time before the run is sampled. You may sample the run by clicking the Sample check box. The default sample time is 10fs. You will be asked to provide a file name for the run. At each sample point a structure will be written to the selected file. The run may be played back using the Movie option under the View menu option. Newton's equations of motion are used to calculate the position and velocity of all the atoms at each time step. This method can be used to add or remove kinetic energy from a molecule and to distribute the energy among various vibrational motions. Processes that occur on the pico- and femtosecond time scale can be studied. The initial structure should be minimized before starting the dynamics run. Initially the structure will be drawn overlaid on the screen along with the kinetic and potential energy at the current time. The screen is updated at each timestep. Remember that vibrations occur on the picosecond time scale and rotation about a bond, as in butane, may not be observed by this method. Dynamics has application to large structures where strain can be relieved by reducing van der Waals interactions.

Auto Dock
The AUTO DOCK option takes two or more minimized structures and attempts to find the best interaction geometry. This is done using a simulated annealing algorithm. The basic series of steps is:

53

1. 2. 3.

4.

Evaluate the energy of the initial structures. Randomly translate and rotate the second structure about the first. Evaluate the energy of the new system. If the energy is lower than that of the original system, accept the new structure and go to step 4. If the energy is higher than that of the original system, use the Metropolis algorithm to decide whether to accept the new structure. If the new structure passes then go to step 4, otherwise go back to the previous structure and then proceed to step 4. Increment the counter. When enough structures have been analyzed, lower the temperature which is used in the Metropolis algorithm. The net effect of this scheme is to accept fewer uphill energy steps, and drive the system to lower energies.

Selecting the DOCK option brings up a dialog box to set the initial conditions of the calculation. The parameters are:

XYZ displacement Base angle Start Temperature Cooling decrement Energy Change/Cycle Stop on No Change

(default 1 Angstrom) (default 15 degrees) (default 3 kcal) (default 0.1 kcal) (default 0.2 kcal) (default 5 cycles)

The xyz displacement is the distance the substructure will be moved in x, y or z times a random number. The base angle is the rotation angle to be used for rotating the substructure in x, y or z times a random number. The starting temperature is given in Kcal as are the cooling decrement and energy change per cycle. Clicking on OK brings up a dialog box into which the filename for saving the results should be written, then starts the calculation. When docking, the calculation may be aborted by hitting the ESC key on the keyboard. Since the simulated annealing procedure is stochastic, there can be no guarantee of success or even that two successive calculations will end up with the same structure. However, this method is superior to both manual and grid methods for docking studies. Trial and error is required, and starting from several different conformations can be useful. The structures to be docked should be located close together, since motion is limited to the sum of the diameter of the main structure, plus twice the diameter of the substructure plus 4 , even at the highest temperature.

Manual Dock
The Manual Dock options allows the user to take two minimized structures and manuall move them about and evaluate the energy at each step. The entire ensemble can be rotated by dragging the mouse, and the individual molecules with the control panel. High energy interactions are shown in red as the structure is redrawn.

Batch
The BATCH option allows the handling of multiple structure files without user intervention. There are a number of options including both calculations and file conversions. Selection of the BATCH option first brings up a file dialog box to get the name of a file containing multiple structures (see FILE OPEN for a detailed description of this dialog box). Any file format that supports multiple structures may be used. Next a dialog box to select the desired options is presented. The current options include energy minimization, computation of the surface area, volume or conversion of file type. One or more of these options can be selected.

54

A FILE SAVE dialog is then presented to obtain the name and file type of the output file for saving output structures. The input file is then read and the number of structures is determined. A new dialog box is presented giving the number of structures, the default starting structure, and the default filenames for the output and summary files. The output file will be the name given in the FILE SAVE dialog and will contain the updated geometries after minimization or file conversion. The summary file will contain a summary of the minimization information including the MMX energy and the heat of formation, surface areas or volumes depending upon the options selected. The various fields of the dialog box may be edited to change the filenames and the default starting and ending structure numbers. Clicking on OK starts the calculations, which will run unattended until completion. The BATCH process can be stopped by selecting the STOP JOB button. The cursor will change from an hourglass to a pointer when the calculation is done. The bottom of the output window indicates the current structure number, and the total number of structures to be calculated so that progress in the BATCH process can be monitored.

Rot_E
The Rot_E option evaluates the energy required to rotate about a given bond, using the rigid rotor approximation (meaning that the rotating ends are held rigid). The current structure should be minimized and the two atoms of the bond of interest should be SELECTed before ROT_E is selected. A dialog box appears, into which the step size (default 10) and total rotation (400) are entered. Clicking on OK starts the calculation. The energy of the current structure is evaluated, then the specified bond is rotated by one step size increment and the energy is re-evaluated. This procedure is repeated until the total rotation value is reached, at which time a plot of energy vs. rotational angle is displayed. This plot may be printed via its FILE menu. the ROT_E method is only approximate since the energy is not minimized at each step of the calculation. This method is useful for a fast scan of the rotational energy minima, which can then be studied in more detail using the ROTATE_BOND and MINIMIZE options. A more accurate barrier may be obtained by using the ROTATE_BOND command under the EDIT menu to rotate the bond to a maximum value, then repeating the ROT_E calculation.

Dihedral Driver
The DIHEDRAL_DRIVER option is an accurate, but more time consuming alternative to the ROT_E option. With the DIHEDRAL_DRIVER option, a particular dihedral angle is fixed at a specified value, and all the other degrees of freedom are allowed to relax. Up to two dihedral angles can be driven in one calculation. Before DIHEDRAL_DRIVER is chosen, the SELECT option from the TOOLS menu should be used to mark the 4 atoms of the bond of interest. When DIHEDRAL_DRIVER is chosen a dialog box will appear, displaying the selected atom numbers, along with the current value of the angle. Values for the starting angle, final angle and step size for the first angle must be entered. If two angles are to be driven, select "Get Second Angle". The dialog box will be dismissed, and the four atoms forming the second angle can be SELECTED. (The four atoms forming the first atom will be labeled with "d1".) Selecting DIHEDRAL_DRIVER again will cause the dialog box to reappear, with the data for the

55

first and second angles present. The starting angle, final angle and step size for the second angle must be entered. When all the required information has been entered, click on CALCULATE. A new dialog box will appear for entering the filename for the output structures. During the calculation, the coordinates of each step, along with angle 1, angle 2 and the energy will be saved in a multiple structure file in either PCM or MMX format for later display and analysis. The one or two dihedral angles are rotated to their initial values and the energy of the structure is minimized, while holding the one or two angles fixed. When the energy of that structure is minimized, the second angle is incremented by its step size and the minimization is repeated. This procedure is repeated until the final angle of angle 2 is reached, then angle 2 is reset, angle 1 is stepped to a new angle and angle 2 is stepped through again.

The option to use "last structure" or "original structure" is also presented in the dihedral driver dialog box. Choosing "last structure" (default) uses the previously minimized structure for the next calculation, while choosing "original structure" reads the original structure in, rotates the dihedrals to the specified angles, then calculates. The "last structure" option is faster if machine I/O is slow, but can have problems with very hindered structures where attached groups have a difficult time rotating past each other. The result of all these calculations is an n by m grid of points where n is total rotation of angle 1 divided by step size for angle 1, and m is the total rotation of angle 2 divided by step size of angle 2. The current limit on the grid size is 50 x 50. The DIHEDRAL_DRIVER method is more accurate than the ROT_E option since all other degrees of freedom are allow relaxed. However, it is more time consuming, and in molecules with more than three rotatable bonds one needs to ensure that other rotatable bonds have not rotated into a new energy minimum during the dihedral driver sequence. This often happens when attempting to generate Ramachandran plots of dipeptides. Again, the calculation may be aborted by pressing the ESC key. To reset the dihedral angle, use the RESET command under the MARK menu.

56

Chapter 10 Analyze Menu


Surface Area Volume Compare Dihedral Map Movie Dot Map Multi Strucutre File Assign Sym Calculate Surface area of molecule. Calculate Volume of molecule. Compare different structures. Display the results of dihedral driver calculations. Replay of all the structures in a multiple structure file Generate a dot map from a multi conformation file Analyze a file containing multiple structures Assign point group symmetry.

Surface Area
This option calculates the exposed surface area of the current molecular system. A stochastic algorithm is used which generates points on the surface randomly, determines if the point is exposed or not, and counts the number of exposed and unexposed points. A dialog box is used to set the number of points calculated for each atom (default 100), and the number of times the calculation is repeated (default 20). Since the method is stochastic the calculation is normally repeated several times and the average surface area values are reported. The calculation may be aborted by pressing the ESC key. When the calculation is complete, a window will appear with the total surface area, saturated area, unsaturated area and polar area. (The atom types of the atoms are used to divide the surface into non-polar, unsaturated non-polar and polar surface areas.)

Volume
This option calculates the volume of the current molecular system. Both molecular volume (in cubic ) and molar volume (in cubic cm) are reported. The calculation may be aborted by pressing the ESC key.

Compare
This option allows the comparison of two or more structures. The main structure is, by definition, the structure with the lowest substructure number. If only one structure is present when the Compare option is first selected, the structure is redrawn colored by substructure and a dialog box is displayed so that a comparison structure can be read in. If two or more substructures are present when COMPARE is selected, the structures are redisplayed colored by substructure and the compare dialog box is then displayed.

Comparisons can be done an all atoms (default), heavy atoms only, or selected atoms only. In order to use the Selected atoms option, the atoms must be SELECTed using the SELECT button from the TOOLS menu before the COMPARE option is chosen. The atoms may be marked in any order, but the calculations will match the first selected atom of one structure with the first selected atom in the second structure, and so on. Alternatively, one atom in the first structure may be selected, followed by the atom to be matched to it in the second structure. The other option in this dialog box are:

57

Calculate - starts a least squares calculation which attempts to minimize the differences in the positions of the atoms being compared. The results of the calculation will be given in the Compare window. The table of results will include 3 columns - atom in structure 1, corresponding atom in structure 2, and distance between them in angstroms. At the bottom of the table are the average difference between the distances, and then the root mean square difference in the distance. If the output is too large to fit in the window, the window can be scrolled. The structures will be drawn in the Structure window overlapped with red lines connecting the atoms that are being compared. If the structures overlap well, the red lines may not be visible. Reset - clears the comparison arrays and removes the red lines between comparison atoms from the drawing. This button is essentially used only for cleaning up the screen. Next Structure - allows reading in another structure for comparison. If two or more structures are currently present, all but the main comparison structure will be hidden. a file dialog box will be presented for selecting the desired structure file, the structure will be read in and displayed to the right of the current structure, colored in the appropriate substructure color. Hide - makes all the comparison structures invisible. This is useful to keep the screen from becoming cluttered when many structures are being compared. Show All - makes all comparison structures visible. This is useful to display several structures that have been compared. Cancel - dismisses the Compare dialog box. All comparison structures are deleted, and the main structure is re-displayed. It is necessary to erase all the comparison structures since they occupy the same coordinates as the main structure (if the comparison has been successful), and would cause any subsequent molecular mechanics to fail.

Dihedral Map
This option allows redisplay of the results of a prior dihedral driver calculation. A dialog box is presented prompting for the filename of the saved calculation. The file is then read and a one angle or two angle plot is presented. See the FILE OPEN command for a detailed discussion of the use of the file dialog box, and the DIHEDRAL_DRIVER option in the ANALYZE menu for a full discussion of dihedral driver calculations.

Movie
This option reads a multiple structure file and shows all the structures in sequence. The file formats currently supported include PCM, MMX, Gaussian output and Cambridge Structural Database. PCM type multiple structure files are often generated during a dynamics run, or part of a GMMX search. Thus the results of a dynamics run can be replayed using this options. Another use of this option is to view the results of a Gaussian reaction coordinate scan. You will first be asked to select a file for play. The options in this dialog box are: Play Plays the movie. Step Steps through the file one structure at a time. Loop Continuously plays the movie. Stop Stops the Play or Loop. Exit - Exits the dialog box and returns to PCMODEL.

58

The total number of structures in the file (5 in the illustration) and the current structure number (1) will be displayed. The track bar reports the progress of the movie but does not respond to clicks of the mouse.

Dot Map
This option reads in a file from a GMMX search, compares a selected set of atoms and orients each structure to overly the first structure, then plots the position of a selected atom from each structure relative to the original structure. The result is a new structure containing the position of every place where the selected dot atom can visit in the various conformations. See Midland, M.M.; Plumet, J.; Okamura, W.H. Effect of C20 Stereochemistry on the Conformational Profile of the Side Chains of Vitamin D Analogs, Bioorganic and Med. Chem. Letters 1993, 3, 1799-1804. for an application in the vitamin-D field. After selecting Dot Map, a file-read dialog box will appear. Usually you will want to use the first structure for the comparison. Next enter the atom numbers for the comparison overlay. This can be done by either typing in the numbers separated by a space or comma, or by using the Sel-Atm option from the Draw Tools. If Sel-Atm is used, click on Update to fill in the edit box. Next select the atom for the dot. This can be done by entering an atom number or by reselecting Sel-Atm, clicking on the appropriate atom and then on Update. The dots will look bigger on the screen then in the print out. Do not add hydrogens to the resulting structure, particularly if the dot atom is an oxygen or nitroge n. An exampl e of a dot map is shown below.

MultiStructure

59

Multi structure/conformation files can be analyzed. There are three main options- View Structures, Analyze Structures and Compare Structures. The View structures option provides a quick way to step through a file and look at individual conformations. A scrolling list of all the structures in the file is shown and the file is displayed when selected from the list. This is very similar to a standard file open, but the scrolling dialog box is not dismissed and no questions are asked.

Assign Symmetry
This option attempts to assign the point group of the current structure using the cartesian coordinates. This option attempts to assign the point group of the current structure using the Cartesian coordinates.

60

Chapter 11 Substr Menu


This menu contains the following commands: Read Create Move Connect Erase Show Dummy Don't minimize Read a structure file and label the structure as a substructure. Label the selected structure as a substructure Moves the selected substructure. Connect two structures. Removes the selected substructure from existence. Display any dummy atoms in current structures. Marks the selected substructure so that it will not be minimized in a molecular mechanics calculation.

Substructures are a method for naming and manipulating individual molecules. By default any structure read into PCMODEL using the OPEN command, or any structure drawn in the Structure window is initally assigned to substructure number zero. Several molecules may be drawn in the Structure window, but they all belong to substructure zero and all will be moved, erased or minimized together. In order to manipulate the structures individually it is necessary to mark them as substructures. The commands available in the Substructure Menu provide all the facilities necessary for handling substructures.

Read
The READ command reads a structure file into PCMODEL, marks it with a unique substructure number and displays it to the right of the current structure. The current structure is not erased. A file open dialog box is used for obtaining the filename, filetype and path (see the FILE OPEN command for a detailed description of this dialog box). Up to thirty two substructures may be created at one time in PCMODEL.

Create
The CREATE command marks a structure present in the Structure window as a substructure. Before choosing CREATE, at least one atom in the structure must have been marked with the SELECT button from the TOOLS menu. When the CREATE command is chosen, a dialog box appears requesting a name for the substructure. This name is associated with the substructure and can be used with the MOVE, ERASE, HIDE and DON'T MINIMIZE commands.

Move
The MOVE command allows one substructure to be translated and or rotated with respect to all the other structures. There are several methods for selecting the substructure to move. One method is to use the SELECT button on the TOOLS menu to mark an atom in the substruture to be moved before chosing MOVE. Alternatively, if no atoms have been selected when the MOVE command is chosen, a dialog box will be presented listing all substructures by name, and the substructure to be moved can be chosen by selecting its name from the list with either the left mouse button or the up/down arrow keys. After a substructure has been selected, click on OK, and the Control Panel dialog box will appear. The translate and rotate controls will now only work on the selected substructure. Rotation will be about the center of the selected substructure. To change the substructure being moved, simply select an atom in another substructure. The Control Panel controls will now operate on that substructure. Complex docking can be accomplished by translation and rotation of selected substructures. To rotate the entire collection of molecules as a unit it is necessary to Exit the Control Panel dialog box, and then chose the CONTROL_PANEL option from the VIEW menu.

61

Connect
The Connect command connects two structures. First, SELECT the two hydrogens to be removed to make the new bond (using the SELECT button in the TOOLS men). The hydrogens will be removed and a new bond will be drawn between the two heavy atoms that the hydrogens were attached to. Please see Chapter 3, Tutorials, for a step by step description of connecting two molecules.

Erase
The ERASE command allows a substructure to be erased (deleted from existance). As with SUBST MOVE, any atom in the substructure to be deleted may be SELECTed before chosing ERASE, or if none are selected a dialog box will list the possible substructures. There is no prompt for confirmation. Once the structure is deleted it can not be recovered.

Show Dummy
The SHOW DUMMY commands redraws the structure with all dummy atoms (atom type 60 in MMX) shown with gray dots over the atom. Dummy atoms can be placed in a structure using the Dummy Atom type from the periodic table. Dummy atoms have the same properties as hydrogens and can thus be minimized, but they are not removed by HAD. The main purpose of Dummy atoms is to serve as place holders for points of connection while building polymers or other large structures. See the Tutorial Chapter of the use of dummy atoms in building poly-stryene.

Don't Minimize
The DON'T MINIMIZE command marks a substructure so that it will be ignored during an energy minimization. This is a useful procedure when studying interactions between surfaces and molecules, in which the interactions between the surface and the molecules are of interest, but minimizing the surface would not be practical. This allows building surfaces that are only two or three atoms deep, and thus would not maintain their geometry if minimized, reading the surfaces into PCMODEL and then studing the interaction of these surfaces with various molecules. As with SUBST MOVE, any atom in the substructure to be deleted may be SELECTed before chosing DON'T MINIMIZE, or if none are selected a dialog box will list the possible substructures.

62

Chapter 12 Mark Menu


This menu contains commands that H Bonds Pi Atoms Metal Coord TS_BondOrders Fix Distance Fix Angle Fix Torsions Reset Hydrogen Bonds Pi atoms. Set electron count, charge and coordination of metal atoms. Set the bond orders for transition state bonds. Fix a distance between any two atoms-up to 10 pairs Fix an angle. Up to 10; Fix a dihedral angle Reset any of the above and also substructure membership.

H_Bonds
The MMX force field contains an extra term to correct deficiencies (in the original MM2 force field) in describing hydrogen bonds. This term is turned on or off depending upon whether the hydrogen bonds are marked. The default is to automatically mark hydrogen bonds and to turn on the extra term in the force field. If the hydrogen bond marking is turned off using the Reset option (see below), then the H_Bond option is used to turn on the hydrogen bond term again. Hydrogen bonds are shown in red between a donor hydrogen and acceptor atom when the distance between the two is less than 2.10 angstroms.

Pi Atoms
The MM2 and MM3 force field only contains one atom type to describe alkenes (atom type 2), which is used to describe all types of unsaturation. The default parameters for atom type 2 are set to reproduce the geometry of an isolated double bond, such as ethylene, and will not accurately represent the geometry or energy of a conjugated system such as butadiene or benzene. The solution to this problem is to do a simple pi vescf calculation on conjugated systems. This calculation gives the bond orders of all the bonds in the conjugated system, and these bond orders are used to adjust the stretching and torsional parameters for those atoms in the conjugated system. The pi atom marking is used to tell PCMODEL which atoms belong to the conjugated system and that a pi calculation should be done. Selecting the Pi atom marking will automatically mark all conjugated atoms. Reset is used to unmark the atoms. When Pi atoms are marked and minimize is selected PCMODEL will first do a pi calculation on the conjugated system, the parameters will be adjusted and then 15 iterations of geometry optimization will be done. A pi calculation is then done on the updated geometry, the parameters are again adjusted, and a complete minimization is done. When the minimization is complete another pi calculation is done, followed by another minimization. The minimization ends when no change in the geometry is observed after a pi calculation.

63

Metal Coordination
The Metal Coord option is used for coordinating lone pairs and pi systems to a metal atom, setting the electron count and geometry of a metal, and setting the charge on the metal. The metal atom of interest and any coordinated atoms must first be selected using the SELECT button on the TOOLS menu. After selecting Metal Coord a dialog box will be presented showing the metal atom symbol, the current charge if any, and a set of radiobuttons describing the electron count and geometry. The force field model was designed with no charges on the metal atoms (default charge is 0). Setting a charge will turn on electrostatic interactions with other atoms and should be treated with caution. The electrostatic model within molecular mechanics is not well defined or well tested, but it is known that formal charges do not work well. You should check your calculations against known compounds as you make modifications. If the Metal Coord option is not used, all metals are assumed to be coordinately saturated, and there will be no attractive potential between the metal and the lone pair or p-orbitals of ligands which were not explicitly bonded during the DRAW phase of the input. Atoms coordinated to the metal will be shown with a dotted line between the atom and the metal.

TS_Bond Orders
If transition state atom types are used, their bond orders must be set before a minimization can be done. When the TS_BondOrders option is selected a dialog box appears listing the transition state bonds found and the current bond orders (if any are known). The bond orders should be entered in the appropriate edit boxes. This data will be written to the structure file when the file is saved.

Fix_Distances
The Fix_Distances option is used to fix the distance between two atoms, and up to ten pairs can be fixed at one time. The two atoms whose distance is to be fixed should be selected using the SELECT button on the TOOLS menu, then Fix_Distances can be selected from the MARK menu. A dialog box will appear listing the two atoms, the current distance and two edit boxes, one for the distance to fix at, and the other for a force constant (default 5.0 mdyne/angstrom ). The distance is fixed by creating a bond between the two atoms with the default distance set at the fixed distance, and with a force constant input. The default force constant works for most cases. Fixed distances can be combined with energy minimization or dynamics to build large structures with specific interactions, for example, folded peptides (where the fixed distances come from NMR experiments).

Fix Angle
The Fix Angle option is used to fix the angle made by three atoms and up to ten angles can be fixed at one time. The three atoms of the angle should be selected first using the Select button, and then Fix Angle from the Mark menu. A dialog box will appear giving the atom numbers of the atoms of the angle, the current angle and edit boxes for the angle to fix at and the force constant to use.

Fix_Torsions
The Fix_Torsions options is used to fix up to two dihedral angles. The four atoms defining the dihedral angle must first be selected using the SELECT button in the TOOLS menu. The Fix_Torsion option will then present a dialog box listing the four atoms, the current angle, and an edit box for entering the angle to fix at. The routines used for fixing a torsion are the same as those used for the dihedral driver and are limited to two dihedral angles at one time.

64

Reset
The Reset option brings up a dialog box which allows resetting all the options given above, and also substructure membership.

65

Chapter 13 Options Menu


This menu contains commands that Printout Dielc DPDP Minimizer MMX_PI Options Added Constants Standard Constants Stereo Pluto VDW Surface Dot Surface Sets the amount of output in the file PCMOD.OUT. Resets the dielectric constant. Switch between dipole/dipole and electrostatic calc. Control which minimizer to user. Set options for Pi calculations. Allows the addition of user defined constants Forces the use of the default data set Changes the direction of rotation of stereo display. Set Pluto options Set VDW display options Set Dot Surface display options.

Printout
The PRINTOUT option sets the level of information written to PCMOD.OUT. The default is to write a minimal output file which lists that atomic charges, any generalized constants used and the initial and final energies for a calculation. Full printout gives a complete list of all interactions used in the calculation of the final energy. Each bend, stretch, torsion, VDW interaction and charge-charge interaction is listed with constants used, current geometry and the energy contribution from that item. A full printout can be quite long. The current option (Full or Min-imal) is indicated in the Output window.

Dielc
The DIELC option sets the dielectric constant which has a default of 1.5. It is rare for increases in DIELC to reproduce the properties of molecules in higher dielectric solvents, so this should be used with great care. If a negative value for DIELC is input, then the charge-charge interaction will be calculated with a distance-dependent dielectric constant (Dielc = f(Rij )), an option that is used successfully in the AMBER program when attempting to model the effect of a polar solvent or lack of counterions. Setting Dielc to -1. will adversely effect calculations done with dipole-dipole interactions.

DPDP
The DPDP options allows choice of either dipole-dipole or electrostatic calculations. The default in PCMODEL is to use electrostatic calculations with the charges being calculated from the default bond moments. This is a more general procedure and allows incorporation of charged species. However, the MM2 force field was originally developed with the dipole dipole method. The charges for the electrostatic calculations are, in most cases, derived from the bond dipoles programmed into MM2 for the bond dipole option, although heteroatom pi systems will also contribute charges. The default electrostatic calculations use the MM2 charges but also include the dielectric constant in the denominator of the energy expression. Electrostatic calculations should be used with atoms having charges (e.g., N+, C-) and they will also allow the halogens to be considered as ions if they are not directly bonded to any other atom. The only reason to use DP-DP is to compare a calculation to an MM2-like calculation.

Minimizer
This options controls the default actions of the minimizer. PCMODEL now includes a first derivative minimization method, BFGS, and both truncated Newton-Raphson and full Newton-Raphson second derivative minimization methods. The default behaviour is to do a first derivative minimization until the gradient falls below 1, and then a second derivative minimization until the gradient falls below 0.0001. This dialog box allows you to switch to a first derivative only minimization, second derivative only method or to the mixed method. While the mixed method appears to be the most robust method for

66

general use, there are times when the path taken through conformational space is dependent upon the choice of minimizer.

MMX_PI Calculation Options


This option allows setting a number of options for the pi calculation. The choices are: RHF or UHF Choose between Restricted Hartree Fock and Unrestricted HF calculation. For closed shell molecules with all the electrons paired the RHF option should be used. The UHF option is used for species with unpaired electrons (monoradicals, diradicals etc.) or for higher spin states. Mult Set the multiplicity of the system. The default is 1 for a singlet state. Huckel or Full SCF Set the type of the initial pi calculation. The default is to do a simple Huckel calculation which ignores the geometry of the pi system in determining the bond orders. This method is preferred when the starting geometry is poor. The Full SCF uses the geometry of the pi system in determining the bond orders and will give better bond orders if the starting geometry is close to the final geometry. Non-Planar or Planar Pi System- This tells PCMODEL whether the entire pi system is planar or non-planar. The default is to assume a non-planar pi system and check for non-planarity. Use the Planar option only if the entire pi system is co-planar. All of these options are reset when a structure is erased.

Added Constants
The Added CONSTANTS option forces allows the use of a file of user defined constants to be used in addition to or as replacements for the standard constants. The user is prompted for the name of the added constants file, which must use the same format as the standard constants file. PCMODEL checks as the added constants are read and replaces the standard constants if one exists or adds the new constant to the bottom of the parameter list if no previous constant has been defined. To return to the standard constants use the Standard Constants button below.

Standard Constants
The STANDARD CONSTANTS option forces PCMODEL to use the default constants. All parameter arrays are set to zero and the default parameter files, mmxconst.prm is read

Stereo
The Stereo option sets the direction of rotation for the stereo display. The options are to rotate the two structures inward (cross eyed display) or outward (wall eyed display, default). The selecteds option remains in effect until changed.

Pluto
Choosing this option brings up the following dialog box:

67

Ang1 and Ang2 - Control the position of the light source. Shade Controls the density of the shade lines, lower number gives more lines. Ball_Stick or CPK Produces a ball and stick structure or a space-filling structure. Bond Taper A higher number will give a greater perspective taper to bonds. Bond Radius Controls how wide the bonds are when drawn. A large number gives wider bonds. Bond Lines How many lines are drawn per bond. The default values are given in the example dialog box.

Vdw Surface
Upon choosing this option the following dialog box appears:

The atoms to be displayed in this manner can be all atoms, heavy atoms only or selected atoms only. In order to use the Selected-atoms option, the atoms must be SELECTed using the SELECT button from the TOOLS menu before the CPK option is chosen. The radius for the CPK model can be varied from the van der Waals radius (1.0, default) down to 0.65 (the .20 ball and stick option has been removed). The color by option has not been implemented. The CPK model continues as the default display until either the STICK FIGURE option is chosen, or the CPK_Surface dialog box is brought up and canceled. The CPK models can be rotated and translated. CPK models, dot surfaces and Ribbon diagrams can all be displayed at the same time.

Dot Surface
Choosing this option brings up the following dialog box:

68

The dot surface is generated using the Lee and Richards algorithm and two different radii are available - the van der waals radius (default) or a water surface (van der waals + 1.4 ). The CHARGE option uses the van der waals radius and colors the surface by charge, with redder colors representing positive charge and bluer colors representing more negative charges. White is used for neutral atoms. The atoms to be displayed in this manner can be all atoms or selected atoms only. In order to use the Selected atoms option, the atoms must be SELECTed using the SELECT button from the TOOLS menu before the DOT_SURFACE option is chosen. The Dot Spacing option controls the number of dots displayed on the surface. A closer spacing gives more dots, but takes longer to draw and manipulate (rotate and translate). The dot surface model continues as the default display until either the STICK_FIGURE option is selected, or the DOT_SURFACE dialog box is brought up and canceled. CPK models, dot surfaces and Ribbon diagrams can all be displayed at the same time.

69

Chapter 14. Conformational Searching


GLOBAL-MMX (GMMX) is a steric energy minimization program which uses the currently selected force field to search for the global energy minimum and other low energy local minima. Processing by GMMX is done in two stages. The first stages randomly searches over the selected rings and rotatable bonds and keeps all conformers within 3.5 Kcal of the lowest energy conformer found (Eminim) during the minimization. The second cycle reminimizes the structures found in the first cycle and keeps only those which are within 3.0 Kcal of the lowest energy conformer. The default energy windows for the two cycles can be changed. In addition to an output file containing the coordinates of the final structures, a textual summary file called '<OUTPUT_FILENAME>.pkm' is produced which lists the energy and Boltzmann distribution of each low energy conformation. This file may also include a listing of query operations (PMR coupling constants, distances, angles, and dihedrals) for each structure as well as a Boltzmann averaged summary. Although the conformational searching techniques that follow are unique in their approach, the methods described by the Still group (see M. Saunders, K.N. Houk, Y-D Wu, W.C. Still, M. Lipton, G. Chang, and W. Guida J. Am. Chem. Soc. 112, 1419 (1990) and references to previous work cited there in.) were the inspirational source for this work. The search techniques in GMMX are based on the methods used in BAKMDL, developed by Professor Kosta Steliou of Boston University, and ported to the MMX force field by Mark Midland of UC Riverside and Joe Gajewski and Kevin Gilbert of Indiana University. We wish to thank Professors Still and Steliou for sharing with us unpublished work and code that greatly enhanced our routines. Conformational space can be searched by GMMX in three ways. These are the Mixed method, the Bonds method and the Cartesian method. The Bonds method randomly selects a subset of the bonds designated by the user for rotation. Bond rotation can cause large changes in the shape of a molecule. The Cartesian method randomly moves a subset of all heavy and nonvolatile atoms in 3d space causing small changes in the shape of the molecule. The Mixed method alternates between the Bonds and Cartesian methods and is the preferred method. This procedure is especially efficient with cyclic structures having side chains. You may select ring bonds for rotation as well as the side chain bonds but restricting the bond selection to the side chain bonds usually is all that is necessary if only 3 to 7 member rings are present. The coordinate (Cartesians) movements will apply to the entire structure.

Figure. GMMX setup dialog box. When GMMX is first selected from the Analyze Menu a dialog box will be presented as shown above. This dialog box controls the setup and running of GMMX. Items that can be set directly in this dialog include the

70

jobname, the search method (bonds, coordinates or both (default)), and the energy windows for the first and second cycles. The buttons Setup Rings, Setup Bonds, Comparison Method, Setup Queries, Options, and Read Job all call up further dialog boxes to setup the various options. Save Job allows the user to save a file of the current GMMX options, and Read Job reads this file. Run GMMX starts the first cycle processing while Run 2nd Cycle starts the job in the second cycle, which presumes that a file of structures saved from the first cycle is available.

Setup Rings
GMMX will automatically search for all rings in the current structure and the results will be listed in a scrolling list window. To search on a particular ring select that ring from the list of rings and the select the Add button. In molecules containing fused or multicyclic ring systems all possible permutations of the rings will be listed. Thus trans decalin will have entries for two six membered rings and one ten membered ring. Care should be taken in selecting the correct ring entry to search. During the search the ring will be broken between the first two atoms of the Atom Number list. These are displayed as Closure Atom 1 and 2. The Closure Atoms may be edited but they must be adjacent atoms.

Setup Bonds
To designate rotatable bonds select the Setup Bonds option in the dialog box. A new dialog box will be shown. The atom numbers of the rotatable bond can be entered in the edit boxes. When the ADD button is selected the atom numbers are read and checked to see that a bond exists between the two atoms. If a bond is found then the default rotation increment for this bond is then determined and the bond is added to the scrolling list of bonds for searching. Bonds may also be entered by using Sel-Bnd from the Draw Tools. To use this option, select the bonds before bringing up the Setup Bonds dialog box.

71

The selection may be done while the main GMMX dialog box is up. Just move the box to the side and rotate and select the bonds. When Setup Bonds is selected, the selected bonds will appear in the scroll list. To delete a bond, select the bond from the scrolling list and then select Delete.

Bond Resolution If the bond is an ester or an amide, the resolution set will have two angles (0 and 180 degrees). If the bond is an SP3-SP3 bond, then three angles are used (0, 120, and 240 degrees). If the bond is SP3-SP2, twelve angles are considered (0, 30, 60, 90, 120, 150, 180, 210, 240, 270, 300, and 330). If the bond is SP2-SP2, 6 angles are chosen (0, 60, 120, 180, 240, and 300). Finally, if the bond is a ring bond, then if the ring is 10 atoms or less, the resolution set will have 12 angles of 30-degree increments as above and if it is 11 atoms or more, 6 angles of 60-degree increments will be used. The user can override these default settings (except for no-rotatable bonds) and enter their own values. Additional Ring Information For each ring defined, GMMX will calculate closure angles and a closure window. Again, the user can accept these values or enter others. Within GMMX each bond is randomly rotated by each angle in the resolution set. The newly created geometry is then analyzed to see if each ring can be reclosed within the defined limits of closure. It is checked for bad 1,5 C-C VDW interactions and trans annular interactions. Structures with up to 2 additional bad 1,5 C-C VDW interactions beyond the original structure may be kept. The structure is also screened for epimerizations and all other constraint criteria such as distances, dihedral angles etc. imposed during the interactive procedure. If all the constraint criteria are satisfied, the conformer is processed by GMMX.

Setup Query
The Setup Queries dialog box allows one to specify query information for the second cycle. Clicking on the four options will bring up dialog boxes for entering atom numbers. For distances the atom numbers must be entered in sets of 2, for angles sets of 3, for dihedrals sets of 4 and for coupling constants sets of 2. This process may be automated by performing the query on the structure on the screen. This may be done prior to setting up GMMX or while the main GMMX dialog box is on the screen. Use Query on the Draw Tools. When one of the four options is selected, the query information will appear in a scroll list. The final output file, jobname2.pkm, will contain a list of the queries for each structure as well as a summary total based on a Boltzmann average.

Comparison Method
The Comparison Method dialog box allows one to choose various options for structural comparisons during the search. The default is to compare all nonvolatile atoms. Nonvolatile atoms are those that are not removed in an H/AD operation, i.e. all heavy atoms and hydrogens on O or N. The equivalent atoms and Specific Sequence options have not yet been implemented. Numerical isomers are found in

72

unsubstituted rings such as cyclooctane. Reflection isomers are found in unsubstituted acyclic compounds such as butane. A set of specific atoms may be compared by clicking the Specific non-volatile atoms box and entering a set of numbers in the edit box. The default comparison routine uses a fast analytical method during the first cycle and a slower but more accurate method during the second run. It is usually better to err on the side of over collecting on the first run and then weed out duplicates on the second run.

Options
The Options dialog box allows one to change several options for the search. Chirality will be checked and enantiomers rejected if the box is checked. Hydrogen bonding will be on by default. If the system has pi atoms turned on, a pi calculation will be performed. Note that this may not be necessary and may require more time for rigid systems like benzene. By default, structures will be screened for bad 1,5 interactions and rejected if found prior to minimization. Conformational enantiomers such as the two gauche forms of butane will be kept. An early bail out of the minimization can be done. This applies a sliding high energy cut off of 5-15 kcal/mol above the minimum energy found to the structure during the 50-100 iterations of the firstderivative minimization. If the structure is 15+ kcal/mol above the lowest energy structure at the 50th iteration, it will be rejected and the search will move on. This cut off is not used during the second cycle. This method may lead to premature bail out if there are a lot of hydrogen bonds in the structure. The structures will be sorted by energy. The option to sort by heat of formation has not been implemented. The Boltzmann temperature is used to calculate the distribution of isomers. The cutoff distance and energy are used during structure comparisons. During both cycles all structures within 0.25 kcal/mol of the current structure will be examined. During the first cycle a structure will be considered to be unique if any atom is more than 0.25 Angstroms away for the least squares fit. In the second cycle the RMS average must be greater than 0.25 Angstroms. The random number generator uses the seed value. A search starting with the same starting structure will always follow the same path unless the seed value is changed. The search will stop after the lowest energy structure (emin) has been found 5 times or there are 25 duplicates in a row without a new structure. However at least 50 structures have to be minimized. The default number of duplicates required is 2.5 times the number of heavy atoms with a maximum of 50. The search will also stop if 100,000 structures have been minimized.

73

Run GMMX
Once all the setup is done GMMX can be run. In all methods the original structure is minimized to generate the initial minimum energy conformation. The structure is then modified by either the bonds or Cartesian method and the new conformation is checked for bad-1, 5 interaction, and epimerization. If all the acceptance criteria are satisfied, the structure is minimized. If the minimization yields an energy acceptable conformer that HAS NOT BEEN FOUND BEFORE, it is saved and its geometry is used to spawn a new conformer, otherwise, it is rejected. Only accepted conformers are permitted to carry-on the search. If attempts to create a new conformer fail, based on the acceptance criteria, it is sent back to try again until a NEW ACCEPTABLE conformer is created. Statistical searching on coordinates should be selected whenever a rigid or fused polycyclic system is to be analyzed. If there are side chains then the Mixed procedure should be favored. The Bonds method is best for acyclic structures.

Run 2nd Cycle


Run GMMX will automatically run the first and second cycle of the search. The second cycle refines the structures found in the first cycle by throwing out unnecessary duplicates and cutting down the energy window. It also performs any Query operations. The structure output files from both searches are kept. You can send the search output through the second cycle at a later time by using this option. The run will use any criteria (such as Query information) which you have selected.

Save Job
This option will bring up a file-save dialog box. The default file name will be the Job Name entered in the GMMX dialog box. The file save will save <JobName>.pcm and <JobName>.inp files. The inp file contains the name of the pcm structure file and the directions for doing the search. Thus the pcm file cannot be deleted or renamed if you wish to run this job later. The inp file may be read by the DOS or text-based version of GMMX.

Read Job
This option brings up a file read dialog box and reads the inp file saved in Save Job. All options will be updated to the settings in the job file.

74

15 FILE FORMATS
15.1 PCM File format
The PCM file format is a free format, context driven grammar for the description of molecular structures. This gives the user more and easier control of their structure files for additions and corrections, and leaves room for expansion of the definition of the force field. The context also allows for greater error correction within the program. At the end of this section is an example of a PCM file which describes the structure in Figure 6.1. Key words have been printed in bold for clarity. In this structure, we have included substructures, Pi atoms, hydrogen bonding, a metal, atomic charge, and coordinate bonds. The features of the file are fairly straight forward. From the top is the file type marker {PCM which indicates that the file is a PCM type file. This is required of all PCM files so that PCMODEL and MMX will recognize them. Immediately following the file type marker, on the same line, is the name of the structure. This name may be up to 58 characters long. On the next line appears NA followed by a number. This tells the number of atoms in the structure. This number is optional and is used only to check for consistency. Next, is SS 1 which introduces substructure number 1 and gives it a name which follows. The name can be up to 20 characters long. This is also optional. Currently, this statement only gives the substructure a name though at some point, it may be used to tell other aspects of the substructure. The fourth line contains FLags: EINT UV. Currently, the following flags are defined: EINT 0: dipole interaction energy will be calculated excluding interactions between dipoles with a common carbon. 1: as above but with all interactions. 2: dipole interaction calculation suppressed. 3: charge interaction energy will be calculated (excluding interactions between atoms bound to a common atom). If this option is used you must read in atomic charges. 4: charge interaction energy used. Charges calculated from bond dipoles with additions for pi atom charges. UV 0: if pi calculation, will be prompted before minimization: Number of electrons? Singlet, doublet, triplet, ? RHF or UHF calculation ? if singlet, RHF then restrictions on wave function? # of identical pairs of atoms? atom numbers of pairs? # of sets of identical bonds? atom numbers of bonds of bond pairs ? 1: default to singlet RHF pi calculation - no questions. 2: default doublet UHF pi calculation (for mono radicals) no questions. DIELC Floating point value of the new dielectric constant. Negative values give a distance dependent dielectric constant. (nonsense with dipole-dipole interactions). Line 5 contains the first atom record. After the atom specifier AT, the index number appears. These are the atom number and are used to reference the bond numbers. Next after some separator, usually a comma though a colon or a space will do, is that atom type number (see chapter 7 for a description of atom types). In the case of metals, the metal symbol is included in this space instead of the atom type number as in the sixth atom record.

75

Next, appears some separator; we use a colon here for clarity. Then in order, is X, Y, and Z in angstroms. After the next separator, the bond marker, B, appears the list of bonds attached to the atom. 2,1 means a bond to atom two is a single bond. In atom 6, you will notice 10,9, which indicates a metal coordinate bond (type 9) to atom 10. At the end of the first few lines is a C followed by a number. C specifies a charge. Again looking at atom 6, an M appears. Following M is the specified electronic state of the metal, in this case 3 for low spin. R introduces the covalent radius. This number is used to determine the bond lengths of all ligand attachments to that metal. The record for atom 8 contains an H which indicates that it is a hydrogen bonding hydrogen. Atom 9 contains a P which marks it as a pi atom and an S, which refers to the list of substructures of which this atom is a member. The comma after the S is optional but is added for clarity. Finally, the structure file is closed with a }. This is mandatory so PCMODEL will know when to stop reading.

Figure 6.1 Structure of molecule described in PCM file below.

76

PCM Example File


{PCM example pcm file NA 31 SS 1cyclo pentadiene FL EINT4 UV1 PIPL1 AT 1,8:5.00395,5.41685,4.14865 B 2,1 5,1 8,1 14,1 C .04109 AT 2,1:5.35786,3.99136,4.43375 B 1,1 3,1 15,1 16,1 C .04758 AT 3,3:3.89305,3.05742,4.72868 B 2,1 4,2 7,1 C .31634 AT 4,7:2.81165,3.08691,5.38735 B 3,2 6,9 17,1 18,1 C-.29483 AT 5,1:4.01102,5.78059,5.13175 B 1,1 6,1 19,1 20,1 C .00583 AT 6,Fe:3.37201,4.37477,5.09242 B 5,1 4,9 9,9 10,9 11,9 12,9 13,9 M3 R1.26000 C1.00000 AT 7,1:4.34527,1.64177,4.88598 B 3,1 21,1 22,1 23,1 C .04175 AT 8,23:5.80025,6.06569,4.07001 B 1,1 H C .15724 AT 9,48:2.72317,5.71177,3.69643 B 10,1 13,1 6,9 24,1 S, 1 P C-.03818 AT 10,2:2.74283,5.08259,2.49706 B 9,1 11,2 6,9 25,1 S, 1 P C-.03815 AT 11,2:2.37909,3.77508,2.58554 B 10,2 12,1 6,9 26,1 S, 1 P C-.03815 AT 12,2:2.02517,3.57846,4.02085 B 11,1 13,2 6,9 27,1 S, 1 P C-.03815 AT 13,2:2.22179,4.80733,4.59105 B 12,2 9,1 6,9 28,1 S, 1 P C-.03815 AT 14,20:4.70902,5.38735,3.60795 B 1,1 C-.21000 AT 15,5:5.98704,3.92254,5.39718 B 2,1 AT 16,5:5.92805,3.54897,3.55880 B 2,1 AT 17,20:2.37909,2.71334,4.92530 B 4,1 C-.05250 AT 18,20:2.75266,2.61503,5.84941 B 4,1 C-.05250 AT 19,5:3.46049,6.68504,4.76800 B 5,1 AT 20,5:4.51240,6.04603,6.10501 B 5,1 AT 21,5:3.51947,.88478,4.78767 B 7,1 AT 22,5:5.12192,1.35667,4.14865 B 7,1 AT 23,5:4.80733,1.51396,5.89856 B 7,1 AT 24,5:2.94928,6.78335,3.89305 B 9,1 S, 1 C .03818 AT 25,5:3.04759,5.59380,1.55329 B 10,1 S, 1 C .03815 AT 26,5:2.31027,3.01810,1.76957 B 11,1 S, 1 C .03815 AT 27,5:1.67126,2.64452,4.53206 B 12,1 S, 1 C .03815 AT 28,5:2.02517,5.05310,5.65279 B 13,1 S, 1 C .03815 AT 29,21:6.64571,6.42943,4.91547 B 30,1 AT 30,6:6.48842,6.06569,4.91547 B 29,1 31,1 AT 31,21:6.90132,5.87890,4.91547 B 30,1 } The PCM file syntax or grammer is given below. PCMFILE --> ( {PCM [NAME(60)] (ENTRY) * }) ENTRY SSNAME NATOM FLAGS CONST --> { SSNAME | ATOM_REC | NATOM | FLAGS | CONST | FIX_REC} --> SS [SP] NUM(16) [SP] [NAME(20)] --> NA NUM() --> FL [SP] ({ PR | UV | EINT | PIPL } [NUM()] SP) * --> CO {

77

ATOM_REC FIX_REC

--> AT [SP] NUM() [SP] ATYPE SP X,Y,Z ([SP] ATFIELD)* --> FIX { ATOM NUM() ( X Y Z ) | DIS NUM() NUM() ( R FLOAT K FLOAT) }

ATYPE --> { NUM(79) | NAME(2) } X,Y,Z --> FLOAT SP FLOAT SP FLOAT ATFIELD --> { BONDS | SUBSTR | ATOMFLG } BONDS --> B [SP] NUMBND SP (SP ATNUM,BNDORD )NUMBND NUMBND --> NUM(10) ATNUM,BNDORD --> NUM() (SP) NUM(10) SUBSTR --> S (NUM(16) SP)* ATOMFLG --> FL { P | M NUM(3) | H | C FLOAT | R FLOAT } *

NAME(n) --> letter{letter|digit|punc}* for size <= n NUM(n) --> {digit} for value <= n SP --> {comma|blank|colon|tab} FLOAT --> [-](digit)*[.](digit)(digit)*[{e|E}[{-|+}(digit)(digit)*] letter digit --> character a-z A-Z $ _ --> character 0-9

{ } - choose exactly one: all choices separated by | ( ) - grouping: multiple tokens treated as one token * - zero or more occurrences of previous token [ ] - optional (one or zero occurences) BOLD characters appear exactly Starter token MUST appear in column one ALL OPTIONAL SP NOTE: If the removal of an SP(space) from a string would place two members of the same token primitive class adjacent, the SP is not optional!

78

15.2 MMX INPUT FILES


It is a pleasure to acknowledge the contributions of Professor Allinger and his students to the algorithms and constants used in MMX whose progenitors are MM2 and the pi routines from MMP1. PCMODEL can read: A) a single structure file or B) a file containing multiple but separate structure files. For option B the first three characters on the first line of each new structure must be MMX. This tag is used by PCMODEL to recognize the start of a new structure. Each structure contains complete information about that structure. The file format for an MMX and MM2 file are similar, differing only in the lack of a line of logical variables to mark pi atoms in the MM2 file format. The file formats which follow are for FORTRAN formatted files, that is column placement is critical if the file is to be interpreted correctly. The format descriptors used in fortran are: a# character string of # length i# integer value up to # digits in size f fixed floating point number Thus the format for line one (see below) says to read 30 fields of two character (30a2), a series of integers that are 5 digits, 2, 3, 3, and 3 digits in length (i5,i2,i3,i3,i3) and a floating point number that is 5 units wide with no number beyond the decimal point (ie 10. ). For a more complete description please see any Fortran programming guide. The most important idea to remember is that data must be in specific columns in a fortran formatted file.

First line: columns 1-60 61-65 67

format (30a2, i5,i2,i3,i2,i3,f5.0) ID name, title, or information N number of atoms including lone pairs. IPRINT print control

Initial Calculation Only: 0 or 1 simple; 2 or 3 full printout. Part 1-Minimize Part 3-Final minimum full (can be 100+kb) minimum simple (useful)* full full (can be 100+kb) simple simple minimum minmum (bare bones) no print (Driver has its own out files)

0: 1: 2: 3: 4: 5:

* simple print omits VDW interactions <.1 kcal/mol 70 72 74 NRSTR 0: INIT no restricted atom data 1: 0: standard minimization yes (see below) 1: initial calculation only

IDOCK docking if zero then none but MMX can provide other options including simulated annealing of intermolecular interactions. These require the beginning atom number for the substructure in the atom list. This number is on NCON line (3rd line).

79

75

NCONST

0:

no added constants

1:

added constants

If NCONST = 1, added constants may be appended to input file see below or in the form of a file which will be read if there are no input lines after the 13th line as defined below. WARNING, if there are added constants, the use of the multi coordinate input files will require that the added constants be in the input file NOT in an external file.Since PCMODEL now uses a text file of added constants this part of the MMX file format is not used. 76-80 time - ignored in all versions format(60l1,i2,2x,i2,2x,2i2,4i1,2i2)

Second line 1-60

LOGARY: t or f in each column indicates whether or not the atom number represented by the column number is to be considered a pi atom in a VESCF calculation. If no atom numbers are marked t, no pi calculation will be performed. If there are more than 60 atoms, add more lines with these logical variables up to 4 lines total. non planar

62 NPLANE 0: planar pi calculation 1:

In a planar pi calculation, the first calculation is on a Huckel matrix whose elements depend on the atom types and geometry Diagonalization gives a density matrix which is used along with Slater orbital exponents and calculated overlap integrals to give the elements of the FOCK matrix which is diagonalized to give a new density matrix. Subsequent iterations give a self-consistent field with an invariant energy. In a non planar calculation two scf calculations are required-one on the non planar system and one on a planar projection. The bond orders of the planar projection are used to weight the natural distances, force constants, and two-fold torsional barrier terms, see below (ADDED CONSTANTS). 66 JPRINT: print out for pi calculation 1st SCF during minimization energy only energy only energy + matrices energy only energy + matrices energy + matrices on each iteration ditto energy + matrices on each iteration LIMIT limit for self-consistency (10)-this integer (in ev). default is (10)-5 ITER maximum number of SCF iterations to be run. (ignored in this version with open shell pi calculation)

0: 1: 2: 3: 69-70 72

73 IHBD 0: 1: no read of hydrogen bonding logical array-all OH, NH, & SH hydrogens participate in hydrogen bonding. read hydrogen bond logical array after line 7 below; first line for 80 atoms; additional lines for up to total atoms; array marks which hydrogen atoms will hydrogen bond.

74 ICOV 0: no metal atoms are present in input file

80

1:

read line containing metal type and radius (3 different metals maximum) then read logical array of atoms coordinated to each metal; lines must be after the hydrogen bond array or after line 7.

75 IUV 0: if pi calculation, MMX will ask for options before minimization: number of electrons? singlet, doublet, triplet, etc.? RHF or UHF calculation? if singlet, RHF then restrictions on wave function? # of identical pairs of atoms? atom numbers of pairs? # of sets of identical bonds? # of identical bonds and # of atom pairs? default to singlet RHF pi calculation - no questions. default doublet UHF pi calculation (for mono radicals) no questions.

1: 2: 76 IHUCK 0: 1:

for full VESCF calculation before first MM energy minimization for Huckel calculation (of planar systems) before first MM; subsequent minimizations will be full SCF

78 NSETAT - number of sets of equivalent pi atoms 80 NSETBD - number of sets of equivalent pi bonds The previous two options are over-ridden by IUV greater then zero. Then lines of logary (in 80l) for atoms beyond 60. if NSETAT .gt. 0, must read ISETA(I) in 10i4, number of equivalent atoms in each set; then must read nsetat lines in (20i4) which contain the atom numbers in each set. if NSETBD .gt. 0, must read ISETB(I) in 10i4, number of equivalent bonds in each set; then must read lines in (20i4) which contain the atom numbers, in pairs, for each bond. A new set must start a new line. Third line format(i5,1x,i1,2x,f5.1,5x,9i5,5X,2i5)

(if no pi atoms in molecule with more than 60 atoms otherwise this is the 4th line) 1-5 NCON # of connected atom lists (not atoms) to read in 30 Max. 7 IDYN 0: 1:

no dynamics dynamics program is run interactively After filename is given, user will be asked: Timestep in femtoseconds (1.0 fsec is default) Viscosity in cp (0.000 is default) Temperature of Bath (300.K is default, initial sample temperature is 300.K so implies no heat in or out) Time constant for heat transfer (1015 femtosecond is default, this implies no coupling of bath to sample).

81

After 20 cycles, user can alter values and number of cycles. Dynamics can be used to remove KE increasing the viscosity or cooling with a time constant ~ timestep so that an unstable structure can be optimized slowly invariably to a local minimimum. At a given temperature kinetic and potential energy are conserved an provide ranges of motion. A minimized structure can be heated to escape a local minimum then cooled perhaps to a global minimum. Global optimization can also be performed by minimizing many initial structures generated by MODEL in MULTIC mode or PCMODEL in MULTOR mode - see NDRIVE 11-15 DIELE dielectric constant. If 0., default to 1.5. If negative, a distance dependent dielectric constant is used in the charge-charge interactions (DIELC=Rij). If a negative dielectric is used with the dipole-dipole calculations, nonsense will result. JSTART atom number begining substructure in atom list-for docking only-IDOCK of first line (col 74) must be set to >0. 0: no 1: yes cyclopropropanes or cyclobutanes present

16-20

25 IBUT

26-30 31-35

NATTCH # atoms attached to previously defined atoms. NSYMM number of symmetry matrices to read in. PCMODEL writes 0 for this variable. NX number of coordinate replacement lines to read this option is unnecessary with structure file generation programs like MODEL, and PCMODEL. See QCPE 395 documentation for its use. The coordinate replacement lines must immediately follow the cartesian coordinate input. LABEL 0: names and weights of atoms defined in program are used. #: number of different atom types whose names and weights will be changed below. PCMODEL writes 0 for this variable.

36-40

45-50

55 NDC dipole interaction enery will be calculated excluding interactions between dipoles with a common carbon. 1: as above but with all interactions. 2: dipole interaction calculation supressed. 3: charge interaction energy will be calculated (excluding interactions between atoms bound to a common atom). If this option is used you must read in atomic charges below. 4: charge interaction energy used. Charges calculated from bond dipoles with additions for pi atom charges. PCMODEL writes 0 or 4 (default). 60 NCALC determines crystal conversion and/or orientations to be performed on input coordinates. 0: 1: 2: no crystal options. input coordinates are reduced crystal coordinates. They will be converted to cartesian coordinates (see below). input coordinates will be oriented according to instructions below. 0:

82

3: Combination of 1 and 2 above. PCMODEL writes 0 for this variable.

65 HFORM 0: no 1: yes heat of formation calculated 2: read new values for partition function contribution PCMODEL writes 1 for this variable. 80 NDRIVE 0: 1: -1:

to the heat of formation.

no endocyclic dihd. driver side chain dihd.driver

For options -1 and 1, a data line must be added before an added constants list at the end of the input file. See below. In MMX (a separate batch processing program) if NDRIVE >= 10000: sets of coordinates can be read in after all other input. Each set must consist of one line in format i5 which indicates the number of the set (starting from 1), and the coordinates must be in the standard format (see SIXTH LINE below). The previously read connectivity information will be used in the calculation. Files with the filename + sequentially numbered extension will be generated. A brief output file, *.SUM, with the filenames and energies will also be generated; an energy ordered file MMXE.ORD is also generated. In MMX: if NDRIVE=-9999: global minimum finder by simulated annealing. Requires additional input lines after the loghbd and logmet lines of logicals. OPTIONS FROM LINE ABOVE: If LABEL _ 0 add # of LABEL lines in format (i5,3x,a2,f10.5) 4-5 9-10 11-20 ITY(i) atom type whose name and weight are to be redefined. NAM(i) two character name for new atom type. WGHT(i) Atomic weight for the atom. If NDC = 3 add lines whose format is 8f10.5 listing the charge on each atom-for N total atoms including lone pairs. Fourth line(s) ICONN(i,j) NCON lines each in format (16i5) i=number of the line (1 to NCON), j=number of entry on line (from 2 to 16)

Each list contains up to 16 CONTIGUOUS atoms and each list can repeat atoms in a previous list. Cyclohexane would have one list with values 1,2,3,4,5,6,1 Bicyclo[3.2.1]octane would have two lists the cyclohexane list above and a second list with values 1,7,8,3 There are no restrictions on lists other than to avoid non contigous atoms during the sequence.

83

Fifth line(s) NATTCH lines each in format (16i5) number of lines sufficient to add all atom pairs up to NATTCH in format (16i5). The pairs are: JATTCH(i), KATTCH(i) where i equals 1 to NATTCH. JATTCH(i) is an atom already defined either in NCON or previously defined as a KATTCH atom. KATTCH(i) is an atom attached to JATTCH(i). For cyclohexane NATTCH = 12 which would require two lines. The values are: 1,7,1,8,2,9,2,10,3,11,3,12,4,13,4,14 5,15,5,16,6,17,6,18 Sixth line(s) 1-10 11-20 21-30 31-35 36-40 X(1) COORDINATES format(2(3f10.5,2i5)) Coordinates in Angstroms of atom 1-These may be reduced crystal coordinates if NCALC =1 or 3.

41-50 51-60 61-70 71-75 76-80

Y(1) Z(1) ITYPE(1) MMX atom type( see Table of Atom types) MATOM(1) 36-38 atom number this atom is multiply bonded to. 39 (for metal atoms only 0:coordinately saturated; 1:unsaturated 2:>18 electrons 3:square planar). 40 Number of bonds to atom defined in 36-38 minus 1; to others will have matom = 00000 X(2) Y(2) Z(2) ITYPE(2) MATOM(2) Repeat lines until all coordinates are included.

Non metal atoms singly bonded

Seventh line(s) NX > 0 COORDINATE REPLACEMENT or ATOM ADDITION See documentation to QCPE 395 or QCPE MMP2(85) for options and data structure. Seventh line(s) If NCALC = 1 or 3 CRYSTAL CONVERSION in format (6f10.5) If reduced crystal coordinates were read in above instead of cartesians coordinated, the following parameters will convert them to cartesian. 1-10 11-20 21-30 31-40 41-50 A Dimensions of unit cell in Angstroms (a,b,c) B C ALPHA Angles associated with unit cell. BETA

84

51-60

GAMMA Hydrogen Bonding flags

Eighth line(s)

If IHBD=1 format(80l1). If N >80, more lines in format (80l1) are read. These should be "f"s unless a particular hydrogen, whose atom number is represented by the column number, will participate in hydrogen bonding-then a "t" should be in the column. Ninth line(s) If ICOV=1, Metal Names, Covalent Radii and Charge

Program checks number of metal atoms present (itypes 44,56,57) and reads line of metal characters, covalent radii, and the charge on each metal (necessary for isolated ions & might be useful for organometallics if electrostatics are used) in format (3(a2,f8.6),9f5.2); these variables are amet,rcova,bmet,rcovb,cmet,rcovc, and chrgmet(i), i=1,9. [Changing rcova will change the covalent radius used for all bonds involving the metal amet.] Then the number of metal logical lines of format(80l1) are read. A t or f indicates whether or not the atom number represented by the column number is coordinated (not bound as would appear in the NCON or NATTCH lists) to the first (or second etc. 9 Max) metal atom in the atom number list. If more than 80 atoms are present more lines should follow - in (80l1) format-to mark the rest of the atoms. Tenth line(s) 3-4 Is If NSYMM .ne. 0 read NSYMM lines in format(2i4,9f8.5) number of the independent atom in the symmetry-related pair it will be moved during the minimization. number of the dependent atom in the pair. Its coordinates are calculated from a symmetry matrix operating on IS.

5-8

Ks

ELEMENTS OF SYMMETRY MATRIX - ON SAME LINE 9-16 17-24 25-32 33-40 41-48 49-56 57-64 65-72 73-80 SXX SXY SXZ SYX SYY SYZ SZX SZY SZZ Amount by which X(K) must be changed when X(I) is moved by 1 in order to maintain molecular symmetry. Change in Y(K) for X(I) change =1. Change in Z(K) for X(I) change =1. Change in X(K) for Y(I) change =1. Change in Y(K) for Y(I) change =1. Change in Z(K) for Y(I) change =1. Change in X(K) for Z(I) change =1. Change in Y(I) for Z(I) change =1. Change in Z(K) for Z(I) change =1.

An atom may be used as Is as often as necessary but cannot appear in another symmetry pair as Ks. In order to specify symmetry correctly, the coordinate axes used in the minimization must be known. Do an initial calculation with INIT=1. This procedure is retained from MM2 but has not been tested in MMX. Eleventh line(s) Restricted atoms If NRSTR =1 first line in format (i5) is number of atoms whose motions are restricted, the remaining lines in format (16(i2,3i1) define the restrictions in any or all of the X,Y, and/or Z directions. Thus the atom may be kept along an axis, or in a plane, or at a point.

85

1-2 3 4 5

AT X Y Z

Number of the atom whose motion is to be restricted. 0: 0: 0: no restriction ditto 1: ditto 1: 1: no X axis movement. no Y axis movement. no Z axis movement.

repeat up to 16 times per line. Note that this will fail if the atom number to restrict is greater than 99. This option is often considered for transition states, but its use is limited in that connection. It is better to use transition state atoms whose constants can be generated by PCMODEL.

Twelfth line 1-5

if HFORM =2 read a line in format (3f5.2,f10.3) POPI

population increment (Enthalpy contribution due to higher energy conformations) Default=0 6-10 TORI torsional increment (Enthalpy contribution due to torsional degrees of freedom) Default=0 11-15 TROTI translation-rotation increment (Enthalpy increment due to translational and rotational degrees of freedom) Default=2.4 kcal/mol. 21-30 31-40 41-80 Experimental heat of formation (kcal/mole) Experimental error (absolute value) Literature reference (optional)

Thirteenth line Dihedral Driver Information if NDRIVE is not equal to 0. if NDRIVE < 10000 and NDRIVE .ne. -9999 in format(2(4i5,5x,3f5.0)) (Note that this line usually occurs at the end of an MM2 input file, but in MMX it must preceed an added constants list which is appended to the input file.) This is used to drive one or two dihedral angle(s) through a range of values and to minimize all other degrees of freedom at each point. 1-5 6-10 11-15 16-20 26-30 31-35 36-40 41-45 46-50 51-55 56-60 66-70 M1 M2 M3 M4 atom numbers for first dihedral angle. START1 starting angle FIN1 final angle DIFF1 step or increment N1 N2 N3 N4 atom number for second dihedral angle. START2 starting angle

86

71-75 76-80

FIN2 final angle DIFF2 step or increment Before use, remove restricted motions from atoms to be moved and symmetry. NDRIVE may be -1 for side chain or 1 for angles whose 2nd & 3rd atoms are confined to rings. The side chain driver is useful if the dihedral is not part of a ring. It carries out a rigid rotation for each step in dihedral angle. The endocyclic drive is confined within 0 and 180 degrees or 0 and -180. The step cannot be large (5-10 degrees is normal). The minimization will stop if either 0 or 180 degrees is achieved. Use the side chain driver if possible. IF NDRIVE >= 10000 - see description for Third line above-requires multiple sets of cartesians. MMX option only IF NDRIVE = -9999 - Global minimization by simulated annealing. MMX option only 1st line: format 6i5 krbnds - number of dihedrals to rotate, 30 MAX kring - 0: no ring 1: ring irange - for rings-angle window for closure usually 20 degrees krand - 0: randomize all dihedrals before an energy check (usual) #: randomize # dihedrals in the sequence below before an energy check igen - generating function type (use 0) 0: 15*temperature*(tan(pi*(ran-.5)) 0.<ran<1. 1: 360.*ran ntimes - Number of times structure with energy within eminct is found before exiting (usually 4). icomp - normally zero, if 1: read in three distances in line after dismin etc. 2: for tricoordinate analysis - not thoroughly tested. 2nd line(s) format(16i5) mbonds(kk,1),mbonds(kk,2),kk=1,krbnds : atom #s of bonds to rotate in pairs If ring is chosen above, the first atom pair bond is broken and used for evaluation closure distances and angles; the second atom pair should be an bond adjacent to the first because it will be rotated in order to obtain the best closure angle. 3rd line format(7f11.5) dismin for ring - minimum distance between mbonds(1,1) and mbonds(1,2) usually 1.50. dismax for ring - maximum distance between " " " " " " usually 1.58 resln dihedral angle to rotate through if igen = 0- usually 15 tempi initial temperature in kcal/mole usually 30.

codec decrement of temperature according to tempi*(1/(1.+ codec)) usually 0.5.

87

eminct

energy difference between acceptable structures to count for minimum found ntimes in order (usually 0.01)..

4th line format(6f11.5) if icomp eq.2 3X (distance, tolerance) cdis1,tol1,cdis2,tol2,cdis3,tol3 Thirteenth line if IDOCK = 9 For simulated annealing in docking-main and substructure rigid-requires that a substructure be at the end of the atoms list with the starting atom identified in cols 16-20 on the third line (NCON and other options). format(6f10.5,i5) xyzds factor for substructure translation-usually 1.0

resln constant for substructure rotation-usually 15. degrees tempi initial temperature-usually 3. kcal/mole

codec decrement of temperature according to tempi*(1/(1.+ codec)) usually 0.1 tempf lowest temperature before quitting usually 0.01 kcal/mole eminct energy difference between acceptable structures to count for the minimum if found ntimes in order (usually 0.2) number of times a low energy is obtained to assume it is a minimum usually 5.

ntimes

ADDED CONSTANTS if NCONST .ne.0 This may be the last set of entries in the input file or may be a separate file which MMX will read if this list is not at the end of the input file. PCMODEL will generate this file only for transition state bonds. MODEL on the VAX will generate this file. The first line is a header listing the numbers and types of constants to be read in format (8i5,3f5.0,5x,f5.0,i5) 1-5 6-10 11-15 16-20 21-25 26-30 31-35 NT number of torsion parameter sets. NS number of stretching parameter sets (includes bond dipoles). NV number of van der Waals' parameters sets. NB number of bending parameter sets. MUA number of bond dipole line (according to type) - use NS. MUB number of bond dipole lines (according to atom numbers) - use NS. NSB 0: no 1: yet stretch-bend parmeters to be read. NH number of bond enthalpy parameter sets-if NH_0, a heat of formation will be calculated regardless of HFORM.(untested in MMX) DLC new dielectric constant for dipole or charge interaction calculations-if DLC=0, the default, 1.5 is used. CST new cubic stretch term-default = -2.00 SF fraction of the bending constant that is to be used as sextic term in bending energy equations-if SF=0, a default is used (7E-8).

36-40 41-45 46-50 51-55

88

61-65 66-70

RDN hydrogen vdw reduction factor-if RDN=0, the default is used (.915)- if RDN=1, no reduction is used. KTR number of transition state bond types.

next line if NT > 0 TORSION CONSTANTS - NT lines in format (4i5,3f10) from: 1 over 2 V sub 1 left ( 1 + cos OMEGA right ) + 1 over 2 V sub 2 left ( 1 - cos (2 OMEGA) right ) + 1 over 2 V sub 3 left ( 1 + cos ( 3 OMEGA ) right ) 1 2-5 6-10 11-15 16-20 21-30 JBT I1 I2 I3 I4 V1 4 for cyclobutane torsion 0 for all others

31-40 V2 41-50 V3

atom type numbers for dihedral; I2>I3 or if I2=I3, I1<I4 One-fold barrier; if >0, 0 deg is potential energy maximum, 180 deg is potential energy minimum. Two-fold barrier; if >0, 90 deg is potential energy maximum 0 and 180 deg are energy minima. Three-fold barrier; if >0, 0 and 120 deg are energy maxima 60 and 180 deg are energy minima.

Note that ethylene has four torsional interactions making up the ca 60 kcal barrier so V2=15 for type 2 type 2 torsions. Ethane has six torsional interactions, etc. next line if NS > 0 STRETCHING CONSTANTS NS lines in format (2i5,6f10.5) from: EC = 143.88*1/2ks*(1+CST*(r-r0))*(r-r0)2 E sub c = 143.88 * 1 over 2 k sub s left ( 1 + cst ( r - r sub 0 ) right ) cdot ( r - r sub 0 ) sup 2 where CST is cubic stretch term. 1-5 6-10 11-20 21-30 31-40 41-50 51-60 61-70 I atom type number K atom type number; I .le. K S stretching constant (ks above) in md/angstrom for bond I-K T1 natural, minmum energy bond length (r0 above) in angstroms T2 if I and K have one or less hydrogens attached T2 is the length used; if T2 is zero T1 is used. BMOM bond moment for pair I,K; if >0, K is more electronegative atom SLPS for pi atom pairs: S = S-SLPS + SLPS * rhoI,K SLPT for pi atom pairs: T1 = T1+SLPT - SLPT * rhoI,K Note that the dipole-dipole interaction potential energy is:

Emu= 14.39418*BMOM(I,K)*BMOM(M,N)*(cos X - 3* cos A * cos B)/ DLC*r3 where r is the distance between the midpoint of the two bonds I-K and M-N; X is the angle between the two bonds; A is the angle between bond I-K and line represented by r; B is the angle between bond M-N and line represented by r; DLC is the dielectric constant whose default is 1.5. The dipole moment is calculated from the vector sum of all bond moments. If a pi calculation is performed, the dipole moment

89

from the pi system is added vectorially to the sigma dipole moment. There is presently no calculated interaction between the pi and sigma dipoles. next line if MUA > 0 BOND DIPOLE CONSTANTS BY ATOM TYPE MUA lines in format (2i5,f10.5) 4-5 6-10 11-20 I atom type number k atom type number; I _ K BMOM bond moment for pair I,K; if >0, K is more electronegative atom

It is more consistent to change the BMOM in the added Stretch constants since all six values are read from the data base. next line if MUB > 0 BOND DIPOLE CONSTANTS BY ATOM NUMBER MUB lines in format (2i5,f10.5) 4-5 9-10 11-20 N1 N2 V12 atom number of the less electronegative atom atom number of the more electronegative atom new bond moment for bond N1-N2. V12 should be negative if N1 is more electronegative than N2.

(untested in MMX) next line if NV > 0 VAN DER WAALS' CONSTANTS NV lines in format (i10,2f10.5) EV = eps*290000*e-12.5/p - 2.25*p6 where p = (RDi+RDk)/r0 < 3.311 eps = sqrt(EPi*EPk) "hardness" r0 effective distance (for CH bonds the H is effectively 10% closer to the C than the actual C-H distance). EV = eps*336.176*p2 if p > 3.311 9-10 11-20 21-30 IT atom type number whose parameters are to be changed EP epsilon (kcal/mole) for type IT atom RD radius in angstroms for type IT atom The user has no control over the parameters used for hydrogen bonding and coordination to a metal. These are included in the vdw parameters because the vdw attractive term for hydrogen bonding and for the lone pair or pi atom interaction with a metal is proportional to p2 not p6. The attractive are much larger than 2.25, depend on the donor and acceptor and angles, and is matched by a larger repulsive term. next line if NB > 0 BENDING CONSTANTS NB lines in format (i1,i4,2i5,2f10.5,i5) from: EB = 0.043828*1/2*kb*((theta0-theta)2)*(1+SF*(theta0-theta)4)

90

2-5 9-10 11-15 21-30 31-40 45

where SF is sextic bending constant (.00700E-5). JBN 3 for cyclopropane angle 4 for cyclobutane angle I1 atom type number; I1<I3; if I1 is -1 read OPB constants where I3 is the central atom which is OP. I2 atom type number I3 atom type number B bending constant (kb from above) T natural, minimum potential energy bond angle in degrees (theta) J 0: T value applies to all I1-I2-I3 angles 1: T value applies only if I2 has no attached hydrogens 2: T value applies only if I2 has one attached hydrogen 3: T value applies only if I2 has two attached hydrogens.

Out of Plane bending (OPB): trigonal atoms involved in out of plane bending have an in-plane component, which is characterized by the standard values from above, and an out of plane component with a different constant. The atom types which respond to OPB constants are types 2 (SP2C), 3 (carbonyl C), 9 (amide and pyrrole nitrogen), 29 (carbon radical), 30 (carbocation), and 48 (carbanion when it is a pi atom). The default values for these are similar to that in MM2 for type 3 namely: 0.8. next line if NSB > 0 STRETCH-BEND CONSTANTS NSB lines in format (4f10.5) from: ESB = 2.51118 * ksb * theta * (r1,2 + r2,3) where ksb are constants described below; theta is angle 1-2-3; r1,2 is distance between atoms 1 and 2; r2,3 is distance between atoms 2 and 3. 1-10 11-20 21-30 31-40 SB1 SB2 SB1H SB2H constant for angle a-f-c where f is a 1st row atom; a and c are any non-hydrogen atoms. Both bonds are considered in ESB. Default value is 0.12. constant for angle a-s-b where s is a 2nd row atom; a and c are any non-hydrogen atomsDefault value is 0.25. constant for angle a-f-H where f is a 1st row atom; a is any non-hydrogen atom; only bond a-f is considered in ESB. Default value is 0.09. constant for angle a-s-H where s is a 2nd rwo atom; a is any non-hydrogen atom; only bond a-s is considered in ESB Default value is -0.40

next line if NH > 0 HEAT OF FORMATION BOND AND STRUCTURAL PARAMETERS NH lines of format (2i5,2f10.5,i5,5x,5a4) 1-5 6-10 11-20 21-30 35 IT KT BE SBE NEW atom type number defining a bond or structural feature number atom type number defining a bond (zero if structural feature) bond enthalpy increment for IT-KT bond or structural feature bond entropy increment for IT-KT bond (zero if structural) 0: Default comment on bond enthalpy (use zero if structural) 1: change or add comment on bond enthalpy

91

41-60

NTITLE Comment on bond enthalpy (ignore if structural) (not tested in MMX) The user has no control over the parameters used for the heat of formation contribution from the pi system.

next line if KTR > 0 this line has the fraction bond orders used to generate the transition state constants file in MIO or in PCMOD. MMX does nothing with these bond orders except prints them in the output for reference purposes. Thus users need not supply this information from an external file. The default values for parameters read by MMX are in a binary file called DATA.STO. An ASCII file of the values used can be obtained by running MMXDATA and requesting a file PARAM.OUT. It is no longer possible to submit a nul file to MMX and obtain a file with the parameters.

15.3 Mopac Files


Mopac input and output files are written as 'free format' internal coordinate files. The free format refers to the fact that data may be entered at any position on a line, there is no fixed column number in which the data must appear to be read correctly. The internal coordinate format consists of lists of bond lengths, angles and dihedrals and the defining atoms which describe the molecular geometry, as opposed to a list of cartesian coordinates for all the atoms. In internal format for any atom (i) there is an interatomic distance in Angstroms from an already defined atom (j), and interatomic angle in degrees between atoms i and j and a previously defined atom (k), where j and k are different, and a dihedral angle in degrees between atoms j,k and a previously defined atom (l), where l is different from j and k. Atom 1 has no coordinates since this is defined as the origin, atom 2 is connected to atom 1 by an interatomic distance only, and atom 3 is connected to atom 1 or atom 2 by an interatomic distance and makes an angle (either 3-1-2 or 3-2-1) with no dihedral angle defined. A mopac file for benzene is given below : am1

C C C C C C H H H H H H

0.000000 1.400211 1.399883 1.400132 1.399876 1.400199 1.103109 1.103110 1.103109 1.103109 1.103109 1.103109

0 1 1 1 1 1 1 1 1 1 1 1

0.000000 0.000000 119.996750 120.005653 119.993233 120.006828 119.998314 120.002739 119.998428 120.002640 119.997231 120.001671

0 0.000000 0 0 0.000000 0 1 0.000000 0 1 0.000000 1 1 0.000000 1 1 0.000000 1 1 180.000000 1 1 180.000000 1 1 0.000000 1 1 0.000000 1 1 0.000000 1 1 0.000000 1

0 1 2 3 4 5 1 2 3 4 5 6

0 0 1 2 3 4 2 3 2 3 4 5

0 0 0 1 2 3 3 1 2 3 4 5

line 1: line 2:

keywords separated by one or more spaces such as : am1 title line up to 60 characters long: blank

92

line 3: second title or comment line up to sixty atoms long: blank line 4 to end : Atom Symbol Dist Opt Angle Opt Dihed Opt

NA

NB

NC

The atom symbol is the atomic symbol for that atom. The dist is the interatomic distance between the current atom and the atom number given in the NA column. Thus atom 2 is referenced to atom 1, atom 7 (which is a hydrogen) is reference to atom 1 also. The Opt columns determine whether this parameter (distance, angle or dihedral) will be optimized by Mopac, with a 1 to optimize and a 0 to not optimize. The angle is the angle formed by the current atom and the atoms NA and NB. Thus atom 4 is referenced to atom 3 for the distance and makes the angle of 120.005 with atoms 3 and 2 ( angle 4-3-2). The dihedral angle is defined for atoms 4-3-2-1 to be 0.0 degrees. For a more complete description of Mopac files and options please see the MOPAC documentation which is available from QCPE or from Serena Software.

15.4 X-RAY Files


Since there is no universally accepted file format for x-ray data, we have written a free format file parser that should allow you to input the data in any format. Our routines expect: Line 1: Line 2: Line 3 to end: Title character string of up to 80 characters Cell Parameters A,B,C,Alpha, Beta and Gamma X,Y,Z coordinates and atom symbol.

Note that these routines do not read symmetry matrices or point groups. Thus we assume the coordinates are orthogonal and that the cell paramters are floating point numbers (including the decimal point). If you understand this then you probably have the necessary programs to output this type of data. A sample x-ray input file is included below. A set of dummy cell parameters to use with a set of Cartesian coordinates would be: 1. 1. 1. 90. 90. 90. (note you need the decimal point, all the cell parameters are read as real numbers). ui120ab.c 1.0 1.0 1.0 90.0 90.0 90.0 0.124381 -0.144487 -0.435695 -1.526023 0.158658 0.185498 0.818900 1.213381 0.125534 0.638991 -1.256924 0.361156 -1.905993 1.082317 -0.230632 -2.152138 -0.671798 -0.115534 -1.491980 0.209408 1.267034 1.053019 1.088258 1.043743

s c o o h h h h

93

16 THE MMX FORCE FIELD


16.1 The MMX Force Field
The MMX force field embodies many years of work by J.J. Gajewski and K.E. Gilbert to extend and improve upon the MM2 force field of N. L. Allinger. The starting point for MMX was the MM2(77) program of Allinger. To this was added the VESCF pi routines from MMPI, and the concept of generalized parameters from C. Still. The goal of this work was to be able to treat more compounds of interest to synthetic and mechanistic organic chemists. For example, such functional groups as radical, cations and anions are not handled in MM2. The current version of MMX recognizes nearly 60 different atom types including radicals, anions, cations, transition metals and transition state atoms. The table below lists the atom types, symbols and a brief description.

ATOM TYPE NUMBERS - MM2/MMX


Type 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 Symbol C C C C H O O N N N F Cl Br I S S+ S S Si LP H C H H P B B H C. C+ Ge Sn Pb Se Description Carbon, SP3 Carbon, SP2 (Alkene) Carbon, SP2 (Carbonyl) Carbon, SP Hydrogen (note special types 21,23,24,28 below) Oxygen (singly bonded) Oxygen (doubly bonded) Nitrogen, SP3 Nitrogen, SP2 of enamines and amides Nitrogen, SP Fluorine Chlorine Bromine Iodine Sulfide (-S-) Sulfonium (>S-+), NOT sulfoxide! Sulfoxide-use double bond to O. Sulfone-use double bonds to O. Silane Lone pair (:) Hydrogen, Hydroxyl (-OH) Carbon, Cyclopropane Hydrogen, Amine (NH) Hydrogen, Carboxylic acid (COOH) Phosphorus [PRELIMINARY] Boron (trigonal) Boron (tetrahedral) Hydrogen (enol) C Radical Carbonium ion Germanium Tin Lead Selenium

94

35 36 37 38 39 40

Te D N S Se Ar

41 N+ 42 O43 B# 44 xx 45 H* 46 O+ 47 P5 48 C49 C* 50 C# 51 C$ 52 C% 53 O# 54 I% 55 N# 56 C 57 C 58 Z1 59 Z2 60 Aq ???? other types Parameters were developed by standard methods, using literature values, interpolated values from known parameters, or chemical intuition, followed by comparison to experimental data for geometries and heats of formation. The complete parameter set is found in the data files MMXCONST.PAR and GENCONST.PAR.

Tellurium Deuterium Nitrogen SP2 of Imine Thion sulfur Selenoxide Se Aromatic Carbon (1.40 bonds between 40-40) (Use this atom type only when conjugation effects are not a concern. If conjugation effects are important, use normal carbon and mark it as a pi atom.) N+ Ammonium nitrogen O- Oxy anion TS Boron Not Used TS Hydrogen Onium ion, also metal bound carbon monoxide Pentavalent phosphorous Carbanion Transition State (TS) Carbon Transition State (TS) Carbon TS Trigonal carbon bonds to C* TS Pentavalent carbon bonds C*, O#, and I% TS Oxygen bonds to C# and C% TS Iodine bonds to C% in SN2 TS Nitrogen bonds to C# Cyclobutane carbon as in MM3 Cyclobutene carbon as in MM3 WILD CARD 2 WILD CARD 3 Spherical water

16.2 Minimization with PCMODEL (hints)


Whenever a molecule is minimized in PCMODEL, an Output window displays the results of the calculation. The upper portion of the Output window displays the total energy, as well as a breakdown by component, dipole moment, heat of formation (Hf) and Strain Energy (SE). The components of the energy are Str (stretching), Bnd (bending), StrBnd (stretch-bend cross term), Tor (torsion), VdW (Van der Waal interactions), and QQ (charge-charge electrostatic interaction). The lower portion of the Ouptut window indicates the dielectric constant that was used in the calculation, whether default or added constants were used, and the current level of printer output (default is minimal). The MMX energy has a correction for 1-3 electrostatic interactions when two non-zero charge atoms are attached to a sp3 carbon. This correction is not made with any other central atom and can be a source of error. If there are no cations or anions in the structure or no charged pi systems, a single point calculation using the DP-DP interaction should give a better energy for polar molecules. A major problem in minimizing structures draw without a template is that hydrogens and lone pairs are often forced into inappropriate positions, resulting in high energy structures. It is a good idea delete then re-add hydrogens (by pressing H/AD on the TOOLS menu twice) then reminimize. Also, for large structures reminimizing is recommended (select MINIMIZE again) to make sure the structure is fully converged.

95

Since minimization is quite time-consuming with large molecules, it is often economical to minimize the structure without added hydrogens to get the three-dimensional structure approximately right prior to adding hydrogens and doing a final minimization. PCMODEL has access to a number of generalized force field parameters (torsional potentials dependent only on the two central atoms) to which the program defaults if no standard MM2 parameters are available. These parameters are not optimal and energies produced with them should be considered accordingly. However, the geometries can be considered fairly accurate. The use of generalized parameters is always noted in the output file PCMOD.OUT. All of the parameters for ligands directly bonded to a metal or for ligands coordinated to a metal assume no charge on the metal. A charge on the metal can be set using the Metal_Coord choice in the Mark Menu. This option should be used only if the electrostatic attraction between ligands and a metal are to be employed and not formal bonding or coordination via the Metal_coord option. Formal charges are not a good representation of charge distribution in metal complexes. The heat of formation calculation will be marked as INC (incomplete) if some bond parameters are missing. The pi contribution to the heat of formation depends on whether the default point charges or the bond dipole interaction are used (NDC=0). The default uses atom contributions and the pi energy calculated; if NDC=0, a bond contribution scheme based on pi order is used. The list of bond types handled by the former option is incomplete relative to the latter; however there is no warning that this is the case. It is important to note that the MMX energy resulting form a minimization with a pi calculation does not contain the potential energy of the pi system. So, a comparison of MMX energies for isomeric pi systems is invalid, just as a comparison of MMX energies is invalid for structural isomers. A comparison of the heats of formation, on the other hand, is appropriate. The only circumstances where MMX energies may be compared directly is with conformers or diastereomers.

16.3 TRANSITION STATE ATOMS


Transition states can be modeled using PCMODEL. The eleven transition state bonds that are recognized are: C* - C* C# - C# C - H* C* - H* C# - B# C* - C$ C# - O# C* - C% C% - I% C% - O# C# - N# e.g. forming bonds in Diels Alder rxn or in Cope rear. e.g. forming bonds in Diels Alder or breaking bonds in Cope. e.g. allylic C-H bond in ene reaction. e.g. forming H-C bond in ene reaction or hydroboration. e.g. forming B-C bond in hydroboration. e.g. forming bond in addition of nitrile oxide sp C to sp2 C. e.g. breaking bond in Claisen rearrangement. e.g. carbon nucleophile in SN2. e.g. leaving group bond in SN2. e.g. oxygen nucleophile in SN2. e.g. breaking bond in an Aza Cope rearrangement.

These transition state bonds are fractional. The user must input a bond order for each when requested by PCMODEL. Note that Diels Alder transition states are "early" and have both bond orders roughly 0.3 for symmetrical addends. Bond orders of 0.3 will mimic the nitrile oxide addition to ethylene transition state calculated by Houk (make sure that the nitrogen has a double bond so that it is a type 37 atom). Cope transition state bond orders depend on substitution, as do all of these as revealed by secondary deuterium isotope effect studies. That is why the user is in complete control of the transition state bond orders. For other pi atoms in pericyclic transition states use C.s, the radical carbons, eg. for C2 and C5 of 1,5-hexadienes in the Cope transition state or for C2 and C3 of butadiene in the Diels-Alder transition state. NOTE: You CANNOT do a Pi calculation on C* or C#; Further, the bonds between the C.s in the transition state are fixed at 1.40 and have k=7.0. PCMODEL generates a set of transition state atom parameters from the fraction bond input by the user. These parameters for C* and C# force 90 degree angles between C*1 C*2 and any other atom as the fractional bond order approaches 0. This angle will spread to 109 degrees as the fractional bond order

96

approaches 1, as will C#-O#. The parameters generated for B# additions attempt to mimic the acute angle between B# and C#-C* with appropriate hydrogen angles and distances. The angle at C$ of a C$-C* starts at 180 degrees for no bond and goes to 120 when the bond order approaches 1. The angles of the bonds to C% start at 109 with C%-I% bond order of 1 and an incoming nucleophile bond order of 0.; they go to 120 when both bond orders to C% are equal, and then return to 109 as the incoming nucleophile -C% bond order approaches 1 and the C%-I% bond order approaches 0. The transition state bond natural bond lengths are determined by the input bond orders according to Badger's Rule cast in Pauling bond order terms: distance = distance of fully formed bond - 0.6 (bond order). ln force constant = force constant for fully formed bond * bond order. The constant 0.6 is twice as large as the Pauling constant for bond orders greater than 1. This value is used because it reproduces the structure of the Diels-Alder transition state calculated by Houk from 3-21G basis set, if the kinetic isotope effects determined at Indiana on the Diels- Alder reaction are used as input bond orders.

16.4 PI CALCULATIONS
The constants in PCMODEL will reproduce acids, esters, amides, eneamines, vinyl ethers, carboxylates, nitro groups, alpha, beta unsaturated aldehydes and ketones, allyl cations, anions, and radicals NOT conjugated with a pi system without having to do a pi calculation on these specific functional groups. However, adjacent pi systems will not "experience" the pi electron effect of these groups unless they are included in a pi calculation. Any diene or material with more than four conjugated pi electron should be subjected to a pi calculation (See hints on specific atoms below). If a pi calculation is done with these atoms, non-delocalized constants are used and are adjusted by the bond orders. Pi calculations must be done on other delocalized moieties such as alpha, beta unsaturated imines, nitriles, etc. The pi calculation can be done as either a closed shell or open shell type so that monoradicals are handled reasonably unlike earlier versions of the MM2 force field such as is found in MMX87, MMPMI, MMP2, or MMP1. The pi portion of the code, which is the R.D. Brown VESCF procedure, has been checked against MMP1 (QCPE 318). 16.4.1 PI ATOMS The pi atoms treated originally are: C sp2 (type 2) C (carbonyl carbon type 3) C sp (type 4) O sp3 (type 6) O (carbonyl oxygen type 7) N (amide-like type 9) N (nitrile type 10). The additional new pi atoms are: C. C+ N+ ON C(type 29) (type 30) (ammo-& immonium type 41) (type 42) (imine type 37) (type 48).

97

MMP1 and MMP2 will not recognize any of these atoms. Pi calculations cannot be performed on third row elements like S or P, nor can any of the halogens be included. Compounds with twisted or pyramidalized monoolefins should be submitted to a pi calculation for comparison purposes.

16.4.2 HEAT of FORMATION of PI SYSTEMS


The contribution of the pi system to the heat of formation depends on pi bond orders. First the number of double bonds in a non-charge-separated valence bond structure is determined from the number of filled MOs minus the appropriate number of electrons from any type 6, 7, 9, 29, 42, and 48 atom. The number of each type of double bond is multiplied by the contribution each makes to the heat of formation as described in MM2, e.g. 22.8 for each C=C, -24.5 for each C=O, 31.0 for each C=N, etc. Bonds involving a type 6, 9, 29, 30, 42, or 48 atom have a contribution of a constant which is roughly that if the electron(s) were not delocalized, e.g. 0. for 03-09, -16.7 for 02-06 and for 03-06, 12. for 02-48, etc. In MMX for pi atom pairs of the latter description, the pi bond order multiplies a negative constant unique to each so as to decrement the heat of formation; this seems reasonable-higher bond order, lower energy. However, for pi atom pairs not in the latter category, a pi bond order between 0.4 and 0.85 will decrement the heat of formation by a constant characteristic of the atom pair; the decrement falls of as rho2 when rho is less than 0.4 and as (1-rho)2 when rho is greater than 0.85. It is this function which allowed the use of bond orders for the heat of formation calculation; however, it does poorly for larger >C8 carbocyclic pi systems. The calculation of the heat of formation of the pi system depends on whether only type 2 (olefinic), type 29 (radical), type 30 (cation), and type 48 (carbanion) carbons are present or (& heteroatoms without a charged atom or C.) or whether N+, O- or C.,C+, or C- and heteroatoms are present. In the former cases with just trigonal carbon, the sum of pi bonds multiplies 90.2 kcal/m (the strength of a strainless Csp2-Csp2 bond). To this is added the compression energy using 1.51 as the strain free value ( the quadratic constant is 517 kcal/ang*2). To this is added the Pople pi energy (a negative quantity); then the contribution of each pi atom to the pi portion of the heat of formation is added (these values are 92.8 kcal/m for a pi atom attached to one other pi atom; 141.95 for a pi atom attached to two others; and 189.0 for a pi atom attached to three others). A similar parameterization applies to other cases within the first category. In the latter cases, each C=C contributes 11.2 kcal/m to the heat of form.; then the compression energy is calculated using 400 kcal/m as the constant. Finally, the difference in the total bond order of the pi system relative to that of the number of classical pi bonds is multiplied by 35 kcal/mol to obtain the pi stabilization. The contribution from the other bonds is similar to that in MMPMI where the bond orders key the contributions to the heat of formation. Generally the heats of formation are good to 2 kcal/mol, but it is unclear where large deviations might occur.

16.5 TRANSITION METALS


Note that many basic atoms can be docked (not attached with a bond that appears in the connected and attached atom lists) with their lone pair to a metal. Alternatively, atoms can be attached with a bond replacing the lone pair, and despite the increase in positive charge at the atom, the neutral atom types above should be used, e.g. M-N tetravalent (use type 8 not type 41). TYPES of BONDS to Metals that MMX will recognize (Use the Atom Types option in the LABELS of the VIEW Menu to make sure that the proper atom types are used): to NITROGEN/ M <- :N- type 8 amino, can dock for automatic Hs & lps or specify all four bonds \ including Hs (no lp). M - N< type 9 amido, to coordinately saturated M, specify 0944 bond. no lone pair

98

M - N-C= type 9 amido, N next to unsaturated carbon, specify 0944 bond, no lone pair \ on N. M <-:Nsp type 1 nitrile or nitrogen coordinate or specify all bonds (no lp), MMX recognizes this relative to nitride below. M N: sp type 10 M = N+< type 41 M <-:N= type 37 \ nitride amido, to coordinately unsaturated M, specify 4144 bond. imino, coordination-requires docking; or specify all four bonds including metal.

M -N:= type 37 imido, to coordinately saturated M, eg. bent nitrosyl, specify 3744 bond, leave lone pair on N. M = N+= type 41 imido, to coordinately unsaturated M, eg. linear nitrosyl, specify 4144 bond, MMX will recognize the difference between this and amido group. M - N: sp type 10

to OXYGENM - O - R type 6 M = O:: type 7 M - Otype 42 two lone pairs on O, coordinately sat'd metal. oxo, two lone pairs on O. two lone pairs on O-, coordinately sat'd metal.

M <-:O< type 6 coordination between 44 and lone pair on type 6, can dock with ether and alcohol oxygen; alternatively, specify three bonds and lone pair. M - O:=C type 7 + M = O- type 46 one lone pair on O, MMX recognizes this and oxo one lone pair on O+, coordinately unsat'd metal.

to HALOGENM - F, M - Cl, M - Br, M - I, can be terminal or bridging, specify each bond.

to HYDROGENM - H type 5 terminal hydride. M - H* - M type 45 bridging hydride.

to CARBON/ M - C- type 1 sigma alkyl, specify 0144 bond. \

99

M - C= type 2 sigma vinyl, specify 0244 bond. M - C<| type 22 M - C=O type 3 sigma cyclopropyl, specify 2244 bond. sigma acyl & bridging carbonyl, specify 0344 bond.

M - CO+ type 4 terminal carbonyl or terminal cyano, specify 0444 bond; Note that a terminal carbonyl has a triple bond between C & O so that the O is really O+ PCMODEL will recognize this if a neutral O is used. M - C+< type 30 M - C:< type 48 / M <-:C- type 48 \ metal carbene, specify 3044 bond. specify 4448 bond, possible bridging with lone pair.

coordination only.

to SULFURM -:S:- R type 15 M = S:: type 38 two lone pairs on S, coordinately sat'd metal. two lone pairs on S.

M <-:S:< type 15 coordination between 44 and lone pair on type 15, can dock for thioether and thiol sulfur or specify three bonds to S- one lone pair. M = S:+- type 16 one lone pair on S+, coordinately unsaturatedd metal.

to PHOSPHOROUS/ M <-:P- type 25 \

coordination to lone pair or specify all four bonds (no lp).

to METALSM - M 4444 bond MMX will alter natural bond distances if bond order is specified.

COORDINATION is possible with the following ligands: / :N- type 8 \ / :P- type 25 \

::O< type 6

::O= type 7

:N type 10 /

::S< type 15

100

:C- type 48. \ Pi coordination also from types 2, 4, 29, 30, 40, and 48 carbons.

101

Das könnte Ihnen auch gefallen