Professional Documents
Culture Documents
SYBYL® 7.0
Fall 2004
This material contains confidential and proprietary information of Tripos, Inc. and third parties
furnished under the Tripos Software License Agreement. This material may be copied only as
necessary for a Licensee’s internal use consistent with the Agreement. The allowed use includes
printing of hardcopy versions hereof as minimally necessary for Licensee’s internal use. Neither
Tripos, Inc., nor any person acting on its behalf, makes any warranty or representation, expressed
or implied, with respect to the accuracy, completeness, or usefulness of the material contained in
this manual or in the corresponding electronic documentation, nor in the programs or data
described herein. Tripos, Inc. assumes no responsibility nor liability with respect to the use of this
manual, any materials contained herein, or programs described herein, or for any damages
resulting from the use of any of the above. Except for printing of hardcopy versions as stated, no
part of this manual may be reproduced in any form or by any means without permission in writing
from Tripos, Inc., 1699 South Hanley Road, St. Louis, Missouri 63144-2917, USA (314-647-1099).
Selected software programs for Comparative Molecular Field Analysis (CoMFA) and HQSAR
methodologies contained or documented herein are covered by one or more of the following
patents: US 6,208,942; US 5,025,388; US 5,307,287; US 5,751,605; AT E150883; BE 0592421; CH
0592421; DE 691 25 300 T2; FR 0592421; GB 0592421; IT 0592421; NL 0592421; SE 0592421.
SYBYL, UNITY, CoMFA, CombiFlexX, GeneFold, LeapFrog, LITHIUM, OptDesign, Alchemy, and
ChemPrint are registered trademarks of Tripos, Inc. AUSPYX , CAPRI, ChemEnlighten, CScore,
DISCOtech, Distill, FlexiDock, GASP, HiVol, HQSAR, Legion, MOLCAD, Molecular Spreadsheet,
OptiDock, OptiSim, PD Triplets, ProTable, RECEPTOR, SARNavigator, Selector, SiteID, Tuplets,
TRIAD, and Tripos Bookshelf are trademarks of Tripos, Inc.
Chapter 1.
Introduction to SYBYL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Chapter 2.
The Very Basics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.1 Start SYBYL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.2 How to Get Help . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.3 Work with Molecules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.4 Undo Last Operation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
2.5 Exit SYBYL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Chapter 3.
Sketch and Modify Molecules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
3.1 Small Molecule Sketching Tutorial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
3.2 Access the Sketcher . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.3 Modify Molecules Outside of Sketcher . . . . . . . . . . . . . . . . . . . . . . . . . . 47
3.4 Create/Modify Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Chapter 4.
SYBYL Objects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
4.1 Glossary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
4.2 Identify Atoms, Bonds, etc. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
4.3 Create Expressions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
4.4 Sets of Objects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Chapter 5.
Selection of Atoms/Bonds/Sequences . . . . . . . . . . . . . . . . . . . . . . . 109
5.1 Selection via the Expression Dialogs . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
5.2 Atom Selection Tutorial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
5.3 Bond Selection Tutorial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
5.4 Substructure Selection Tutorial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
Chapter 6.
Molecule Databases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
6.1 Database Tutorial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
6.2 Open/Close Databases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
6.3 Obtain Information on Databases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
6.4 Retrieve Molecules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
6.5 Managing Database Content . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148
6.6 Save Database Molecules to MOL2 or MOL Files . . . . . . . . . . . . . . . . 153
6.7 Connect to External Relational Databases . . . . . . . . . . . . . . . . . . . . . . . 154
6.8 DATABASE Command List . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
Introduction to SYBYL
Menubar
A menu item can be a simple command or a check box; it can also lead to a
submenu or a dialog. Additional categories may be available if you have
licensed special modules.
By default, all options are shown in the menus. Any options for which you do
not have a license are greyed out (these most likely will occur in the Tools
menu). The environment variable TA_HIDE_UNLICENSED_PRODS controls
whether or not the unlicensed options appear in the menus. To hide unlicensed
options, type the following before starting SYBYL:
setenv TA_HIDE_UNLICENSED_PRODS TRUE
Toolbox Icons
The SYBYL tools used to interact with the graphics are represented by icons
along the left edge of the SYBYL screen.
Descriptions of the individual icons can be found in the Toolbox Icons section
of the Graphics Manual. To activate an icon, place the pointer on it and press
the left mouse button. In cases where a dialog is displayed, you can close the
dialog by pressing the Q button.
Textport Window
The textport window enables you to enter any command, including HELP.
Review the Tripos Bookshelf for a complete listing of all SYBYL commands
available. After entering a command, the prompt is again printed on the
terminal.
Note: When running SYBYL, both menu selections and command line entry
are available simultaneously.
The default molecule area (or work area) is the area targeted by operations. In
many cases, the default molecule area of M1 works just fine. Once in a while,
however, a number of operations may need to be performed on a structure in a
different area (e.g., M3). You could specify M3 for each operation. It may be
much easier, though, to change the default area, while working on that structure,
to M3.
Index Citations
Connection to the Tripos web site Articles
Full color pictures Work flow charts
FAQs PDF library
Search functionality
Menubar: The Help menu is on the far right side of the menubar.
• Help >>> On Help—Activate Tripos Bookshelf and
provide information about SYBYL’s help system.
• Help >>> Start Bookshelf—Display Tripos Bookshelf’s
main page. From here you can view documentation
regarding every module of SYBYL.
Note: Tripos Bookshelf depends on a browser. To prevent
SYBYL from spawning a browser window, type TAILOR SET
HELP USE_BROWSER NO in the textport, before starting the
help system within SYBYL. This disables all help buttons
within SYBYL. (To always disable the buttons when you log
into SYBYL, modify the .sybylrc file and add: tailor set
help use_browser no ||.
Command In the textport, type HELP and press return to activate the Tri-
Line: pos Bookshelf and display information about SYBYL’s help
system.
In the textport, type HELP <a SYBYL command/dialog> to dis-
play information on the specified command/dialog. For exam-
ple:
• HELP GRAPHICS—Displays information on the GRAPHICS
command.
• HELP ANNOTATE_DIALOG—Displays information on the
Annotate dialog.
To subcommands available are:
• ~HISTORY—Displays a numbered list of all help topics
viewed since invoking help. Use these numbers with the
RECALL command.
• ~RECALL {history_number—Redisplays help on any
item in the history list. history_number’s value is obtained
using the HISTORY command.
Additional Information:
• Libraries of Chemical Groups and Fragments on page 187.
Example
¾ Select 1,3-DIOXANE.
¾ Press OK.
The molecule loads into SYBYL’s display area 1, also known as D1. Its
default location is in molecule area 1, also known as M1.
If additional molecule areas are used, they simply recycle through the display
areas:
• M5 is always in display area D1.
• M6 is always in display area D2.
• M7 is always in display area D3.
• M8 is always in display area D4.
¾ Press .
The Display Options dialog is displayed. This dialog enables you to modify
the various look of your display area and molecules.
¾ In the Display Options dialog, select Sticks.
The lines making up your molecule become thicker and the color codes are
more defined. From now on, if you load any molecule in SYBYL, it will
load in Sticks mode.
¾ Select 1,2,4-Trioxolane.
¾ Press OK.
¾ Press OK.
You have now placed two molecules within SYBYL (1,3-Dioxane and
1,2,4-Trioxolane). The two molecules reside in two unique molecule areas
(M1 and M2).
The Read File dialog uses the filename as the default for the molecule name
when reading a single file. If any form of molecule name is present in the input
file, that name is used instead.
Files Select a file from the list. Its name appears in the Files
to read field.
Files to read File selected in the list is shown here with its path.
Alternately, type the name (and path) of the desired file
in this field.
File Type Selecting a file type from this menu reduces the list of
files presented in the Files list.
Molecule Select a work area from the list. If a work area is
needed, SYBYL automatically selects the first available
one. If the file type selected does not require a work
area, this field is inactive.
¾ Specify Format.
Multiple structures can be selected from the list and saved to a MOL file,
MOL2 file, SD File, or SLN file. Use the Select All, Invert, and Clear buttons
to manipulate the selections in the molecule list.
The filename is used as the default for the molecule name when reading a single
file. If any form of molecule name is present in the input file, that name is used
instead.
MOL2:
• MOL2 files are ASCII files containing all information necessary to
reconstruct the molecule. The format is based upon the convention of a
keyword for each type of data needed to reconstruct the molecule,
followed by a group of records. (See the MOL2 File Format chapter in
the Toolkit Utilities Manual.)
• MOL2 files are used by SYBYL to store molecules resulting from many
batch minimizations (ANNEAL, MAXIMIN2, and MULTIFIT), LATTICE
generations, BIOPOLYMER CONSTRUCT_BACKBONE, and other computa-
tions.
• The MOL2 command is functionally equivalent to the MOL command.
However, MOL2 is not affected by the command TAILOR SET MOL
FILE_FORMAT and, therefore, always writes files in MOL2 format.
MOL:
• A MOL file does not preserve the complete information available with
molecules in SYBYL 6.x and above. It is provided for compatibility with
existing user programs. Data written to a MOL file include atoms and
MREAD:
• The MREAD command is functionally equivalent to the MOL command. It
has been kept for compatibility with previous versions of SYBYL.
Additional Information:
• Record Output from a Single Command on page 173.
• Measure Intra- and Intermolecular Distances/Angles on page 26.
• BIOPOLYMER MEASURE to conveniently measure omega and zeta angles.
• BIOPOLYMER CHECK_GEOMETRY to report deviations from standard
geometry.
In the atom, bond, and substructure list, an asterisk (*) in the column following
the ID indicates that the object belongs to an internal ring (i.e., a ring totally
contained within a substructure), whereas an “at” sign (@) indicates an external
ring (i.e., a ring which spans substructure boundaries). Substructures cannot
participate in internal rings but they can be members of external rings.
An asterisk (*) in the column after the ID of sets indicates that the set is defined
and managed by the system.
Additional Information:
• Record Output from a Single Command on page 173 to copy the listing
into a file.
• TAILOR SET GENERAL ATOM_IDENTIFIER to alter the characteristics
of atom listings.
The full generality of the object expression syntax can be used to determine
which objects to include. The PRINT command writes out the file SYBYL-
PRINT.LIS and submits it to lpr for printing.
In the atom, bond, and substructure list, an asterisk (*) in the column following
the ID indicates that the object belongs to an internal ring (that is, a ring totally
contained within a substructure), whereas an “at” sign (@) indicates an external
ring (a ring which spans substructure boundaries). Substructures cannot partic-
ipate in internal rings but they can be members of external rings.
An asterisk (*) in the column after the ID of sets indicates that the set is defined
and managed by the system.
Additional Information:
• TAILOR SET GENERAL ATOM_IDENTIFIER to alter the characteristics
of the listings.
2.3.4 Copy/Paste/Merge
• Copy/Paste Contents between Molecule Area and Clipboard on page 24
• Copy Contents To Different Molecule Area on page 24
• Copy Atoms and Associated Data Structures To Different Molecule
Area on page 24
• Merge Copy of Atoms and Associated Data Structures Into Molecule
Area on page 24
Makes an exact copy of all contents (properties, colors, and associated displays
(backgrounds)) of one work area into another. Origin molecule is unaltered.
Previous contents of target area (if any) are moved to the recovery stack for that
area.
Copied structures replace contents of target area. All local set definitions
associated with extracted atoms are copied to new area as well. Origin area is
left unchanged.
If atom_expr does not include a work area specifier, the default work area is
used.
If atomic charges are present before extraction, atoms in target molecule still
bear same charges. However, they are marked invalid, and will not be used on
subsequent SYBYL operations. Validate these charges manually via the
command CHARGE mol_area VALIDATE YES.
Merge Copy of Atoms and Associated Data Structures Into Molecule Area
If atom_expr does not include a work area specifier, the default work area is
used.
Any set definitions, with names and types identical to those present in the target
work area, are merged into the target area.
An atom with the same type and coordinates as an atom in the target area is
considered to be non-unique (identical). If coordinates are the same but atom
types are different, the atom is still considered to be unique and is merged into
the target area. A message states that atoms of different types have the same
coordinates.
Non-unique atoms may be merged into the target area when the target atom,
identical in coordinates and type to an atom in the origin area, does not have
enough unfilled valences to make bonds with the atoms being merged. The non-
unique atom is included in the merge so that these bonds are successfully added.
A message states that atoms have the same coordinates due to lack of free
valences in the target area atom.
If molecules have been rotated and/or translated, use FREEZE to transform the
coordinates before invoking MERGE.
If atomic charges were calculated for either molecule before the merge, atoms
in the target molecule will bear the same charges. They are marked “possibly
invalid” but can be used in SYBYL by setting the Charges option menu to
Use Current in the Energy dialog. Note that the MMFF94 force field always
recalculates its own set of charges.
Additional Information:
• TAILOR SET MERGE to alter the characteristics of merging.
1. Use the Mouse Focus Option dialog to differentiate between the two molecules
that were previously displayed:
The display area is set to Global by default when you start a SYBYL session.
All images on the screen — molecules and backgrounds — are affected simulta-
neously by rotations and translations.
¾ Rotate the molecules by pressing the right mouse button and moving
your mouse in any direction.
Notice that both molecules move. This is because your current molecule
setting is on Global (global).
¾ Press the middle mouse button and move 1,3-DIOXANE to the upper
right corner of SYBYL.
Additional Information:
• BIOPOLYMER MEASURE to measure omega and zeta angles.
• TAILOR SET GENERAL ANGLE_RANGE to specify how an angle range
should be displayed.
Intra-/Intermolecular Measurements
Angles
Menubar: Analyze >>> Measure >>> Angle
Command Line: MEASURE ANGLE {atom1 atom2 atom3}
Loops until you type the end-loop character (|).
UIMS Variable: measure_angle
Distances
Menubar: Analyze >>> Measure >>> Distance
Command Line: MEASURE DISTANCE {atom1 atom2}
Loops until you type the end-loop character (|).
UIMS Variable: measure_distance
Height of Atoms Above Plane
Menubar: Analyze >>> Measure >>> Height Above Plane
Command Line: MEASURE HEIGHT atom_expr plane_name
• atom_expr—Atoms whose height is to be measured.
• plane_name—Name of plane, in default work area, to
use. Use LIST PLANE to find names of defined planes.
UIMS Variable: measure_height
Torsion Angles
Menubar: Analyze >>> Measure >>> Torsion
Command Line: MEASURE TORSION {atom1 atom2 atom3 atom4}
Loops until you type the end-loop character (|).
UIMS Variable: measure_torsion
Option:
UIMS2 Variables:
The following variables are assigned values:
ZAP deletes molecules and their associated data structures from program
memory. It clears the molecule area. All associated display structures (e.g., dots,
ribbons, …) are removed from the graphics screen as well.
MONITOR pairs are not saved and, therefore, are lost if RECOVER is executed.
This is because MONITOR pairs may involve more than one molecule area.
SAVE mol_area
Additional Information:
• SET AUTOSAVE for information on automatic saving of molecules on the
recovery stack and for an explanation of the operation (in the Graphics
Manual).
• Sketch Molecules
• Small Molecule Sketching Tutorial on page 34
• Access the Sketcher on page 41
• Modify Molecules Outside of Sketcher on page 47
• Atoms on page 47
• Bonds on page 53
• Group on page 56
• Substructures on page 56
• Molecule’s Center of Rotation, Name, Type, etc. on page 58
• Combine Two Molecules From Different Molecule Areas on page 59
• Ring Fusion Tutorial on page 60
• Combine Two Molecules or Groups in Same Molecule Area on page
66
• Chirality on page 66
• Adjust Bond Lengths and Angles to Match Standards on page 69
• Scan Torsions to Remove van der Waals Contacts on page 69
• Create Molecule by Averaging Existing Molecules on page 70
• Create/Modify Features on page 71
3.1.1 Preface
In this tutorial, you will build and minimize Atropine by building the most
complex ring system first an then adding the substituents. Typically, molecular
fragments from the Standard Fragment Library are used to quickly construct
ring systems with good geometry. However in order to better demonstrate
SYBYL’s sketching capabilities, you will use the Sketch Molecule menu items
to construct and optimize the most complex ring system.
3.1.2 Set Up
1. Clear the SYBYL screen of all molecules and background images.
¾ If there are more than one molecule on the screen, click All, then OK
in the Molecule Expression dialog.
¾ View >>> Delete All Backgrounds
¾ Set your display to Full screen mode with the Display Options
graphics tool.
2. Display a grid to aid in building the molecule to scale. The spacing between
grid points is 1.54Å. (This is the sp3 carbon to sp3 carbon bond length.) The
grid scales with the molecule to always show the correct bond length.
¾ Pick a point above this atom and about one grid spacing to the right
(see atom 2 in the figure below).
A bond is drawn to the newly created second atom.
¾ Use the right mouse button and rotate the molecule about the X axis
until it has an orientation similar to that shown below.
¾ Select N in the ring and then pick a new screen location that is
somewhere above the ring.
¾ Select atom 4 and move it below the ring by the same amount.
Do not be concerned if the structure sketched is not a perfect chair. The ring is
optimized later.
¾ Pick a point below this atom and then another point diagonal to the
new atom.
¾ Pick atom 6 to close the ring.
¾ Press End.
There is an initial setup period while the minimizer parameters are being read
from the database. Energy values are then printed in the text window after each
iteration. The molecule display on the screen is not updated until the end of the
minimization in order to reduce execution time.The minimization is complete
when the “Existing atom or new point:” prompt returns in the textport window.
¾ Rotate the molecule until its orientation is similar to that shown in the
figure below.
¾ Select O from the atomic symbol menu and then pick each of the
three atoms.
The atoms are labeled with an O and colored red to reflect the change.
¾ Select Chiral on the Sketch Molecule menu and select atom 11.
3.1.8 Clean Up
1. Since the ring system has been already optimized, use the 4_SCAN option,
which involves non ring bonds only, to clean up the model. (Note that any clean
up option from 1 to 6 includes all options preceding it in the list, therefore, all
non ring bonds have their bond lengths and angles adjusted, and the torsion
angles are scanned and adjusted to relieve bad contacts.)
¾ Select Tailor on the Sketch Molecule menu.
¾ Press End.
¾ Press Save.
A file named atropine.mol2 is created in the current directory.
When you work on your own research, you can use the same technique to save
your molecules. To view them again, use the Read File dialog.
This concludes the small molecule sketching tutorial. We suggest that you
repeat this tutorial on your own using molecules from your own research.
Additional Information:
• CONCORD for fast conversion of 2D coordinates to 3D.
• TAILOR SET GRID to customize the displayed grid.
Although flexible enough to enable building any structure, the sketcher does
have enough chemical sense to warn you when something unnatural has been
done. For example, if the valence of an atom is about to be exceeded, a message
is issued to inform you of this aberration. It is then your decision to continue or
not.
Only atomic symbols are used to designate atom types in order to eliminate the
burden of having to decide the proper SYBYL atom type.
Atom Types
Branching
To draw an atom not connected to the last atom displayed, a pen up action must
be signified by choosing this last (highlighted) atom again. The highlight is
removed and continuous draw mode is temporarily shut off. Choose a new point
of attachment. Once the new attachment point is selected, the continuous
drawing mode is automatically turned on and a new chain of atoms can be
added as described above.
the molecule maintains its current geometry. This option may be disabled by
setting TAILOR SET SKETCH AGGREGATES to OFF. In this case, the whole
molecule is considered in the cleanup phase.
Heteroatom labeling and molecule color, coded by atom type, are the defaults
used by the sketcher to make different atom types easily identified. Labeling
can be toggled off and a different coloring scheme can be chosen if you desire.
Multiple Bonds
The same strategy can be repeated for sketching triple bonds. Aromatic bonds
are designated by alternating single and double bonds within the ring.
Rings
Z Coordinate
To add a third dimension to the molecule, apply rotations to the model either
interactively on graphics workstations or via one of the Rotate_85 options on
static terminals. Once an atom has a Z-coordinate, any subsequent atoms
attached to it are drawn in the same Z-plane. For example, if a rotation precedes
the Move option, the atom being moved is drawn in a different plane from the
rest of the molecule. Many different conformations of a molecule can be
achieved with this method, such as chair or boat cyclohexane.
3.3.1 Atoms
• Add Atom to Existing Structure on page 47
• Add Atom Without Attaching to Any Present Structure on page 48
• Add Chain of Atoms of Same Type on page 48
• Fill Empty Valences on page 48
• Add Pseudoatoms (Centroids) on page 49
• Modify an Atom on page 49
• Remove Atoms or Atom Attributes on page 53
Coordinates are automatically determined, based on bond length and bond angle
data from parameter tables.
Chains are attached to existing structure with ideal geometry. Coordinates of all
atoms are determined from the parameter tables.
Bond lengths and angles are set to standard values determined from the
parameter file.
Pseudoatoms (centroids) are added to prochiral, methyl, and phenyl ring groups.
They are often used for defining constraints to the prochiral, methyl, or
aromatic protons. Constraints are needed when you do not have stereospecific
resonance assignments for prochiral atoms (or methyl pairs, such as in leucine
and valine), or when fast motions are present, such as methyl rotors and
aromatic ring flipping.
The dummy atom’s name at the centroid position is defined according to the
nomenclature first presented in Kurt Wüthrich, NMR of Proteins and Nucleic
Acids, J. Wiley and Sons, 1986. For example, the beta methyl group on alanine
is named “QB”.
Modify an Atom
Alternate atom types can come from two sources: the MACROMOL dictionary
(also referred to as the default source) or user input. Alternate atom types are
needed for energy calculations using non-Tripos force fields. MODIFY ATOM
OTHER_TYPES displays or lists alternate atom types from either source, and
allows input of user-assigned alternate types. User-assigned types are stored
with the molecule, and take precedence in force field calculations over
dictionary-supplied alternate atom types. Currently supported alternate atom
type sets are:
• Kollman all-atom (KOLL_ALL) and Kollman united-atom (KOLL_UNI)
force fields. A list of Kollman atom types is provided in the Kollman
Force Field section in the Force Field Manual. Note: Alternate atom
types must be defined for both KOLL_UNI and KOLL_ALL force fields for
energy setup to work. If the atom type is defined for only one, the
ENERGY, MAXIMIN2, and DYNAMICS SETUP commands all fail with an
error condition.
• AMBER7 FF99 (AMBER7_FF99) force field, which is essentially
AMBER95 atom types with a few types added. See the AMBER7_FF99
Force Field section in the Force Field Manual.
• AMBER7 FF02 (AMBER7_FF02) force field, which is essentially
AMBER7 FF99 atom types with different charges and polarization
included See the AMBER7_FF02 Force Field section in the Force Field
Manual.
• AMBER 95 (AMBER95_ALL) force field.
• Changing an atom type to a Kollman or AMBER atom type requires a
“BIOPOLYMER” license. The same is true for viewing labels for
Kollman or AMBER atom types.
• MMFF94 force field. A list of MMFF94 atom types is provided in the
MMFF94 Force Field theory section in the Force Field Manual.
Additional Information:
• The Load Charges section of the Biopolymer Manual to load atomic
charges and alternate atom types from the dictionary.
All bonds involving that atom are removed and any features (normal, plane,
constraint) attached to that atom. Atoms and bonds are renumbered to reflect
removal of objects from the molecular description. If removed atom is a
member of a static set, set membership is updated. (REMOVE ATOM * is equiv-
alent to ZAP for the molecule.)
The bond between the specified atoms is deleted along with all atoms on the
target side of that bond. Note: The Split functionality cannot be used if the
indicated bond is in a ring. The ring must first be broken by removing a bond or
an atom.
Attributes can be removed from one or more atoms without deleting the atom
itself.
3.3.2 Bonds
• Add Bonds on page 53
• Modify a Bond on page 55
• Remove a Bond or Bond Attributes on page 55
Add Bonds
The bond type of the new bond is set by the atom types at its endpoints. If there
is ambiguity regarding the bond type, a prompt asks for the resolution. Atomic
positions are not altered by adding a bond.
Single bonds may also be added using the Quick Bonds functionality. A single
bond is added between two atoms if the distance between them is within an
acceptable range. This is particularly useful for PDB files containing discon-
nected HETATM records.
A bond is added between two atoms A and B if the distance between them,
DISTAB, is within acceptable range:
The asymmetry of the acceptance window (Tol_Neg > Tol_Pos) allows for
alkynes (Ideal_Bond = ~1.15 Å) and certain short aromatic C-C bonds to be
recognized as bonds without making chemical oddities from non-covalent
intramolecular hydrogen bonding patterns.
Additional Information:
• TAILOR SET CONNECT to alter the characteristics of the connectivity
determination.
Modify a Bond
When changing the length, angle, or torsion, coordinates of last specified item,
and all atoms attached to it, are altered. The atoms must be connected to form a
bond, angle, or torsion, respectively. If the bond, angle, or torsion is in a ring,
an error is reported and no alteration is made.
Features (rotatable bonds) attached to the deleted bond are removed as well.
Bonds are renumbered to reflect removal of objects from the molecular
description.
Attributes can be removed from one or more bonds without deleting the bond
itself.
3.3.3 Group
A chemical group can be added to the molecular structure and their geometry
determined from the parameter tables. The atoms in the group and their relative
positions are read from the Group Library (refer Group Library Structure and
Contents on page 187).
The permission on the Group Library should normally be read only to prevent
accidental modification. However, the Group Library is user expandable, and
before adding a group to this library, you must change the permission on the file
$TA_DATA/GROUP to allow writing:
chmod +w $TA_DATA/GROUP
Set the permission back to read only after the group has been added:
chmod -w $TA_DATA/GROUP
See Group Library Structure and Contents on page 187 for details.
3.3.4 Substructures
The two structures do not have to be cyclic. At least two atoms must be selected
in each molecule, more atoms help direct fusion of non planar bonds. Terminate
the input list with the end-loop character (|).
The fused structure is placed in the molecule area of the first atom of each pair.
Coordinates of atoms used for the fusion are taken from the first molecule.
Bonds directly connecting fusion atoms in each molecule are discarded. An
attempt is made to retain all other bonds in both molecules. If the atomic
valence of the fusion atom is exceeded, any Hs attached to fusing atoms are
discarded and the fusion rechecked. If the operation still fails, an error is
reported and the command terminated. You must then discard enough atoms to
make fusion legal. If the operation succeeds, Hs are replaced to fill valences of
atoms from which they were removed.
Spiro fusions cannot be specified by selecting a single atom pair. They can be
specified by adding Hs and indicating the fusion of an internal bond in one
molecule with the bond involving the H atom.
Additional Information:
• TAILOR SET FUSE to select default parameter set to alter character-
istics of the fusion.
Set Up
¾ If there are more than one molecule on the screen, click All, then OK
in the Molecule Expression dialog.
¾ View >>> Delete All Backgrounds
2. Set the screen to quartered mode using the icon (press Q).
The fusion of two planar systems requires only two pair of atoms. In this
example, furan (M1) is fused to pyridine (M2) and the resulting model appears
in M2.
1. Read in the two fragments, color them by atom type and label both structures.
¾ Build/Edit >>> Get Fragment
¾ Click on 6 in pyridine (M2), and then 2 in furan (M1) to form the first
pair.
¾ Click on 5 in pyridine, and then 3 in furan to form another pair.
You can specify a spiro fusion by first selecting the two atoms that become the
spiro center. The second pair involves a ring atom in one molecule and a
hydrogen in the other.
1. Use only two pair of atoms to fuse two non planar bonds. The ambiguity is
resolved automatically by selecting the alternative which gives rise to the best
fusion geometry. The results are displayed in M2.
The selection of two atom pairs is usually insufficient for this type of fusion
since the resulting fused structure may not have the desired configuration.
¾ Build/Edit >>> Get Fragment
2. Use three pair of atoms to fuse two non planar bonds. As in the previous
attempt, the ambiguity is resolved automatically by the program. The results are
displayed in M3.
¾ Build/Edit >>> Get Fragment
3. Use four pair of atoms to fuse two non planar bonds. The results of the fusion
are displayed in M4.
Manual fitting of the two bonds to be fused reveals that the torsional angles of
the four atoms involved are 60° in piperidine and -60° in tetrahydropyran. In
order to produce a geometry better suited for the cis fusion you want to perform,
tetrahydrofuran is inverted first.
¾ Build/Edit >>> Other Tools >>> Invert
1. Fuse benzene (M1) to hexahydroazepine (M2) and the results displayed in M1.
Notice the poor quality of the geometry at the ring fusion and the fact that extra-
neous hydrogens are left over from the hexahydroazepine. Some clean up and
minimization would be necessary before this model could be used.
2. Fuse hexahydroazepine (M3) to benzene (M2) and the results displayed in M2.
Notice the poor quality of the geometry at the ring fusion. Here too, a minimi-
zation would be required.
3. A simple strategy to assure the quality of the fusion and reduce the need for
minimization is to ascertain that the geometry of the bonds to be fused is
identical in both molecules before the fusion occurs.
Fuse benzene (in M3) with tetrahydroazepine (in M4) and display the results in
M4.
¾ Build/Edit >>> Get Fragment
Notice the quality of the geometry of the atoms at the ring fusion. This approach
where the bonds to be fused are both planar gives the best result.
The length of the new bond is determined by the type of the atoms being joined
and is taken from a table of standard bond lengths. The two atoms specified are
eliminated by the join operation.
Groups being joined may be in same molecule area or in different areas. In the
latter case, atoms to be joined are copied into the target atom’s molecule area
and then the bond formed.
Additional Information:
• Bonds on page 53 to connect two atoms.
• MERGE to move a copy of specified atoms into a new work area.
• TAILOR SET JOIN to customize the parameters for joining.
3.3.9 Chirality
• Determine Chirality on page 67
• Invert Chirality on page 68
• Reflect Atoms Through a Plane on page 68
Determine Chirality
Both the chirality of an atom and the cis-trans isomerism about a double bond
can be determined by using a set of rules developed by Cahn, Ingold and Prelog
and adopted by IUPAC (See J. Org. Chem., 35, 9, 2849, (1970)). These rules
govern the sequencing of substituents about the chiral atom or double bond.
Once the substituents are assigned a priority, simple geometric algorithms
determine which type of isomerism is present. These same rules are used to
determine the prochirality of an atom.
Atom chirality attributes are used by the expression generator %sln() when
converting a SYBYL molecule to an SLN string. The chirality will then be
expressed as [S=N] or [S=I] (see the Tripos SLN Manual for details). The bond
stereo attributes are also used by the expression generator %sln().
Additional Information:
• TAILOR SET CHIRAL to alter the method for calculating chirality.
Invert Chirality
If all atoms in the molecule are to be inverted, the entire molecule is inverted by
reflecting the X-coordinate through the YZ plane. Otherwise, each individual
tetrahedral atom is inverted by exchanging two substituents. Note that non-
chiral centers may also be inverted.
Atoms at ring fusions cannot be inverted individually, but only as part of the
inversion of the whole molecule.
The plane must be defined on the molecule. Any arbitrary atoms may be
reflected through the plane. Examine resulting bonding geometry carefully,
since the program pays no attention to the geometrical arrangement during this
operation.
Additional Information:
• DEFINE PLANE to calculate the equation of the plane.
L ij ≤ D ij ≤ U ij [EQ 2]
Lij is the lower bound for the distance (Dij) between atoms i and j, and Uij is the
upper bound. A delta of 0.05 is added to or subtracted from the value obtained
from the parameter tables to derive the upper and lower bounds respectively.
The specified torsion angles are scanned, through a full 360°, for positions
which relieve bad steric interactions. Only one bond at a time is altered. After a
position is found, that bond is removed from the set being considered. Scanning
continues until all interactions dependent upon these bonds are relieved or until
no progress is made from one iteration to the next. Interrupt the process by
typing <control> C.
Additional Information:
• TAILOR SET SCAN to alter characteristics of the scan.
• The following BIOPOLYMER subcommands: ADD_SIDECHAIN, CHANGE,
INSERT, JOIN, SET CONFORMATION.
Assuming the starting set of molecules are different conformations of the same
structure, this command creates another conformation (of the same molecule)
where:
• Coordinates of every atom are the average of x, y, z coordinates of
corresponding atoms in selected molecules.
• All other aspects of the molecule (bonds, substructures, features, etc.)
are taken (arbitrarily) from one of the selected molecules.
If the selected molecules do not all contain the same number of atoms, an error
message is displayed and no molecule is created. Make sure all other aspects of
the molecules are consistent (bonds, substructures, etc.).
Note:
Additional Information:
• VISUALIZE, in the Graphics Manual, to display some of the defined
features.
• List Information on One or More SYBYL Objects on page 22.
3.4.2 Centroid
A centroid is a dummy atom at the center of a group of atoms. When defined, it
is added to the coordinate list and a feature in the molecular description. The
dummy atom is connected through a dummy bond to the atom used in the calcu-
lation closest to its position. (See TAILOR SET CENTROID to alter the charac-
teristics of the centroid.)
3.4.4 Line
A line adds a dummy atom at a specified distance along the path between two
atoms.
3.4.5 Normal
A normal is a line normal to a plane through an atom. It is represented by two
dummy atoms on either side of a specified atom. Two dummy bonds connect
them to the midpoint. The bonds are perpendicular to the specified plane.
Distance from the midpoint to the dummy atoms is a variable set at 1 Å by
default. The normal is stored as two dummy atoms, two dummy bonds and a
feature in the molecular description. (See TAILOR SET NORMAL to alter the
characteristics of the normal.)
3.4.6 Plane
A plane is represented by four dummy atoms connected by four dummy bonds
delineating a parallelogram. (See TAILOR SET PLANE to alter the character-
istics of the plane.) The plane is stored as four dummy atoms, four dummy
bonds and a feature in the molecular description.
For details on defining specific features and constraints, see the “Specifying 3D
Queries” chapter, in the SLN Manual.
SYBYL Objects
This chapter defines some of the basic vocabulary and syntax you need when
working with SYBYL.
• Glossary on page 80
• Identify Atoms, Bonds, etc. on page 84
• General Naming Conventions on page 84
• Atom(s) Specification on page 85
• Bond(s) Specification on page 86
• Substructure(s) Specification on page 87
• Set(s) Specification on page 88
• Molecule(s) Specification on page 88
• Molecule Area(s) Specification on page 88
• Monomer Sequence(s) Specification on page 89
• Conformational Specifications on page 90
• Create Expressions on page 92
• Logical Operators on page 92
• Parentheses and Grouping of Operations on page 94
• Sets of Objects on page 95
• Global Sets on page 96
• Local Sets on page 99
• Dynamic Sets on page 100
• Built-in Sets on page 101
• Static Sets on page 106
4.1 Glossary
Atoms—The fundamental building blocks of molecules. You may name them
arbitrarily and specify their type. Parameters for 32 different atom types are
provided in SYBYL. An extended list of 103 atom types is available in the file
$TA_DEMO/metals.tpd. You can easily expand this number by adding new
types of your own. Atoms can exist as bonded entities or singly.
Rings—SYBYL detects the formation and records the presence of rings at all
times in all molecules. Rings have wide-ranging implications for conforma-
tional manipulations as well as modification of internal parameters such as bond
lengths and angles and they play an important role in identifying similarities
among molecules.
The figure below illustrates both internal and external rings. The boxes
delineate substructure (monomer) boundaries in this peptide fragment. The
phenyl group in the phenylalanine monomer is an internal ring since it occurs
completely within the confines of a substructure. The heavy, dark bonds
indicate a ring formed by the cross-linking of the peptide by a disulfide bridge
between two cysteine monomers. It is termed an external ring because it crosses
substructure boundaries.
time. The built-in sets in SYBYL are dynamic sets: Aromatic, H-bonds,
Backbone, Sidechain, Rings, and Bumps.
In the case of non-polymers, the only control you have over the creation and
designation of substructures is in the order you construct the molecules or in
fragments chosen from the standard fragment library. (All fragments in the
fragment library are designated as substructures.) There is no unique assignment
of substructures to molecules. One person might assign them differently from
another. For example, the figure below shows two copies of a single molecule
which have been partitioned differently into substructures. Neither one is neces-
sarily a better choice than the other; they are merely different.
In addition to the conventions listed in the table above, there are also object-
specific protocols. These are discussed in the following sections:
• Atom(s) Specification on page 85
• Bond(s) Specification on page 86
• Substructure(s) Specification on page 87
• Set(s) Specification on page 88
• Molecule(s) Specification on page 88
• Molecule Area(s) Specification on page 88
• Monomer Sequence(s) Specification on page 89
• Conformational Specifications on page 90
See also the Atom Expression dialog to make your selection with the mouse.
See also the Bond Expression dialog to make your selection with the mouse.
The molecule area name (M#) associated with a molecule remains the same
during a session.
is equivalent to:
arg=arg=val=phe=ser=cys=ser=cys=ser=cys
Note that monomer names, as defined in the dictionary, may not contain digits,
and each monomer in a molecule should contain a sequence number in its name
(usually indicating its position in the chain).
A number refers to the monomer with that sequence number as part of its name
(e.g. 8 in GLY8). As a special case, to identify the monomer by its substructure
ID number, precede the ID number with a hash or pound sign (#). This is partic-
ularly useful when dealing with unresolved ends of protein chains.
Examples:
alpha_helix
alph
phi=-58.0,psi=-47.0
staggered,beta=120
When SYBYL prompts you for an object (atom, bond, etc.), you can use any of
the methods described in the previous section, which yields exactly one object
when interpreted. When you are prompted for an object expression, you may
enter a group of objects. Object expressions allow you to combine various
objects to produce a resultant set, which is the exact portion of the molecule that
you want to manipulate.
The Venn diagrams below illustrate the logical operators. Shaded areas
represent the selected set D which results from the indicated operations. The
outer circle represents the total set from which the subsets are chosen.
Examples
Union
Locate all carbons and oxygens in a molecule and color them green.
COLOR ATOM <C>+<O> GREEN
Color all alpha carbons, other carbons, nitrogens, and oxygens red (these atoms
make up a peptide backbone).
COLOR ATOM CA,C,N,O RED
Intersection
Identify atoms in the active site of an enzyme and also in a helical secondary
structure:
LIST ATOMS {HELIX*}&{ACTIVE_SITE} BRIEF
This presumes that the sets {HELIX} and {ACTIVE_SITE} have been defined
previously.
Locate all carbons which have a partial charge between 0.0 and 0.10 in a
particular molecule and color them red.
COLOR ATOM <C>&{CHARGE(0.0,0.1)} RED
Difference
Color all atoms not in the backbone of a biopolymer yellow:
COLOR ATOM *-{BACKBONE} YELLOW
The asterisk selects all atoms and then the backbone atoms are subtracted.
{BACKBONE} is a built-in set defined for all polymers.
Negation
Select sidechain atoms for a biopolymer, exclude backbone atoms:
COLOR ATOM ~{BACKBONE} YELLOW
Examples
Locate all carbons and oxygens which are in hydrophobic residues of a protein:
DISPLAY (<C>+<O>)&{HYDROPHOBIC}
When you reference a set name in the context of a command, the members of
the defined set are automatically identified as the object of the action. If the
request is for atoms or bonds, the specified substructures are expanded to their
respective atom or bond constituents automatically.
The diagram below shows sets used in SYBYL and their interrelationships.
Global sets which are built into the program are typically defined in the
MACROMOL dictionary. (See Global Sets in the MACROMOL Dictionary on
page 97.) When the dictionary is opened, sets are available for use automati-
cally.
• Define a Global Set on page 96
• Modify a Global Set on page 97
• Remove a Global Set on page 97
• Global Sets in the MACROMOL Dictionary on page 97
Note: If a global definition is deleted, its copies associated with the molecules
in the work areas are also deleted. If the local (molecule-associated) copy is
modified, it is no longer considered related to the global definition from which
it was derived. In that case, deletion of the global set of the same name does not
affect the local copy.
Although you can define a new global set as it becomes necessary, several
global sets are already associated with the MACROMOL dictionary and
automatically become available for use when the dictionary is opened.
The table below provides a complete listing of global sets currently available in
the MACROMOL dictionary, accompanied by objects to which they apply and
a defining expression explaining how the various sets were created.
Note: Aggregates are local sets and their description can be changed using this
command.
When an atom or a bond involved in a static local set is removed from the
molecule, the set membership is automatically updated.
Because objects are not permanently assigned to a set of this type, dynamic sets
are most often used to monitor properties of molecules which are subject to
change, such as conformation, charge, strain energy among many others.
To color the atoms that are members of the dynamic set ACTIVE_SITE:
COLOR ATOM {active-site} MAGENTA
The membership is evaluated at the time of reference according to the definition
rule. Substructures belonging to the set are expanded into their constituent
atoms for the execution of the command.
Color H-Bonds
Dynamic H-bonds, since they are created via AUTOMONITOR, are treated as
monitor lines. The color of the line changes with the distance between the
atoms: red if less than a minimum value; yellow if within a certain range, and
no line is drawn if farther apart than a maximum distance. The minimum and
maximum values are specified by the tailor variables HBONDS MIN_DISTANCE
and HBONDS MAX_DISTANCE, respectively.
Delete H-Bonds
To delete dynamic H-bonds, you must turn automonitoring off by selecting
VIEW >>> AUTOMONITOR >>> AUTOMONITOR >>> ALL_OFF.
Resize H-Bonds
Dynamic H-bonds (created using VIEW >>> DISPLAY H-BONDS >>>
DYNAMIC (or the AUTOMONITOR HBOND command) cannot be resized.
The table below contains a complete listing of built-in sets currently available in
SYBYL, the forms in which they are invoked (commands and arguments
required, if any), explanations, and examples.
AROMATIC {AROMATIC(atom_expression)}
• Atoms Atoms in same aromatic system as specified atom(s).
LIST ATOM {AROMATIC(9)} BRIEF
BACKBONE {BACKBONE}
• Atom Set Atoms belonging to the backbone as defined in the MAC-
ROMOL dictionary.
COLOR ATOM {BACKBONE} RED
• Bonds {BACKBONE}
Bonds belonging to the backbone as defined in the MAC-
ROMOL dictionary.
SCAN {BACKBONE}
BUMPS {BUMPS(atom1,atom2)}
• Atoms Atoms in one group having van der Waals contacts with
atoms of the other group. van der Waals parameters stored
in the file $TA_ASCTABLES/ATOM_DEF are used.
Use TAILOR SET GENERAL
BUMPS_CONTACT_DISTANCE to define the cutoff dis-
tance. Negative values allow overlap of van der Waals
spheres, positive values prohibit it. Default is -0.16 Å.
COLOR ATOM {BUMPS(atom1,atom2)} MAGENTA
CHARGE {CHARGE(minimum,maximum)}
• Atoms Atoms having a residual charge in specified range.
COLOR ATOM {CHARGE(-.05,-.01)} BLUE
CHIRAL {CHIRAL(atom_expression,RS)}
• Atoms Atoms of specified chirality or pro-chirality. Specify
atoms to search as an expression. Chirality is indicated by
second argument as: R, S, RS, PRO_R, PRO_S, or PRO_RS.
If RS or PRO_RS is entered, all centers are included in the
set. Default is to search all atoms (*) for all chiral centers
(RS).
COLOR ATOM {CHIRAL(CA,S)} YELLOW
FINDCONF {FINDCONF(state1+state2+,sequence)}
• Substructures Monomers having specified conformational state(s) as
defined in the MACROMOL dictionary. Entering a
sequence limits search to specified regions of the biopoly-
mer (“*” searches whole biopolymer). Separate conforma-
tional states by plus signs.
LABEL SUBSTRUCTURE {FINDCONF(ALPHA_HELIX,*)}
HBOND {HBOND(atom_expression,type)}
• Atoms Atoms of specified type participating in hydrogen bonds.
Valid types include: ALL, DONOR, ACCEPTOR, or HYDRO-
GEN. Definitions for donor and acceptor atoms are in the
parameter table $TA_ASCTABLES/ATOM_DEF as
H_ACCEPTOR and H_DONOR fields.
LIST ATOM {HBOND(1+2+3+4+5+6,donor)} BRIEF
MONPROP {MONPROP(keyword,minimum,maximum)}
• Substructures Monomers having specified property as identified by a
keyword and (optional) minimum and maximum values.
Enter only the keyword to select all monomers having that
keyword. Enter keyword and minimum to select mono-
mers with the keyword whose value matches the mini-
mum. The keyword may be any arbitrary string. Values
may be real, integer, or string. Properties are stored in the
MACROMOL dictionary (molecular weight is stored as
MOL_WT).
LABEL SUBSTRUCTURE {MONPROP(MOL_WT,150,200)}
MONTYPE {MONTYPE(type1,type2,...)}
• Substructures Monomers of specified type(s). Types are defined in the
MACROMOL dictionary. As many types as desired may
be specified as arguments. An asterisk (*) specifies all
substructures that are monomers.
LABEL SUBSTRUCTURE {MONTYPE(A,T)}
POSSIBLE_HBON {POSSIBLE_HBOND(atom_expression,type)
D Atoms of specified type which can potentially participate
• Atoms in hydrogen bonds. Valid types include:
• ALL
• DONOR—Potential H bond donor atom, attached to a
hydrogen or has at least one free valence.
• ACCEPTOR—Potential H bond acceptor.
• HYDROGEN—Hydrogen attached to an H bond donor.
LIST ATOM {POSSIBLE_HBOND(*,all)} BRIEF
RINGS {RINGS(atom_expression,type)}
• Atoms Specified atoms which are included in rings of specified
type. Types include:
• I—Internal rings (completely contained within a
substructure).
• E—External rings (crossing substructure boundaries).
• EI (IE)—Either internal or external.
If no arguments are entered, defaults to {rings(*,EI)}.
COLOR ATOM {RINGS(*,E)} BLUE
• Bonds {RINGS(bond_expression,type)}
Specified bonds which are included in rings of specified
type. Types include:
• I—Internal rings (completely contained within a
substructure)
• E—External rings (crossing substructure boundaries)
• EI (IE)—Either internal or external.
If no arguments are entered, defaults to {rings(*,EI)}.
COLOR BOND {RINGS(*,I)} RED
• Substructures {RINGS(substructure_expression,type)}
Substructures in the expression, which are included in
rings of specified type.
COLOR SUBSTRUCTURE {RINGS(*,E)} YELLOW
SEQUENCE {SEQUENCE(sequence1,sequence2,)}
• Atoms Atoms in monomers of specified sequence(s). Monomers
are defined in the MACROMOL dictionary. Read how to
select specific monomer sequences.
COLOR ATOM {SEQUENCE(GLY=PRO,GLY=GLY)} BLUE
COLOR ATOM {SEQUENCE(A/1:25)} RED
COLOR ATOM {SEQUENCE(<,>)} MAGENTA
• Bonds {SEQUENCE(sequence1,sequence2,)}
Bonds in monomers of specified sequence(s). Monomers
are defined in the MACROMOL dictionary.
SCAN {SEQUENCE(GLY=PRO)}
• Substructures {SEQUENCE(sequence1,sequence2,)}
Monomers in specified sequence(s). Monomers are
defined in the MACROMOL dictionary.
LABEL SUBSTRUCTURE {SEQUENCE(A=T=C,T=*=U)}
SIDECHAIN {SIDECHAIN}
• Atoms Atoms belonging to sidechains as defined in the MACRO-
MOL dictionary.
COLOR ATOM {SIDECHAIN} RED
• Bonds {SIDECHAIN}
Bonds belonging to sidechains as defined in the MACRO-
MOL dictionary.
SCAN {SIDECHAIN}
SPHERE {SPHERE(atom_expression,radius)}
• Atoms Atoms falling within sphere(s) of specified radius. The
expression defines the sphere center(s). When multiple
atoms are selected, the final set is the union of sets of
atoms within spheres of indicated radius about each cen-
ter. All spheres have same radius.
COLOR ATOM {SPHERE(ALA23.CA,10)} MAGENTA
• Bonds {SPHERE(atom_expression,radius)}
Bonds falling within sphere(s) of specified radius. The
expression defines the sphere center(s). When multiple
atoms are selected, the final set is the union of sets of
bonds within spheres of indicated radius about each cen-
ter. Note: Only bonds with both endpoint atoms in the
sphere are accepted. All spheres have same radius.
SCAN {SPHERE(N15,8)}
• Substructures {SPHERE(atom_expression,radius)}
Substructures falling within sphere(s) of specified radius.
The expression defines the sphere center(s). When multi-
ple atoms are selected, the final set is the union of sets of
substructures within spheres of indicated radius about each
center. Note: Substructure is accepted, even if only one of
its atoms falls within the sphere. All spheres have same
radius.
LABEL SUBSTRUCTURE {SPHERE(G16,12)}
SUBST_SPHERE {SUBST_SPHERE(atom_expression,radius)}
Atoms, bonds, or substructures falling within sphere(s) of
specified radius. The expression defines the sphere cen-
ter(s). When multiple atoms are selected, the final
SUBST_SPHERE set is the union of sets of substructures
included in spheres of indicated radius about each center.
Note: Substructure is accepted, even if only one of its
atoms falls within the sphere (identical to sphere for sub-
structures).
TO_ATOMS {TO_ATOMS(atom_expression)}
• Bonds Bonds with one or both atoms in specified expression.
SCAN {TO_ATOMS(CA)} CYAN
The latitude with which you can define members of a static set provides great
flexibility in the manipulation of molecular data. For example, static sets can
define:
• active site portion of an enzyme (select atoms or monomers involved)
• diene and dienophile portions of a molecule designed to undergo an
intramolecular cyclo-addition reaction
• glycone and aglycone portions of a nucleoside
• acyclic precursor region of what becomes part of a larger structure upon
cyclization
Color H-Bonds
The color of static H-bonds is specified at the time of creation. To recolor them:
¾ Select View >>> Backgrounds >>> Color.
Delete H-Bonds
Static H-bonds are treated as backgrounds and can be deleted in the same way
as any background.
Resize H-Bonds
Static H-bonds (created using VIEW >>> Display H-Bonds >>> Static or the
HBONDS command) can be resized:
¾ Select Graphics.
¾ Press Apply button and the bonds are resized (press Close).
Selection of Atoms/Bonds/Sequences
Option
Menu
The option menu in the upper left corner of the dialog can be thought of as the
selection mode. It indicates the type of object on which to base the expression
formula. For example, if Atoms is chosen, clicking on an atom highlights just
that atom and the formula will contain the atom ID. However, if Monomers is
chosen, clicking on an atom highlights the entire monomer to which the selected
atom belongs and the formula will contain the residue number.
Unlike the Atom and Bond Expression dialogs, the Sequence Expression dialog
only has one option available in the upper left menu: Monomer.
Manage Selections
Select Objects
1—Single ar—Aromatic
2—Double du—Dummy
3—Triple un—Unknown (cannot determine from parameter tables)
am—Amide nc—Not connected
Additional Information:
• Monomer Sequence(s) Specification on page 89
• Set(s) Specification on page 88
• Global Sets in the MACROMOL Dictionary
Setup
¾ If there are more than one molecule on the screen, click All, then OK
in the Molecule Expression dialog.
¾ View >>> Delete All Backgrounds
Atom
Option
Button
The option button in the upper left corner is set to Atoms, by default. This
button enables you to select various portions of a molecule (atoms or
monomers).
¾ In the Atom Expression dialog, press All to select all atoms in the
current work area.
The Selected Atoms information box at the top of the Atom Expression
dialog shows that 49 atoms are currently selected.
¾ Press Clear.
The 49 atoms are no longer selected (the Selected Atoms information box
now reports 0).
¾ Press Clear.
For the Atom Expression field to be useful, you must first understand what
atom ID numbers are and how SYBYL defines them.
¾ In the D1 row, press the Atom Labels column (it currently says
None).
¾ Select Id.
Atom ID numbers are unique numbers that SYBYL uses to identify atoms.
All atoms are labeled with an ID #.
¾ Press Clear and press All to select all atoms in the molecule.
An asterisk (*) appears between the parentheses, which represents all atoms
in the molecule.
You can also type atom ID numbers, types, Booleans, and defined sets directly
into the field.
¾ Replace * with 1+2+3 and press Apply.
Nothing happens until you press Apply. Three atoms are highlighted, since
you asked for atoms that had the SYBYL ID # 1, 2, and 3. The ‘+’ signifies
the “AND” Boolean.
¾ Press Clear.
So far, you have been able to add atoms to your selection because the Union
operator is selected in the Atom Expression dialog.
Use the Difference operator, a subtraction Boolean, to keep N.am atoms and
turn off N.2 atom.
¾ In the Atom Types dialog, select N.2 in the list (press OK).
The N.2 atom is no longer selected, since the Difference operator was used
to remove all N.2 atoms.
¾ Press Clear.
SYBYL also has a wide variety of unique “Sets” available. Sets can be used to
highlight unique characteristics of a molecule. Various pre-defined sets exist,
including: Aromatic, H-bonds, Backbone, Sidechain, Rings, and Bumps.
A built-in set is a rule based set, such as {AROMATIC}. For any molecule,
SYBYL identifies the atoms belonging to this set when they are needed.
You are also able to select chirality sets and specify a sphere to select the
molecules that are within the radius of the sphere.
¾ Press Sets.
¾ In the Sets dialog, turn on the Rings check box (press OK).
Four rings are identified and the Atom Expression field has a defined
argument to locate all atoms within a ring.
Select by Substructure
Load the crambin protein into SYBYL and color some of the substructures.
Note: There are many ways to display the Atom Expression dialog. Coloring
atoms is simply the one chosen for this example.
¾ In the textport window, type take $TA_DEMO/tut_big.spl.
¾ Press OK.
All atoms that are part of the two substructures are highlighted.
Select by Monomer
¾ In the Atom Expression dialog, change the option menu from Atoms to
Monomers.
An additional field is displayed in the Atom Expression dialog, just above
the molecule list.
Residue selection can also be done via the Monomer Types button.
¾ In the Sets dialog, select HELIX_H1 and HELIX_H2, and press OK.
¾ In the Sets dialog, click on the Backbone check box and press OK.
Since you used the Intersection operator, only atoms that are part of the
protein’s backbone and in one of the two helices are highlighted.
¾ Press Sets.
4. When finished, reset the atom colors and Atom Expression dialog:
Conclusion
At this point, you should have a good idea how to select atoms, sets of atoms,
and substructures. To understand how this can benefit your work, try the
following exercise.
Setup
¾ If there are more than one molecule on the screen, click All, then OK
in the Molecule Expression dialog.
¾ View >>> Delete All Backgrounds
¾ In the Bond Selection dialog, select two atoms that are connected to a
bond.
The entire bond is highlighted green and the bond number appears in the
Bond Expression field.
¾ Press Clear.
¾ Press Clear.
All possible bonds in the molecule are listed. Highlighting one or more of the
bond types and pressing OK selects all bonds in the molecule of the selected
type(s).
¾ In the Bond Types dialog, press Cancel.
Select by Substructure
The Substructures dialog enables you to select all bonds within selected
substructures.
¾ Press Cancel.
You can also obtain a list of bonds by selecting atoms or monomers in the
option menu (upper left corner of the Bond Expression dialog).
¾ Click on any three atoms in the crambin protein and press OK.
All bonds directly connected to atoms you selected are colored magenta.
Because atom coloring can only be done by also coloring the lines between
atoms, bond themselves are usually not colored. It is a good idea, at this point,
to reset the bond coloring you have just done.
Conclusion:
At this point, you should have a good idea how to select bonds, sets of bonds,
and substructures.
With the crambin structure still on the screen, display the Sequence Expression
dialog. (If crambin is not on the screen, type: take $TA_DEMO/
tut_big.spl.)
Unlike the Atom and Bond Expression dialogs, this dialog only has one option
available in the upper left menu: Monomer.
¾ Use the scroll bar below the field showing the chain residue list to
move to ILE33.
¾ Highlight ILE33 through ALA38 in the chain list by clicking on ILE33
and dragging the cursor through the other residue names and into
ALA38. (Note: As long as any portion of the residue’s name in the list
is highlighted, it is considered to be selected.)
This field lists the monomer sequence in the selected molecule area. It is empty
if the molecule has no monomer sequence.
¾ Press Clear.
¾ While still in the Sequence Expression dialog, press Sets to display the
Sets dialog.
¾ Press OK.
Note the equation in the Sequence Expression field now says:
M2({HYDROPHOBIC}).
¾ Press Clear.
Select by Monomer
The Residue Types dialog lists all possible monomer types available in the
dictionary. Highlighting one or more of the types and pressing OK selects all
residues in the molecule of the specified type(s).
¾ In the Residue Types dialog, select ALA in the list and press OK.
All of the alanine residues are highlighted.
Conclusion
At this point, you should have a good idea how to select protein substructures
by chain, by set, and by monomer. This concludes the Selection Tutorial.
Molecule Databases
Database Formats
Because MOL2 databases are composed of only ASCII files, they are more
portable across different machine platforms than binary databases (e.g., MOL2
databases are portable across SGI and IBM workstation platforms, whereas
binary databases are not).
MOL2 databases are less susceptible to corruption than binary databases and are
more recoverable in case of corruption, since molecules can be held in separate
files. However, manipulating files within a MOL2 database via the system shell
while the database is open can generate error messages. Closing the database
(and any tables using the database) and reopening it will usually eliminate such
errors.
Users can create a MOL2 database using the DATABASE CREATE and
DATABASE XCREATE commands, or from the system shell using existing MOL2
files created by other parts of SYBYL. For an example of this, see the Database
Tutorial on page 132. Also see the TAILOR SET DATABASE command for
information about the MULTIMOL2 variable and how it affects MOL2 databases
created from the system shell.
Database Qualifiers
SYBYL first checks the names of open databases (which can be seen using the
ALLOPEN command). If the database qualifier matches a name of an open
database, that database is selected for the operation. If no open database name
matches, then the SYBYL checks the alias of any open database. If there is a
match, that database is selected for the operation. If no aliases of open databases
match, then the qualifier is considered to be invalid. (The same process applies
to the open database arguments of the ALIAS, DEFAULT, XADD, XCLOSE, and
XSTATUS subcommands.)
Note: Double quotes must be used when spaces occur in a database qualifier.
Also, if the special character “!” occurs in a molecule name, the molecule name
should be enclosed in double quotes.
Examples:
Retrieve tryptophan from the default database and place it in M1. (No database
qualifier is needed.)
DATABASE GET (tryptophan) m1
Retrieve all molecules whose names begin with t (or T) from the database /usr/
me/mydb.mdb. For multiple matches, select one.
DATABASE GET /usr/me/mydb.mdb!(t*) m1
Retrieve the molecule named botulin from the database whose alias is toxins
and place it in M3. (A database such as /usr/me/project_xyz/toxins.mdb is
automatically assigned the alias toxins when opened.)
DATABASE GET toxins!botulin m3
See the OPEN and ALIAS subcommands for additional information about
database aliases.
Since system shell commands like rm and cp behave differently when operating
on regular files versus directories (without additional flags), the following
system shell scripts are provided in $TA_BIN.
• db_rm removes molecular databases of either format:
db_rm db1 ... dbN
Databases called LQSample and nci_2000 are distributed with SYBYL. These
databases are located in $TA_ROOT/data/tdb_databases and are used in
numerous tutorials. They have the Tripos database format. (See the UNITY
Manual for a description of Tripos Databases.)
• LQSample—41393 registered compounds. These compounds are a small
sample from an older version of the LeadQuest library and are no longer
sold. The LeadQuest library contains drug-like compounds with prede-
termined characteristics, guaranteed purity and identity, and of known
synthetic feasibility. To see the current LeadQuest library:
http://leadquest.tripos.com
• nci_2000—250251 registered compounds. These compounds were
screened by the National Cancer Institute in 2000. They are not commer-
cially available compounds, but the CAS number is included with the
structure. For a newer listing see:
http://dtp.nci.nih.gov/docs/3d_database/structural_information/
structural_data.html
For purposes of clarity, a small database containing amino acids has been
created and is used in this tutorial.
Set Up
¾ If there are more than one molecule on the screen, click All, then OK
in the Molecule Expression dialog.
¾ View >>> Delete All Backgrounds
2. Copy a file from the demo account to the working directory. Be careful to
include the space and period at the end of the line.
The names of the molecules that make up the aa database are listed in the
textport.
The MODE command is used for complex commands which have many options.
It allows you to establish the upper level command and only enter options until
you exit this mode, thus the outer command is not repeatedly entered. When
you are in this mode, any command not available as an option can be invoked
by preceding it with the word COMMAND.
2. Partition the polar amino acids, according to their charge, into sets named basic,
acidic, and polar_neutral. Include a comment string describing the set.
Database sets are user-defined groups of molecules which have some shared
property (or properties). These properties are distinguished from the ones which
Tripos defines (molecule types,…) and database classes, which appear below.
The group membership of database sets is static.
The definitions of all sets in the current database are shown in the textport.
There is no limit on the number of sets.
The definition and contents of all classes in the database are displayed. Notice
how HYDROPHOBIC contains everything that is not HYDROPHILIC (or more
precisely, “not in the property group hydrophilic”).
Give the molecule its proper name and add it to the database. Molecule names
may be any arbitrary string.
¾ Build/Edit >>> Name Molecule
Sets may be redefined at any time, even in terms of their own current contents,
so that it is easy to add a new member.
The next few sections will each present a different way of accessing database
molecules. Recall that the simplest way was to retrieve by molecule name. If
you do not know the name, either at all or exactly, you can use a wildcard to
search the database (by name) for any matching string. The wildcard (*) alone
selects all of the molecules.
This command takes either the molecule’s full or partial name (with wildcards).
The DATABASE command can use property group definitions as a basis for
generating selection menus. The Standard Fragment Library is organized using
just this feature.
Menu items are selected by number. You may move down a level, back up a
level, or go to the top.
¾ Enter 1.
Basic
1. ARGININE
2. HISTIDINE
3. LYSINE
¾ Enter 2.
3. Use an expression to retrieve glutamic acid after first restricting your search to
molecules that are both hydrophilic and acidic.
¾ Type: ENDMODE
Continue reading
SYBYL attempts to assign an alias to the newly opened database using the base
name of the full database name. (For example, if the full database name is /usr/
me/mydb.mdb, SYBYL attempts to assign it the alias “mydb”.) This makes
using database qualifiers easier. See the ALIAS subcommand for more infor-
mation about database aliases.
• Open Database via the Menubar on page 139
• Open Database via Command Line on page 140
• Open a New, Empty Database on page 140
• Copy Database Contents to New Database on page 141
• Define Alias for Database via Command Line on page 141
• Specify a Default Database on page 141
• Close Database via the Command Line on page 141
Notes:
• The database that is opened becomes the default user database.
• If the database is already open, the access mode of the open database is
changed to the newly specified mode.
• Any number of users may have the same database open READONLY.
However, if one user has a database open in APPEND or UPDATE
mode, nobody else has any access to it until the database is closed. If
one user has a database open in READONLY mode, nobody else is
allowed to open it in APPEND or UPDATE mode.
• MOL2 files written via the DATABASE command(s) have at least 6 digits
of precision. If the tailor variable MOL COORD_PLACES is set to < 6
(such as the default of 4), it is set to 6 during the operation of the
DATABASE command and reset when complete. If the value is > 6, the
tailor’s value is used throughout.
To unlock a database that was not properly closed because of a system crash,
enter the following in a textport:
$TA_BIN/<your_platform>/dbunlock
UIMS2 Variable:
• DATABASE OPEN assigns a value to the UIMS2 variable
DATABASE_NAME.
Additional Information:
• TAILOR SET DATABASE to alter characteristics of the database opening.
The database that is created becomes the default user database. It is automati-
cally opened in UPDATE mode; there is no need to open the database after
creation. If a file already exists with the given file name, you have a choice of
replacing the old one or issuing the command again to give another file name.
Replacing the old file creates a new file with that same name and deletes the
contents of the old file.
UIMS2 variable:
• The DATABASE CREATE and DATABASE XCREATE commands assign a
value to the UIMS2 variable DATABASE_NAME.
A database can only have one alias. If the specified database already has an
alias, the old alias is overwritten. A user assigned alias is lost when a user
database is closed.
Aliases are useful in conjunction with database qualifiers. An alias can be used
in a qualifier instead of the full database name, thus decreasing typing effort.
UIMS2 Variable:
• DATABASE DEFAULT assigns a value to the UIMS2 variable
DATABASE_NAME.
If the default database is closed and other databases are open, one is arbitrarily
selected as the new default user database.
Full database names are listed along with aliases given in parentheses. The
default user database is denoted.
The Venn diagrams below illustrate the logical operators. A, B, and C are
general object sets. Shaded areas represent the selected set D which results from
the indicated operations. The outer circle represents the total set from which the
subsets are chosen.
In database query expressions, operators are evaluated from left to right, with
operations of highest precedence evaluated first. The order of operator prece-
dence is (from highest to lowest):
Negation ~ highest
Intersection &
Union, Difference +– lowest
Examples:
Retrieve tryptophan from current database and place it in M1.
DATABASE GET (tryptophan) m1
Retrieve all molecules whose names begin with t (or T) from current database.
For multiple matches, you are asked to select one.
DATABASE GET (t*) m1
Retrieve all molecules whose names begin with h (or H) and are members of the
group substrate from current database. For multiple matches, you are asked to
select one molecules.
DATABASE GET (substrate & h*) m1
Retrieve all molecules whose names begin with 1,4,5 T and are members of the
group reaction1 or reaction2 (or both). Use parentheses to ensure the union
operation takes place before the intersection.
DATABASE GET (“1,4,5 T*” & (reaction1 + reaction2))
Double quotes around the (partial) molecule name are required since it contains
special characters.
Additional Information:
• A Note on Query Expressions on page 144.
Additional Information:
• The Molecular Spreadsheet Manual for a complete description.
Deletion of groups from a database has no effect on the molecules which were
inside those groups. Molecules themselves must be explicitly deleted.
The database must be open in UPDATE mode for this command to operate
successfully.
Grouping Mechanisms
Note:
• Database must be open in UPDATE mode for this command to operate
successfully, or APPEND mode is sufficient if the molecule group does
not already exist in the database.
• Each time a defined class’ name appears in a database query expression,
it is reevaluated and its members determined for the database.
• If a molecule, defined as a member of a set, is deleted, that molecule is
automatically removed from the set.
Additional Information:
• The Database Tutorial on page 132 for an example.
Notes:
• If new_name for a molecule already exists in the database, you are asked
whether or not the new molecule should replace the current one.
• If new_name for a group already exists in the database, the operation
fails.
• Database must be open in UPDATE mode for this command to operate
successfully.
• Both names may contain a database qualifier. However, the operation
fails if the qualifiers do not refer to the same database.
mol2dbms:
• MOL2 files containing more than one molecule are broken into multiple
MOL2 files, one molecule per MOL2 file. This “flattens” the database.
• MOL2 files are renamed so file name matches (or nearly matches) name
of molecule in file.
• Reorganization can decrease access time, but has little effect on database
size, since MOL2 databases rarely accrue unused space.
• Reorganizing a MOL2 database is useful only when the database was
created by the user directly from the system shell. This is because MOL2
databases, created and accessed only via the DATABASE command, have
neither multi-mol2 files nor misnamed MOL2 files.
• Whenever SYBYL writes MOL2 files via the DATABASE command(s),
they are written with at least 6 digits of precision. If the value of the
tailor variable MOL COORD_PLACES is less than 6 (such as the default of
4), it is set to 6 during the operation of the DATABASE command and
reset when complete. If, however, the values higher than 6, the tailor’s
value is used throughout.
• Warning: The DATABASE REORGANIZE command creates a new MOL2
database with a temporary name. This name is generated in the directory
set by the environment variable TMPDIR. If the variable is not set, the
new database is created in the working directory. If this variable is
defined in your environment, that is where the new database ends up,
and hence appears to be lost.
mdbms:
• A consistency check ensures the database contents match the index
structure. This is the only accepted method of recovery for a corrupted
database (as indicated by the error message RECORD_KEY_ERROR).
• Database must not be open by any user when the REORGANIZE command
is given. The reorganized database overwrites the old file.
• Highly active databases should be periodically reorganized to recover
unused space. Compressing the files can have a significant impact on
access time.
License Requirement:
Requires a QSAR license.
All molecule areas are cleared, including the one containing the core structure.
Upon completion, all occupied molecule areas are written into the designated
new database in REPLACE mode. If new database does not yet exist, it is
created. For multiple occurrences of core in the template or in the molecules
being aligned, you are prompted to select the appropriate substructure.
Be aware that the core is only used to determine the connectivity of the
substructures to be matched. Conformational information (e.g., chair vs. boat) is
not utilized.
Additional Information:
• Database Align dialog in the QSAR Manual.
The MOL MULT_OUT command (with the variable defined by TAILOR SET MOL
FILE_FORMAT set to MOL) has the same purpose.
Note: MOL2 files written via the DATABASE command(s) have at least 6 digits
of precision. If the tailor variable MOL COORD_PLACES is set to < 6 (such as the
default of 4), it is set to 6 during the operation of the DATABASE command and
reset when complete. If the value is > 6, the tailor’s value is used throughout.
Once connections to external databases and queries have been defined, use the
RDBMS Search menubar option to execute queries against external databases,
creating a MSS from the results. Additional columns can then be added to this
MSS using AutoFill and then selecting the RDBMS column type. You may
make use of a suite of SPL expression generators named %RDBMS_*() to build
custom scripts for accessing external databases using the information defined
via the RDBMS command.
Note: You may have multiple RDBMS instances, but only one active query per
instance.
Environment Variables:
The following environment variables affect the behavior of RDBMS and must
be set prior to starting SYBYL.
• TA_RDBMS_READ_TIMEOUT controls how long SYBYL waits for a
RDBMS query to respond. Complex queries or queries to remote
databases can take long to report back to SYBYL. Increasing this
variable forces SYBYL to wait longer. The time-out value is specified in
milliseconds and is an integer value. Default is 500 milliseconds.
• TA_RDBMS_REFS indicates location of rdbms_ref.col (defines connec-
tions).
• TA_RDBMS_QUERIES indicates location of rdbms_queries.col (defines
queries).
Additional Information:
• TAILOR SET TABLE to alter the treatment of nulls in RDBMS vector
and integer arrays.
• The UNITY Manual.
Each time this command is invoked, a new RDBMS query is added to your
SYBYL session. Each query is referenced by a unique alias name. Use of an
already existing alias name results in the previous query being replaced by the
new definition.
Example:
RDBMS QUERY DEFINE row_names oracle_sgi
select distinct(name) from sample_data
.
STRUCTURE_DB /usr4/krypton/3DB/Databases/sample \
DONE
Queries deleted with this command are removed from SYBYL session only.
Example:
RDBMS QUERY DELETE sigma_sample
Examples:
RDBMS QUERY EVAL row_names output_file regids DO_EVAL
sh cat regids
Examples:
RDBMS QUERY LIST ALL
Alias: bioact_sample
Database: oracle_sgi
Query Type: DB_SPECIFIC
REGID Symbol: REGID
Structure DB: (null)
Query: select bioactivity from sample_data where NAME=:
REGID
Alias: logp_sample
Database: oracle_sgi
Query Type: DB_SPECIFIC
REGID Symbol: REGID
Alias: row_names
Database: oracle_sgi
Query Type: DB_SPECIFIC
REGID Symbol: REGID
Structure DB: /usr4/krypton/3DB/Databases/sample
Query: select distinct(name) from sample_data
All defined queries are saved in a collect file for later use. If file exists, the new
definitions are appended to the file.
Example:
RDBMS REFERENCE DEFINE oracle_sgi oracle oracle polaris \
MACHINE_ACCESS_INFO CURRENT_USERID PROMPT_FOR_PASSWORD \
RDBMS_ACCESS_INFO EXPLICIT_USERID unity PROMPT_FOR_PASSWORD \
RDBMS_REGID_COLUMN NAME \
RDBMS_TABLE sample_data RDBMS_STRUCTURE_DB \
/usr4/krypton/3DB/Databases/sample \
RDBMS_TABLE cas30k_data RDBMS_STRUCTURE_DB \
/usr4/krypton/3DB/Databases/cas30k \
DONE
RDBMS REFERENCE DEFINE oracle_ibm SID oracle eagle \
MACHINE_ACCESS_INFO CURRENT_USERID PROMPT_FOR_PASSWORD \
RDBMS_ACCESS_INFO EXPLICIT_USERID unity PROMPT_FOR_PASSWORD \
RDBMS_REGID NAME \
RDBMS_TABLE sample_data RDBMS_STRUCTURE_DB \
/usr4/krypton/3DB/Databases/sample \
RDBMS_TABLE cas30k_data RDBMS_STRUCTURE_DB \
/usr4/krypton/3DB/Databases/cas30k \
DONE
Example:
RDBMS REFERENCE DELETE oracle_ibm
Example:
RDBMS REFERENCE LIST ALL
All currently defined RDBMS references are written to a collect file for later
use. If specified file exists, RDBMS references are appended to file.
Example:
RDBMS REFERENCE SAVE rdbms_refs.col
sh cat rdbms_refs.col
Expert’s Corner
The following information may be found helpful when working with SYBYL
• Manage Input/Output Formats on page 166
• Generate 3D Coordinates on page 166
• SMILES Strings on page 167
• SD/MACCS Files on page 168
• AMPAC Status Files on page 169
• MOPAC Status Files on page 169
• CSSR files on page 169
• Record and Play SYBYL Sessions on page 171
• Modes for SYBYL on page 176
• Command Modes and Execution Options on page 178
• Execute Operating System Commands from Within SYBYL on page 181
• Markush Atoms on page 182
• Textport Output Presentation on page 183
• Keyboard Shortcuts on page 184
• Control Characters on page 185
• Menubar Shortcuts and Comments on page 186
• Libraries of Chemical Groups and Fragments on page 187
• Edit Text Files with the Text Editor on page 192
References:
[1] R. S. Pearlman, “Rapid Generation of High Quality Approximate 3-
dimension Molecular Structures,” Chem. Des. Auto. News, 2,1, (1987).
[2] CONCORD/StereoPlex Manual, Tripos Inc., St. Louis MO, 1998.
SMILESTOMOL smiles
Reference:
[3] Weininger, D., SMILES, a Chemical Language and Information
Systems. 1. Introduction of Methodology and Encoding Rules, Chem.
Inf. Comput. Sci., 28, 31-36 (1988).
Additional Information:
• CONCORD command description.
• %smiles_to_mol expression generator.
Read/Write SD Files
Multiple molecules in the MOPAC file (.sta extension is implied) are displayed
in successive molecule areas. Any existing molecule in the designated molecule
area (and all subsequent areas used), is overwritten. Connectivity information is
derived from minimum (0.90 Å) and maximum (1.65 Å) distances of atoms.
Atom types are converted to SYBYL atom types. Verify all atom type assign-
ments.
When reading a CSSR (Cambridge Structure Search and Retrieval) file (default
extension .cssr), atom types are converted to SYBYL atom types. Verify all
atom type assignments. Any existing molecule in the designated molecule area
(and all subsequent areas used), is overwritten.
The following characters, when first on one line, have special meaning in a
collect file:
# ignore the line, typically used for comments,
% if current session is interactive ask user for confirmation before
continuing, typically used to pause during playback.
To store actions of menu picks in a collect file, first issue the command MENU
COLLECT ON.
UIMS2 Variable:
• uims2_collect_file—Name of the current collect file.
Additional Information:
• Record Terminal Dialog of Session on page 173
• Read Command Input From Text File on page 175
• Insert a Pause in a Recorded Session File on page 174
UIMS2 Variable:
• uims2_photo_file—Name of the current photo file.
Additional Information:
• Record Output from a Single Command on page 173
• Record Session Commands on page 172
By inserting this command into the recorded session file, you can halt program
execution for a specified number of seconds or indefinitely, if delta_time is set
to zero. In this way, you can give the user ample time to read the comments.
PAUSE is used in preparation of demonstration scripts. Note: These scripts
cannot be played back in menu mode.
Additional Information:
• Record Session Commands to prepare a script.
• Play Recorded Session to play back a prepared script.
Additional Information:
• Record Session Commands on page 172
• Read Command Input From Text File on page 175
TAKE51 filename
This command reads commands from a SYBYL 5.1 collect file. Text in the file
is executed as if it was manually entered through the keyboard. It is no longer
possible to create a SYBYL 5.1 collect file. However, the TAKE51 command
has been provided for reading old style collect files.
Input is read from the specified file (file extension must be included) until an
end-of-file condition is encountered. The text in the file is executed as if it was
manually entered by the keyboard. This is convenient when generating
command procedures without the use of the COLLECT command. (The tutorial
files (.demo) have this format.)
Warning:
TTY does not understand command context as does the TAKE command. Thus
any mistakes in the TTY file are faithfully executed.
The most commonly used mode is menubar and most of the documentation for
SYBYL is written from the perspective of running in this mode.
If auto terminal-typing does not work and sets your terminal type to
NOGRAPHICS or if auto terminal-typing is turned off, the menubar can be
activated by simply typing the command:
MENUBAR
Notes:
• In menubar mode, you may also enter commands without changing
picking modes. To return to the command mode from another mode,
type: set picking no_picking.
Default Values
When entering a command, the computer prompts for the arguments you must
supply. After each prompt, a set of angle brackets occurs containing a word,
numeric value, or phrase. This is the current default which will be accepted if
you simply hit the <return> key.
Special Characters
?—The help character may be entered at any time to obtain information about a
command, legal values of an argument, or format of a specific parameter.
^ (caret)—In the midst of entering arguments for a command, use the abort
character in response to any prompt to terminate the command operation. No
changes are made to the molecule(s).
| (vertical bar)—Use the end loop character to terminate the entry of parameters
being requested in an indefinitely repeating loop. When you have completed all
the required entries, respond to the next prompt with the end loop character.
SYBYL terminates the request loop and moves on to the next parameter.
MODE category
Commands) and photo (Log Session) files and to set the error reporting to
traceback. See Record and Play SYBYL Sessions on page 171 for a description
of the files and their purpose.
BASE and GROUP remain in effect until they are changed or until the end of the
session.
The operation you want to perform must be possible on all molecules, otherwise
an error message is issued for every failed attempt. For example, coloring all
alpha helices will fail on molecules that do not contain that secondary structure.
Examples:
The following examples assume four molecules in M1, M2, M3, M4, respec-
tively, all having backbone, heteroatoms, and hydrogen atoms.
Label all heteroatoms in all molecules on the screen. BASE and GROUP are
assumed to have their default values of M1 and *, respectively.
ALLMOLS LABEL HETERO M1(*)
Color backbones in the molecules in M1, M3, and M4 yellow. BASE is assumed
to have its default value of M1. (The molecule in M2 is unchanged.)
Remove hydrogens from the molecules in M2, M3, and M4. (The molecule in
M1 is unchanged.)
ALLMOLS BASE M2
ALLMOLS GROUP M3,M4
ALLMOLS UNDISPLAY M2(<H>)
TIME command_string
Any legal operating system command whose arguments can be given in a single
input line can be entered. System output from this command is echoed on the
terminal as usual.
Multiple Commands
SYBYL is suspended in its current state, a new process is created, and the
system shell is entered. Any operating system function, normally available
when outside the program, can be performed. Return to the SYBYL program by
typing EXIT.
This differs from the DCL command by making you explicitly aware of the new
environment and allowing any number of commands to be issued to the system
prior to returning control to SYBYL.
This is identical to the UNITY MARKUSH command, except that it does not start
the server and, therefore, does not require a UNITY license.
These single characters can be in upper- or lowercase, and need not be followed
by a <return>.
Key Action
F1 Push textport window back to view entire graphics screen;
press it again to put text on top.
F5 Toggle between:
• Photo display mode—Expand SYBYL graphics area to fill
entire display. Borders, decorations, controls, and textport
are pushed behind graphics area.
• Normal display mode—Show window border, toolbox,
etc.
F6 Toggle display of toolbox icons on and off.
F7 Toggle hardware stereo on and off. F7 is active only if neces-
sary hardware is available and OGLX driver is being used.
F8 Raise/lower SYBYL graphics window. When full screen hard-
ware stereo is active, the menubar is also raised/lowered.
F9 Toggle between Global and D1.
F10 Toggle between Global and D2.
F11 Toggle between Global and D3.
F12 Toggle between Global and D4.
Up arrow Rotate +90° about the X-axis.
Down arrow Rotate –90° about the X-axis.
Left arrow Rotate +90° about the Y-axis.
Right arrow Rotate –90° about the Y-axis.
Control Use in combination with a left mouse pick to select a mole-
cule or background for object transformation.
For convenience, the molecules in the library are given these same names.
Each group has a unique attachment point, internally equal to the root atom of
the root substructure of the molecule. Externally, a convention has been estab-
lished which identifies the attachment point as the first atom in the molecule
name (except for Phenyl). The internal convention must be observed for all
user-added groups in order for commands like ADD GROUP to function properly.
The basic idea is that each molecule in the fragment library is a member of one
(or more) static database sets. These sets form the lowest level of the hierarchy,
and group together those molecules which are very closely related in structure
and/or function. At higher levels of the hierarchy, those molecules related in a
more general sense appear in the same group. This produces a structure which
looks like an inverted tree, with general categories at the top diverging into
more and more specific categories until, at the bottom, a single molecule is left.
The lowest level groups—those which contain the actual molecules—are static
sets, while all higher level categories are dynamic classes, defined as the union
of those groups directly under them. Thus the “56 Systems” are contained in a
static set, such as FIVESIX, but the group which corresponds to “1 Heteroatom”
in the above diagram is a dynamic class defined as the union of FIVESIX and
SIXSIX (FIVESIX+SIXSIX). Similarly, “Heterocyclic 2 Rings” is a dynamic
class defined as the union of “1 Heteroatom”, “2 Heteroatoms” and “>2
Heteroatoms”.
Tripos’ standard fragment library contains about 200 molecules partitioned into
44 static sets, and categorized into a hierarchy comprised of 17 dynamic classes.
Below is the full listing of sets and classes provided by Tripos to organize the
fragment library. In the listing, the dynamic classes are represented in italic
script, all others are static sets.
A: Acyclic Functions
• AA: Carbon Only
• AB: Function N
• AC: Function O
• AD: Function S
• AE: Function NO
• AF: Function SO
• AG: Function PO
• AH: Other
B: Cyclic Functions
• BA: Homocyclic, 1 Ring
BAA: Saturated
BAB: Unsaturated, 1 Double Bond
BAC: Unsaturated, 2 Double Bonds
BAD: Unsaturated, >2 Double Bonds
BAE: Aromatic
• BB: Homocyclic, 2 Rings
BBA: Saturated
BBB: Unsaturated
• BC: Homocyclic, 3 Rings
• BD: Homocyclic, 4 Rings
BDA: Steroids
BDB: Other
• BE: Heterocyclic, 1 Ring
BEA: 1 Heteroatom
• BEAA: 5 Membered Ring, Saturated
• BEAB: 5 Membered Ring, Unsaturated
• BEAC: 6 Membered Ring, Saturated
• BEAD: 6 Membered Ring, Unsaturated
• BEAE: Other
BEB: 2 Heteroatoms
• BEBA: 5 Membered Ring, Saturated
• BEBB: 5 Membered Ring, Unsaturated
• BEBC: 6 Membered Ring, Saturated
• BEBD: 6 Membered Ring, Unsaturated
BEC: >2 Heteroatoms
• BF: Heterocyclic, 2 Rings
BFA: 1 Heteroatom
• BFAA: 56 Systems
• BFAB: 66 Systems
BFB: 2 Heteroatoms
• BFBA: 56 Systems
• BFBB: 66 Systems
BFC: >2 Heteroatoms
• BG: Heterocyclic, 3 Rings
BGA: 1 Heteroatom
• BGAA: 656 Systems
• BGAB: 666 Systems
• BGAC: 676 Systems
BGB: 2 Heteroatoms
• BGBA: 666 Systems
• BGBB: 676 Systems
BGC: >2 Heteroatoms
C: Amino Acids
D: Nucleic Acids
• DA: Bases
• DB: Ribose Monophosphate
E: Biologically Important Molecules
• EA: Vitamins
• EB: Sugars
• EC: Lipids
Save Save any changes made in the text field to the original
file.
Save As Specify a new location and/or file name in the Save
Text File dialog.
Search Search for the specified string entered in the displayed
dialog.
Again Locate the next occurrence of the search string.
Chapter 7. Expert’s Corner
Edit Text Files with the Text Editor
A Attach
chain 48, 56
Abort a command 177
Attributes
Access help 11 all atom
Acidic global se 98 remove 53
all bond
Add
atom 47
remove 55
stereo atom
bond 53
remove 69
chain 48
stereo bond
features and constraints 76
remove 69
group 56
hydrogens 48 Automatic command execution 178
rawatom 48 Average molecule 70
ADD_PSEUDOATOMS 49
Aggregate B
sets 95
Backbone
Alignment built-in set 102
database on a template 152
Basic global set 98
ALLMOLS 179
Bibliography
Alternate atom types 51 chirality assignment 67
Amide bond 86 SHAKE algorithm 69
Amino acid BIOPOLYMER
global sets 98 sets 97
AMPAC Block global set 98
read status file 169 Bond
Angle add 53
between planes 27 angle list 21
measure 27 attributes 67
modify 55 expression rules 86
Aromatic bond 86 length
list 21
Aromatic built-in set 102 naming conventions 86
Atom remove 55
add 47 stereo attributes 67
chirality attribute 67 type
expression rules 85 modify 55
modify 49 Building
naming conventions 85 small molecule 41
rawatom 48
remove 53 101
Built-in sets
Atom expression Bulky global set 98
dialog 112 Bumps
field 115 built-in set 102
Atom types
alternate set 51 C
modify 50
C-alpha carbon global set 98
Atomic coordinate
topography 21 Cambridge
R RS isomerism 67
Rawatom, adding 48
RDBMS 154 S
Read file dialog 16 Saving
in the memory stack 30
Read/save database molecules 153 molecules in a database 148
Recording sketch 40
COLLECT command 172 Scanning
PHOTO command 173 torsions 69
Recover contents of molecule area 30 Scrolling text region 183
References SD file
chirality assignment 67 read/write files 168
SHAKE algorithm 69
Search
Reflect atoms 68 database 147
Remove Sequence
all atom attribute 53 built-in set 104
all bond attribute 55
atom 53 SET
bond 55 PICKING 176
center of mass 72 Sets 95
centroid 72 aggregates 95
extension point 73 built-in 101
global set 97 database 149
line 74 dialog 127
local set 100 dynamic 100
non query atoms 77 for biopolymers 97
U
Undo last action 30
Union
operator
in a database 144
in object expression 92
UNITY
defining features 77
MACCS file conversion to hitlist 168
modify feature 77
remove features and constraints 78
remove non query atoms 77
Unix utilities
dbtranslate
see UNITY manuals
use in MACCS conversion 168
dbunlock 138
User-defined sets 96
Z
ZAP 29
ZE isomerism 67