Professional Documents
Culture Documents
Copyright
Although the information in this document has been carefully reviewed, Platform Computing Corporation (Platform) does not warrant it to be free of errors or omissions. Platform reserves the right to make corrections, updates, revisions or changes to the information in this document. UNLESS OTHERWISE EXPRESSLY STATED BY PLATFORM, THE PROGRAM DESCRIBED IN THIS DOCUMENT IS PROVIDED AS IS AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. IN NO EVENT WILL PLATFORM COMPUTING BE LIABLE TO ANYONE FOR SPECIAL, COLLATERAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING WITHOUT LIMITATION ANY LOST PROFITS, DATA, OR SAVINGS, ARISING OUT OF THE USE OF OR INABILITY TO USE THIS PROGRAM. Wed like to hear from you You can help us make this document better by telling us what you think of the content, organization, and usefulness of the information. If you find an error, or just want to make a suggestion for improving this document, please address your comments to doc@platform.com. Your comments should pertain only to Platform documentation. For product support, contact support@platform.com. Document redistribution and translation Internal redistribution This document is protected by copyright and you may not redistribute or translate it into another language, in part or in whole.
You may only redistribute this document internally within your organization (for example, on an intranet) provided that you continue to check the Platform Web site for updates and update your version of the documentation. You may not make it available to your organization over the Internet. LSF is a registered trademark of Platform Computing Corporation in the United States and in other jurisdictions. ACCELERATING INTELLIGENCE, PLATFORM COMPUTING, PLATFORM SYMPHONY, PLATFORM JOB SCHEDULER, PLATFORM ISF, PLATFORM ENTERPRISE GRID ORCHESTRATOR, PLATFORM EGO, and the PLATFORM and PLATFORM LSF logos are trademarks of Platform Computing Corporation in the United States and in other jurisdictions. UNIX is a registered trademark of The Open Group in the United States and in other jurisdictions. Linux is the registered trademark of Linus Torvalds in the U.S. and other countries. Microsoft is either a registered trademark or a trademark of Microsoft Corporation in the United States and/or other countries. Windows is a registered trademark of Microsoft Corporation in the United States and other countries. Intel, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other products or services mentioned in this document are identified by the trademarks or service marks of their respective owners.
Trademarks
http://www.platform.com/Company/third.part.license.htm
Contents
About this kit .......................................................................................................................... 5 Install a kit .............................................................................................................................. 6 Install a kit ....................................................................................................................... 6 Install kit using CLI .......................................................................................................... 6 Install kit using One Step Install on the Platform HPC Console ...................................... 9 Remove a kit ........................................................................................................................ 10 Remove kit using CLI .................................................................................................... 10 Remove kit using the Platform HPC Console ............................................................... 10 Included components .......................................................................................................... 12 Enable components ...................................................................................................... 13 Related configurations and commands ............................................................................... 14 Related documentation and important links ........................................................................ 15
Important:
This version of the HPC kit for Platform HPC 2.1 is no longer dependent on kernel versions. The BLCR package was dropped as it depends on kernel versions. Linpack, Scalapack, and BLACS packaged with openmpi component only are compiled with and work with OpenMPI, not supporting mpich2 and mvapich2. This kit for Platform HPC 2.1 has a reduced set of packages that builds on RHEL 5.5.
This version of the HPC kit integrates with the OFED kit, Platform LSF kit, and Platform HPC Console kit.
Note:
The OS based HPC kit cannot work with the Platform OFED kit.
Install a kit
Install a kit
Install this kit by using the kusu-kitops or kusu-kit-install on the command-line interface (CLI), or by using the One Step Install feature of the Platform HPC Console. kit_install_overview.xml
Install a kit
Install this kit by using the kusu-kitops or kusu-kit-install on the command-line interface (CLI), or by using the One Step Install feature of the Platform HPC Console.
Note:
Cluster management tools had been renamed in this release. The old command names (such as kitops, repoman, cfmsync, ngedit, etc.) still exist but will be deprecated in future release.
Note:
The kusu-kit-install tool and its library files are packaged with a man page and documentation files as rpm. This rpm is a dependency of the component-base-installer component that is only meant for the head node and not for compute nodes. This is auto installed on the head node with the rest of the base kit during cluster deployment. If a kit is already present in the system, it still has to be added to the repositories. If a kit is already added to a repository, its components have to be associated with the matching node groups.
Follow these steps to install a kit using kusu-kit-install on the CLI: 1. Download the kit ISO. 2. Add the kit using kusu-kit-install tool.
or
kusu-kit-install /dev/cdrom
Default repository:
kusu-kit-install source
where source refers to the location of the kits to be installed; can be /tmp/kit.iso (file), /dev/ cdrom (mount point), or /data/kit (directory) Customize repository:
kusu-kit-install -r repo_names source
where repo_names refer to comma separated list of repository names Examples of repo_names: rhel5.5-5-x86_64,\ sles10.3-10-x86_64 Installation status appears once the process is completed.
where source is the location of the kits to be installed (for example: /tmp/kit.iso (file), /dev/
Options
Note:
Multiple repository names must be separated by commas (,). The default repository used is the native repository (with repo id = 1000).
Examples:
kusu-kit-install -r rhel5.3-5-x86_64, rhel5.2-5-x86_64 /mnt kusu-kit-install -r 'rhel5.3-5-x86_64, rhel5.2-5-x86_64'/mnt
-h, --help Prints the help option and exits. -v, --version Prints the version information and exits.
Install a kit 1. Import the kit information to the database, and copy files from the media. To do this, use the kusukitops command:
kusu-kitops -a -m path_to_kit
Note:
Command option definitions:
-a: Adds a new kit. -m: Points to the ISO (media) containing the kit files.
The kit and its components are registered in the database, and the kit contents are copied to this directory on the head node:
/depot/kits/kit_id
Note:
To check the kit id, use the command:
kusu-kitops -l
2. Add the kit to a repository on the head node. To do this, use the kusu-repoman command: a) Determine which repository on the head node to which you want to add the kit. To do this, list the repositories currently in the system.
kusu-repoman -l
Note:
Command option definitions:
-l: Lists the repositories currently in the system. -r: Selects the repository by name. -a: Adds the kit to the repository. -i: Points to id of the kit you are adding to the repository. -u: Rebuilds the repository metadata with the newly added kits.
3. Associate kit components with node groups. Some kits may automatically create associations with node groups. In this case, you can add components immediately by running kusu-cfmsync -p. If components are not automatically associated, run kusu-ngedit to launch the Node Group Editor tool, and then complete the following: a) Within the Node Group Editor, navigate to the Repository page. b) Ensure the node group is using the repository that contains the new kit, and then select Next. c) Navigate to the Components page. d) Select the components you want to install on all nodes in the node group.
Install a kit e) Save any changes you have made. 4. From the Node Group Editor, you can update existing nodes in the node group with the new kit components. The last page in kusu-ngedit asks if you want to push the kit package to the nodes using cfm (Configuration File Manager utility). You can also choose to run the kusu-cfmsync utility manually at a later time, or simply reboot the nodes to force a reinstall of the new packages.
Note:
Any new nodes added to the node group after saving your changes within kusu-ngedit will automatically have the kit package installed.
Install kit using One Step Install on the Platform HPC Console
The One Step Install integrates with the kusu-kit-install command to perform a one-step kit addition with acceptable defaults. Follow these steps to install a kit using the One Step Install function of the Platform HPC Console. 1. Download the kit ISO. 2. Open a browser and access the Platform HPC Console:
http://primary_host_name:8080/platform
3. Navigate Clusters>Cluster Inventory then click Kits. 4. Click Add. The One Step Install appears on the Installation Options. 5. Select location where kit ISO is stored. 6. Select repository. Select Default Repository to use default repository. Select Custom Selection repository to select the current and snapshot repository.
7. Select the kit ISO. 8. Click Add Kit. A message displays once kit installation is completed.
Remove a kit
Remove a kit
Remove kits via any of these two options: 1. Remove kit via the command-line interface (CLI). 2. Remove kit via the Platform HPC Console.
Important:
If the kit has a support component enabled on the head node group, disable other components (whether on the head nodes node group or other node groups) first before the support component.
Tip:
Use kusu-kitops to check which node groups are associated with the kit to be removed.
Tip:
Use kusu-repoman -l to check which repositories contain the kit to be removed.
Remove a kit
Caution:
A kit is made up of several components which could be associated with a node group. Disassociate the kit from the node group, remove it from the repository, then rebuild the repository first before removing the kit.
6. Click Delete. Delete confirmation prompt appears. 7. Confirm deletion of selected kit(s).
Included components
Included components
The Platform HPC kit is made up of several components. Some of these components contain subcomponents that you can choose to enable or disable, while others serve as wrappers containing required pieces (for example, complete library and tool sets).
Note:
Except for component-hpc-source-code, all components in the Platform HPC kit are associated with the installer and compute (package-based install) node groups by default.
componentbenchmarkingtools
By default, this component is enabled and associated with both the installer and package-based compute node groups. Subcomponents include:
component lets you test the disk and file system performance of your Platform Cluster Manager cluster. iozone v3_327: This is a disk benchmarking tool with graph-presentation capabilities. When enabled, this component lets you test the disk and file system performance of your PCM cluster.
componentThis is a high performance message-passing library. Enable this component if your linear-OpenMPI- application requires this library. By default, this component is enabled and libraries associated with the installer, compute-packaged,compute-imaged, and computediskless node groups.
Important:
This component depends on OpenMPI library which is provided in Platform OFED kit (component: component-OFED-OpenMPI-libraries) or Mellanox kit (component: component-MellanoxMPI-gcc). To prevent dependency problems, make sure that you have enabled the OpenMPI library before enabling component-linear-OpenMPI-libraries.
Subcomponents include:
Linpack (latest HPL package): This is a collection of Fortran subroutines that analyze and solve linear equations and linear least-squares problems. ScaLAPACK v1.8.0: This is a scalable LAPACK library for linear algebra computations on distributed memory MIMD parallel computers. You can use this to validate the HPC kit functionality. BLACS v20000224: This is an MPI library used for basic linear algebraoriented computations that can be used across a range of distributed memory platforms. You can use this to validate the Platform HPC kit functionality.
This component includes the MPICH2 v1.2.1. This is a portable implementation of a high performance MPI (Message Passing Interface). Enable this component if your application requires this library. By default, this component is enabled and associated with both the installer and package-based compute node groups.
component-hpclibraries
By default, this component is enabled and associated with both the installer and package-based compute node groups. Subcomponents include:
netCDF v4.0.1 and netcdf-devel: Set of software libraries and data formats for
managing array-oriented data. Enable this component if your application requires this library. HDF5 v1.8.4: This is a set of data models, file formats, APIs, libraries, and tools used to manage large and complex data collections. When enabled, this component provides for the management of extremely large and complex data collections. Enable this component if your application requires this library. FFTW v3.2.3 and FFTW-devel: A fast-fourier transform library. When enabled, this C-subroutine library computes discrete Fourier transform (DFT) in one or more dimensions of arbitrary input size and of real and complex data. Enable this component if your application requires this library. ATLAS v3.9.21: This is a library used for creating automatically tuned linear algebra software. When enabled, provides C and Fortran77 interfaces to certain implementations. Enable this component if your application requires this library.
component-hpcsource-code
This component includes all SRPMs in other components. By default, this component is not associated with any node group.
Enable components
Use the Node Group Editor (kusu-ngedit) to individually enable or disable components within a node group. 1. Run kusu-ngedit. 2. After selecting the node group on which this kit is installed, navigate to the Components window and select platform hpc. 3. Select (or clear) the desired kit components.
Visit the Platform Cluster Manager download site for all the latest Platform Cluster Manager product and kit documentation, including installation and user guides: http://my.platform.com/products/platform-hpc Guides for various Platform Cluster Manager kits are also installed on your local system:
http://primary_install_host_name/kits/kit_name/version/
hpc-libraries:
Homepage: http://fftw.org/ Documentation: http://fftw.org/#documentation NetCDF: http://www.unidata.ucar.edu/software/netcdf/ Benchmarking-tools: bonnie++: http://www.coker.com.au/bonnie++/ iozone: http://www.iozone.org/ BLACS:
Homepage: http://www.netlib.org/blacs/ Papers available for download: http://www.netlib.org/blacs/BLACS/Papers.html Environment Modules Project: http://modules.sourceforge.net/ MPICH2:
Homepage: http://www.mcs.anl.gov/research/projects/mpich2/ Documentation: http://www.mcs.anl.gov/research/projects/mpich2/documentation/index.php? s=docs MVAPICH: Homepage: http://mvapich.cse.ohio-state.edu/ MVAPICH2 user guide: http://mvapich.cse.ohio-state.edu/support/user_guide_mvapich2-1.2.html FAQ: http://mvapich.cse.ohio-state.edu/support/faq.shtml OpenMPI:
Related documentation and important links Homepage: http://www.netlib.org/scalapack/ Scalapack User forum: http://icl.cs.utk.edu/lapack-forum/ Linpack: http://www.netlib.org/linpack/
Technical support
Contact Platform Computing for technical support in one of these ways:
Web Portal eSupport You can take advantage of our web-based self-support available 24 hours per day, 7 days a week (24x7) by visiting http://my.platform.com. The Platform eSupport and Support Knowledgebase site enables you to search for solutions, submit your support request, update your request, enquire about your request, as well as download product manuals, binaries and patches. Email Support: support@platform.com Telephone Support: contact information available at http://www.platform.com/services/support