Software

The HTCF starts as somewhat of a blank slate for each lab. However, this doesn't mean labs can't get up and running in a matter of minutes!

Each lab has its own dedicated space to install and manage software. This reference space is located in /ref/<lab>/software.

Software building and installation on the HTCF is primarily self-service.

Labs are free to use their /ref software directory to install software using whatever means is most comfortable.

At the lab level, use of Spack to install common software is encouraged. Virtual environments can also be used if the software is well suited.

Spack

Spack is a package management tool designed to support multiple versions and configurations of software on a wide variety of platforms and environments.

Note

Do not install software while on the login node. Please build/install software from an interactive job.

Tutorial

See the official spack tutorial

Initialization

To create a lab (or personal) instance of the spack package manager:

Follow the installation instructions in the spack documentation, placing spack in a directory under /ref/<lab_name>/software
As a convenience, if the path /ref/<lab_name>/software/spack/bin exists, it will automatically be added to $PATH on login. You may wish to make a symlink from /ref/<lab_name>/software/<my_spack_dir_name> to /ref/<lab_name>/software/spack, to take advantage of the automatic $PATH addition. Alternatively, feel free to place spack instances in other locations (and add those instance bin/ paths to your $PATH in $HOME/.bashrc).
After installing spack (and updating your .bashrc if necessary), logout and log back in. This will ensure the spack command is available in the $PATH.

Installing Software

see the official spack documentation

Warning

Please install software from within a Slurm job.

Note

Some compiling requires large amounts of RAM. Using --mem-per-cpu with >= 4G is sometimes needed.

In the case of software that requires the "qt" package, more than --mem-per-cpu=10G could be needed.

Note

When installing within a slurm job, be sure to tell Spack how many CPUs are available.

For example, after getting an interactive session:

spack install -j ${SLURM_CPUS_ON_NODE} .....

Note

Some software can take an extremely long time to install (such as qt and llvm). In these cases, an sbatch job will be needed rather than an interactive job:

#!/bin/bash
#SBATCH -c <num>
#SBATCH --mem-per-cpu=<num>G
spack install -j ${SLURM_CPUS_ON_NODE} ....

Using the software

Once spack has built software, the bash shell needs to have the proper environment variables set to access the software.

This is accomplished using the spack load command. Spack packages can be "loaded" similar to the way modules are loaded.

Given a spec, a spack command can be used to generate the appropriate environment variables to "load" spack-installed software.

To set the environment variables (similar to module load ...):

$ eval $( spack load --sh <spec> )

To unset (unload) these variables:

$ eval $( spack unload --sh <spec> )

To simply view the environment variables that would be set without actually setting them:

$ spack load --sh <spec>

See the official spack documentation for more information on specs.

These commands can be placed in an sbatch file to be used in a job.

#!/bin/bash

eval $( spack load --sh <spec> )

Example

Installing biom-format

Search for the name of the package:

$ spack list biom
==> 7 packages.
microbiomeutil  py-biomine    r-biomart   r-biomformat
py-biom-format  r-biom-utils  r-biomartr

See what versions are available:

$ spack versions py-biom-format
==> Safe versions (already checksummed):
  2.1.10  2.1.9  2.1.7  2.1.6
...

Install:

$ spack install py-biom-format@2.1.10

Load the software in a job:

#!/bin/bash

eval $( spack load --sh py-biom-format@2.1.10 )

...

What if the software I want is not available through Spack

When needed software is not readily accessible via Spack, there are a few options.

Follow the installation instructions from the software creator.

Sometimes, this can be very quick and straightforward.

Sometimes, this can be very painful.

Sometimes, it can be a good idea to pass judgement on the quality of software based on the quality of the installation process and documentation.
Create a custom spack package

Spack can be a wonderful tool for creating and maintaining software. Plenty of documentation is provided for creating and maintaining custom packages, though a firm understanding of python is needed.

R considerations

Please see the R page for more information.

Manual Installation

Sometimes it's just easier to follow the installation steps provided by software creator.

If the software depends on other software, it might be that the dependency software could be installed via spack.

For example, if a piece of software is not available via Spack, but requires samtools to be installed:

# Install samtools via spack
$ spack install samtools

# load samtools before installation
$ eval $( spack load --sh samtools )

(proceed with the manual installation)

Remember

The best place to install software is in reference storage: /ref/<lab>/software.

After manual software installation, it's good practice to then create a module file

Manual module files

Module files that are manually created go in reference storage in /ref/<lab>/software/modules.

The lmod documentation is the best place to learn about creating module files.

What about conda?

Feel free to use conda if it is required/preferred.

The HTCF does not handle Conda support requests. For conda support, please see https://docs.conda.io/en/latest/help-support.html