If you wish to try and perform the steps illustrated in this tutorial, you’ll need to follow all the instructions below to setup the
If you have any questions or issues regarding any of the below instructions, please feel free to attend Data Science office hours at the Zoom link below:
https://uab.zoom.us/meeting/register/tZ0rduCuqzotGtHShsawHLyRROqH3Sdz71mf#/registration
The Cheaha supercomputer is a resource offered by Research Computing to all members of UAB and is a very useful tool for analyzing large datasets.
Please follow the documentation produced by UAB’s Research Computing Team available here in order to create an account to access Cheaha.
Once your Cheaha account is successfully created, make sure to test that you can use the Interactive File System and Terminal available on UAB Research Computing’s OnDemand Application. In order to do that, follow the below steps:
Files
button/scratch/<blazer_id>/
where blazer_id
will be your blazer idOpen in Terminal
button in the upper right of the pagePerforming those steps will ensure that your account has been setup correctly.
In order to setup the environment to run the pipeline on Cheaha we will be using a tool called Anaconda. Anaconda is a package manager and allows for easy and quick installation of tools. We will be using Anaconda to install the tools needed for the initial parts of the workshop.
Files
button/scratch/<blazer_id>/
where blazer_id
will be your blazer idOpen in Terminal
button in the upper right of the pagemodule load Anaconda3
conda create -p $USER_SCRATCH/conda_envs/nfcore_workshop python=3.12 bioconda::nf-core bioconda::nextflow
Type y
when prompted
(Proceed ([y]/n)?
).
This command will create a new conda enviroment called
nfcore_workshop
in your scratch space on Cheaha and under a
sub-directory called conda_envs
. It is important to name
your environments as something intuitive to help you remember their
purpose.
conda activate $USER_SCRATCH/conda_envs/nfcore_workshop
conda deactivate
Globus is a web-based tool that can be used for large data transfers between different locations.
As a member of UAB, you are able to login using your blazer id. In order to login to Globus using UAB, follow the steps below:
The data being used for the workshop comes from the paper below:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6096346/
For this workshop, the data has been pre-downloaded and packaged into a Globus endpoint for ease of transfer. The link for the Globus endpoint can be found here
If you have never logged into the Globus account, please see the instructions here
In order to download the data, perform the following steps:
/scratch/your_blazerid
where your_blazerid
is your blazer idnfcore_workshop
.
The folders will now be transferred to your scratch space on Cheaha. Following this process will match the directory structure that will be used for the first portion of the workshop.
In order to use Visual Studio Code (VSCode) with Cheaha, please follow the instructions created by UAB’s Research Computing team below:
https://docs.rc.uab.edu/cheaha/open_ondemand/hpc_desktop/#visual-studio-code-remote-tunnel