High-Performance Computing at the NIH
@nih_hpc RSS Feed
Presto on NIH HPC Systems

pRESTO is a toolkit for processing raw reads from high-throughput sequencing of B cell and T cell repertoires.

Batch job on Biowulf

Create a batch input file (e.g. script.sh). For example:

#!/bin/bash
module load presto

cd /data/$USER/dir
presto command
......

Then submit the file on biowulf

biowulf> $ sbatch script.sh

For more information regarding sbatch command : https://hpc.nih.gov/docs/userguide.html#submit

Swarm of Jobs on Biowulf

Create a swarmfile (e.g. script.swarm). For example:

# this file is called script.swarm
cd dir1;presto command 1; presto command 2
cd dir2;presto command 1; presto command 2
cd dir3;presto command 1; presto command 2
[...]

Submit this job using the swarm command.

$ swarm -f script.swarm --module presto

For more information regarding swarm: https://hpc.nih.gov/apps/swarm.html#usage

Interactive job on Biowulf

Allocate an interactive session. Sample session:

[biowulf ~]$ sinteractive --mem=5g
salloc.exe: Pending job allocation 15194042
salloc.exe: job 15194042 queued and waiting for resources
salloc.exe: job 15194042 has been allocated resources
salloc.exe: Granted job allocation 15194042
salloc.exe: Waiting for resource configuration
salloc.exe: Nodes cn1719 are ready for job

[cn1719 ~]$ module load presto

[cn1719 ~]$ presto command
Documentation
https://presto.readthedocs.io/en/version-0.5.4---estimateerror-fix/