Additional information for users of the nci-lp-dragen partition

The nci-lp-dragen partition as of June 2023 includes one dragen server configured to run the TSO500 ctDNA pipeline. It has been funded by NCI/CCR/Lab of Pathology until 30-Sep-2026.

Notes:

Changelog:

Running a TSO500 ctDNA 2.6.0 batch job
back to top

Create a batch script similar to the following:

#! /bin/bash
# this is dragen.sh

# set up paths etc
source /etc/profile.d/edico.sh

RUNPATH=/path/to/runs
RUNFOLDER=NNNNNN_XXXXXX_NNNN_XXXXXXXXXX
ANALYSIS="/staging/${RUNFOLDER}-ctDNA-$(date +%s)"
METRICS=${ANALYSIS}/Results/MetricsOutput.tsv
RESULTPATH=/path/to/results

# clean up after run
trap 'rm -rf "/staging/${RUNFOLDER}" "${ANALYSIS}"' EXIT

cp -r "${RUNPATH}/${RUNFOLDER}" /staging || exit 100
mkdir -p "${ANALYSIS}" || exit 101

# may need to specify --sampleSheet if not in the default location
DRAGEN_TSO500_CTDNA-2.6.0.sh --analysisFolder "${ANALYSIS}" \
    --runFolder "/staging/${RUNFOLDER}" \
2>&1 | tee "${ANALYSIS}/test.out" || exit 102

[[ -f "${METRICS}" ]] || printf "Missing metrics file\n"
grep COMPLETED_ALL_STEPS "${METRICS}"

# copy results back to working directory
cp -r "${ANALYSIS}" "${RESULTPATH}" || exit 103

And submit with

[user@biowulf]$ sbatch --mem=0 --cpus-per-task=64 --partition nci-lp-dragen --qos=compass dragen.sh
12345678

if you have access to the compass QOS which increases priority and max. walltime. Users without access to the QOS are limited to 8h max walltime.

Note that the $ANALYSIS folder is lager than the input with Logs_Intermediates taking up most the space. The script above could be modified to only transfer a subset of files back to shared storage.

Running a TSO500 ctDNA 2.1.1 batch job
back to top

Create a batch script similar to the following:

#! /bin/bash
# this is dragen.sh

# set up paths etc
source /etc/profile.d/edico.sh

RUNPATH=/path/to/runs
RUNFOLDER=NNNNNN_XXXXXX_NNNN_XXXXXXXXXX
ANALYSIS="/staging/${RUNFOLDER}-ctDNA-$(date +%s)"
METRICS=${ANALYSIS}/Results/MetricsOutput.tsv
RESULTPATH=/path/to/results

# clean up after run
trap 'rm -rf "/staging/${RUNFOLDER}" "${ANALYSIS}"' EXIT

cp -r "${RUNPATH}/${RUNFOLDER}" /staging || exit 100
mkdir -p "${ANALYSIS}" || exit 101

# may need to specify --sampleSheet if not in the default location
DRAGEN_TruSight_Oncology_500_ctDNA.sh --analysisFolder "${ANALYSIS}" \
    --runFolder "/staging/${RUNFOLDER}" \
2>&1 | tee "${ANALYSIS}/test.out" || exit 102

[[ -f "${METRICS}" ]] || printf "Missing metrics file\n"
grep COMPLETED_ALL_STEPS "${METRICS}"

# copy results back to working directory
cp -r "${ANALYSIS}" "${RESULTPATH}" || exit 103

And submit with

[user@biowulf]$ sbatch --mem=0 --cpus-per-task=64 --partition nci-lp-dragen --qos=compass dragen.sh
12345678

if you have access to the compass QOS which increases priority and max. walltime. Users without access to the QOS are limited to 8h max walltime.

Note that the $ANALYSIS folder is lager than the input with Logs_Intermediates taking up most the space. The script above could be modified to only transfer a subset of files back to shared storage.


Please send questions and comments to staff@hpc.nih.gov