PHASE reconstructs haplotypes from population genotype data using a Bayesian statistical model that accounts for the decay of linkage disequilibrium (LD) with distance due to recombination. Inputs can include biallelic SNPs as well as multi-allelic loci such as SNPs with more than two alleles, HLA alleles, or microsatellites.


Interactive job
Interactive jobs should be used for debugging, graphics, or applications that cannot be run as batch jobs.

Allocate an interactive session, load the phase module, and analyze a simple test data set, running 1000 iterations:

[user@biowulf]$ sinteractive --gres=lscratch:5
[user@cn3114]$ module load phase/2.1.1
[user@cn3114]$ cd /lscratch/$SLURM_JOB_ID
[user@cn3114]$ cp $PHASE_TEST_DATA/test.inp .
[user@cn3114]$ PHASE test.inp test.out 1000
[user@cn3114]$ ls -1 test.out*

[user@cn3114]$ exit
salloc.exe: Relinquishing job allocation 46116226

The input format and options are described in the manual.

Batch job
Most jobs should be run as batch jobs.

Create a batch script that runs PHASE on your input file. For example:

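#!/bin/bash

# Load the PHASE module; abort the job if it cannot be loaded
module load phase/2.1.1 || exit 1
# Run PHASE on the file 'input' for 1000 iterations, writing results to 'output'
PHASE input output 1000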

Submit this job using the Slurm sbatch command.
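
A submission might look like the following minimal sketch. The script name phase.sh is hypothetical (the batch file is not named above), and the resource values are illustrative assumptions that mirror the swarm example (1 CPU, 4 GB of memory); adjust them to your data. The guard only makes the snippet safe to run on a machine without Slurm.

```shell
# Hypothetical batch script name; substitute the file you created above.
batch_script=phase.sh
# Illustrative resource requests (assumptions, not PHASE requirements).
cpus=1
mem=4g

if command -v sbatch >/dev/null 2>&1; then
    # On the cluster: submit the job; sbatch prints the job ID.
    sbatch --cpus-per-task="$cpus" --mem="$mem" "$batch_script"
else
    # Elsewhere: only show the command that would be run.
    echo "sbatch not found; on the cluster run:"
    echo "sbatch --cpus-per-task=$cpus --mem=$mem $batch_script"
fi
```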

Swarm of Jobs
A swarm of jobs is an easy way to submit a set of independent commands requiring identical resources.

Create a swarmfile (e.g. phase.swarm). For example:

PHASE -X10 input1 output1
PHASE -X10 input2 output2
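
With many input files, a swarmfile like the one above can be generated rather than typed by hand. The following is a minimal sketch, not part of PHASE or swarm: the helper name make_swarmfile, the .inp/.out naming scheme, and the placeholder files created for the demonstration are all assumptions.

```shell
# make_swarmfile DIR SWARMFILE  (hypothetical helper)
# Write one "PHASE -X10 <input> <output>" line per *.inp file in DIR.
make_swarmfile() {
    local dir=$1 swarmfile=$2 inp
    : > "$swarmfile"                    # start with an empty swarmfile
    for inp in "$dir"/*.inp; do
        [ -e "$inp" ] || continue       # skip if the glob matched nothing
        # Output name: same path with .inp replaced by .out (assumed scheme)
        printf 'PHASE -X10 %s %s\n' "$inp" "${inp%.inp}.out" >> "$swarmfile"
    done
}

# Demonstration with two empty placeholder inputs in a scratch directory
demo_dir=$(mktemp -d)
touch "$demo_dir/input1.inp" "$demo_dir/input2.inp"
make_swarmfile "$demo_dir" "$demo_dir/phase.swarm"
cat "$demo_dir/phase.swarm"
```

Each generated line is an independent command, so the resulting file can be passed to swarm in the same way as a hand-written one.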

Submit this job using the swarm command.

swarm -f phase.swarm -g 4 -t 1 -p 2 --module phase/2.1.1
where
-g #            Number of gigabytes of memory required for each process (1 line in the swarm command file)
-t #            Number of threads/CPUs required for each process (1 line in the swarm command file)
--module phase  Loads the phase module for each subjob in the swarm