AncestryMap is a software package that allows finding skews in ancestry that are potentially associated with disease genes in recently mixed populations.
Allocate an interactive session and run the program.
Sample session (user input in bold):
[user@biowulf]$ sinteractive salloc.exe: Pending job allocation 46116226 salloc.exe: job 46116226 queued and waiting for resources salloc.exe: job 46116226 has been allocated resources salloc.exe: Granted job allocation 46116226 salloc.exe: Waiting for resource configuration salloc.exe: Nodes cn3144 are ready for job [user@cn3144 ~]$ module load AncestryMap [+] Loading AncestryMap 6210 ... [user@cn3144 examples]$ cp $AMT_DATA/* .Executable ancestrymap can be run by passing it a parameter file with command line option -p:
[user@cn3144 examples]$ ancestrymap -p param0
parameter file: param0
output: (null)
### THE INPUT PARAMETERS
PARAMETER NAME: VALUE
risk: 1.5
indivname: indiv.dat
snpname: snpcnts
genotypename: geno.dat
tlreest: YES
seed: 1011
splittau: YES
fancyxtheta: YES
checkit: YES
details: YES
numburn: 0
numiters: 0
emiter: 10
dotoysim: NO
cleaninit: YES
reestiter: 5
indoutfilename: indjunk
snpoutfilename: snpjunk
## ANCESTRYMAP version: 6210
###GENETIC DISTANCE FOR ALL CHROMOSOMES
##Chr_Num: chromosome num, First_SNP and Last_SNP: First and last markers, Gen_dist: Genetic distance
Chr_Num First_SNP Last_SNP Gen_dist
chrom: 1 first: 0 last: 478 distance: 2.834
chrom: 2 first: 479 last: 910 distance: 2.647
chrom: 3 first: 911 last: 1276 distance: 2.227
...
chrom: 20 first: 5330 last: 5492 distance: 1.067
chrom: 21 first: 5493 last: 5586 distance: 0.604
chrom: 22 first: 5587 last: 5699 distance: 0.711
chrom: 23 first: 5700 last: 5902 distance: 1.180
total distance: 36.242
calling setstatus
emiter: 10
reestiter: 5
trashdir: /var/tmp
###HETXCHECK RESULTS BEGIN
## SNP_ID NUM_HET NUM_HOMOZY
hetxcheck rs6530109 0 600
hetxcheck rs2128516 0 600
...
hetxcheck rs10127175 0 600
hetxcheck rs5945413 0 0
hetxcheck rs884840 0 600
###COUNTS
Num of fake Markers: 3622 Num of real Markers: 2281 Spacing between fake markers: 0.010
Num of Markers: 5903 Num of Samples: 1201
Num of Cases: 601 Num of Controls: 600 Num of Ignored Samples: 0
dup? toyindiv:539 toyindiv:1200
match: 2260 mismatch: 0 2260 2260
dup. toyindiv:539 ignored
### CHECKGENO RESULTS FOLLOW:
Num good genotypes: 2714260 Num bad genotypes: 0
###PHYSCHECK RESULTS FOLLOW:
## SNP1_ID SNP2_ID SNP1_GEN_POS SNP2_GEN_POS SNP1_PHYS_POS SNP2_PHYS_POS
###HWCHECK RESULTS FOLLOW:
## SNP_ID SNP_INDEX CHR_NUM HW_SCORE
hwcheck rs819980 0 1 -1.595
hwcheck rs10907185 1 1 0.376
hwcheck rs897634 4 1 -1.861
...
hwcheck rs4824056 5694 22 -2.308
hwcheck rs138823 5695 22 -2.112
hwcheck rs6520141 5696 22 -1.517
hwcheck rs8142477 5698 22 -2.327
hwcheck rs140514 5699 22 -0.021
hwstats (chrom 1) ave: -1.110 sigma: -15.543
hwstats (chrom 2) ave: -1.330 sigma: -17.183
hwstats (chrom 3) ave: -1.278 sigma: -15.286
hwstats (chrom 4) ave: -1.003 sigma: -11.304
hwstats (chrom 5) ave: -1.322 sigma: -15.698
hwstats (chrom 6) ave: -1.071 sigma: -11.536
hwstats (chrom 7) ave: -1.088 sigma: -12.016
hwstats (chrom 8) ave: -1.157 sigma: -12.247
[user@cn3144 ~]$ exit
salloc.exe: Relinquishing job allocation 46116226
[user@biowulf ~]$
Create a batch input file (e.g. AncestryMap.sh). For example:
#!/bin/bash # this file is called admix.sh module load AncestryMap cp $AMT_DATA/* . ancestrymap -p param0 > data0.out ancestrymap -p param1 > data1.out ancestrymap -p param2 > data2.out
Submit this job using the Slurm sbatch command.
sbatch [--mem=#] AncestryMap.sh
Create a swarmfile (e.g. AncestryMap.swarm). For example:
cp $AMT_DATA/* . ancestrymap -p param0 > data0.out ancestrymap -p param1 > data1.out ancestrymap -p param2 > data2.out
Submit this job using the swarm command.
swarm -f AncestryMap.swarm [-g #] --module AncestryMapwhere
| -g # | Number of Gigabytes of memory required for each process (1 line in the swarm command file) |
| --module AncestryMap | Loads the AncestryMap module for each subjob in the swarm |