Tractor is a statistical framework and software package to facilitate the inclusion of admixed individuals in association studies by leveraging local ancestry. It generates accurate ancestry-specific effect-size estimates and P values, can boost genome-wide association study (GWAS) power and improves the resolution of association signals
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive
[user@cn0799 ~]$ module load Tractor
[+] Loading java 17.0.3.1  ...
[+] Loading singularity  4.2.2  on cn0799
[+] Loading nextflow  24.10.3
[+] Loading Tractor 1.4.0  ...
[user@cn0799 ~]$ ls $TRACTOR_BIN
flare      phase_common_static  rfmix     simulate_static  xcftools_static
gnomix.py  phase_rare_static    simulate  switch_static
[user@cn0799 ~]$ cp -r $TRACTOR_DATA . 
[user@cn0799 ~]$ tree test_data
test_data
├── admixed_cohort
│   ├── ASW.deconvoluted.fb.tsv
│   ├── ASW.deconvoluted.msp.tsv
│   ├── ASW.deconvoluted.rfmix.Q
│   ├── ASW.deconvoluted.sis.tsv
│   ├── ASW.phased.vcf.gz
│   └── ASW.phased.vcf.gz.csi
├── phenotype
│   ├── Phe_linear_covars.txt
│   ├── Phe_linear.txt
│   └── Phe_logistic.txt
└── references
    ├── chr22.b37.gmap.gz
    ├── chr22.genetic_map.modified.txt
    ├── TGP_HGDP_QC_hg19_chr22.vcf.gz
    ├── TGP_HGDP_QC_hg19_chr22.vcf.gz.csi
    └── YRI_GBR_samplemap.txt
3 directories, 14 files
[user@cn0799 ~]$ phase_common_static    \
                        --input test_data/admixed_cohort/ASW.phased.vcf.gz  \
                        --reference test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz \
                        --region 22   \
                        --map test_data/references/chr22.b37.gmap.gz    \
                        --output test_data/admixed_cohort/ASW.phased.bcf 
[SHAPEIT5] phase_common (jointly phase multiple common markers)
  * Author        : Olivier DELANEAU, University of Lausanne
  * Contact       : olivier.delaneau@gmail.com
  * Version       : 5.1.1 / commit = 990ed0d / release = 2023-05-08
  * Run date      : 29/05/2025 - 11:19:37
Files:
  * Input         : [admixed_cohort/ASW.phased.vcf.gz]
  * Reference     : [test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz]
  * Genetic Map   : [test_data/references/chr22.b37.gmap.gz]
  * Output        : [test_data/admixed_cohort/ASW.phased.bcf]
  * Output format : [bcf]
Parameters:
  * Seed    : 15052011
  * Threads : 8 threads
  * MCMC    : 15 iterations [5b + 1p + 1b + 1p + 1b + 1p + 5m]
  * PBWT    : [window = 4cM / depth = auto / modulo = auto / mac = 5 / missing = 0.1]
  * HMM     : [window = 4cM / Ne = 15000 / Recombination rates given by genetic map]
Reading genotype data:
[W::hts_idx_load3] The index file is older than the data file: test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz.csi
  * VCF/BCF scanning done (9.04s)
      + Variants [#sites=182525 / region=22]
         - 11249 sites removed in reference panel [not in main panel]
      + Samples [#target=61 / #reference=1572]
[W::hts_idx_load3] The index file is older than the data file: test_data/references/TGP_HGDP_QC_hg19_chr22.vcf.gz.csi
  * VCF/BCF parsing done (10.94s)
      + Genotypes [0/0=76.122%, 0/1=16.366%, 1/1=7.512%, ./.=0.000%, 0|1=0.000%]
      + Reference haplotypes [0=85.090%, 1=14.910%]
Setting up genetic map:
  * GMAP parsing [n=45329] (0.02s)
  * cM interpolation [s=37946 / i=144579] (0.02s)
  * Region length [34861020 bp / 72.8 cM]
  * HMM parameters [Ne=15000 / Error=0.0001 / #rare=8659]
Initializing data structures:
  * Impute monomorphic [n=330498] (0.01s)
  * HAP update (0.04s)
  * H2V transpose (0.80s)
  * PBWT parameters auto setting : [modulo = 0.045 / depth = 6]
  * PBWT initialization [#eval=173447 / #select=1550 / #chunk=11] (0.01s)
  * PBWT phasing sweep (1.72s)
  * Build genotype graphs [seg=607426] (0.11s)
Burn-in iteration [1/5]
  * PBWT selection (0.90s)
  * HMM computations [K=716.5+/-235.1 / W=3.54Mb / US=0 / UP=0] (36.17s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.06s)
Burn-in iteration [2/5]
  * PBWT selection (0.90s)
  * HMM computations [K=635.5+/-222.1 / W=3.52Mb / US=0 / UP=0] (32.56s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.04s)
Burn-in iteration [3/5]
  * PBWT selection (0.90s)
  * HMM computations [K=629.1+/-207.3 / W=3.47Mb / US=0 / UP=0] (31.67s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.05s)
Burn-in iteration [4/5]
  * PBWT selection (0.90s)
  * HMM computations [K=645.1+/-227.3 / W=3.57Mb / US=0 / UP=0] (33.06s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.06s)
Burn-in iteration [5/5]
  * PBWT selection (0.90s)
  * HMM computations [K=631.5+/-212.7 / W=3.50Mb / US=0 / UP=0] (31.98s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.06s)
Pruning iteration [1/1]
  * PBWT selection (0.91s)
  * HMM computations [K=648.8+/-219.8 / W=3.61Mb / US=0 / UP=0] (34.58s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.04s)
  * Trimming [pc=43.57%]
Burn-in iteration [1/1]
  * PBWT selection (0.90s)
  * HMM computations [K=632.6+/-210.8 / W=3.50Mb / US=0 / UP=0] (29.10s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.05s)
Pruning iteration [1/1]
  * PBWT selection (0.90s)
  * HMM computations [K=642.6+/-213.5 / W=3.57Mb / US=0 / UP=0] (30.67s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.05s)
  * Trimming [pc=68.12%]
Burn-in iteration [1/1]
  * PBWT selection (0.87s)
  * HMM computations [K=635.6+/-225.6 / W=3.55Mb / US=0 / UP=0] (28.82s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.04s)
Pruning iteration [1/1]
  * PBWT selection (0.90s)
  * HMM computations [K=650.4+/-232.0 / W=3.66Mb / US=0 / UP=0] (29.86s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.05s)
  * Trimming [pc=80.64%]
Main iteration [1/5]
  * PBWT selection (0.90s)
  * HMM computations [K=619.3+/-210.4 / W=3.44Mb / US=0 / UP=0] (27.01s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.04s)
Main iteration [2/5]
  * PBWT selection (0.90s)
  * HMM computations [K=623.0+/-213.4 / W=3.48Mb / US=0 / UP=0] (27.55s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.05s)
Main iteration [3/5]
  * PBWT selection (0.90s)
  * HMM computations [K=633.2+/-209.4 / W=3.51Mb / US=0 / UP=0] (27.49s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.05s)
Main iteration [4/5]
  * PBWT selection (0.90s)
  * HMM computations [K=638.1+/-219.3 / W=3.59Mb / US=0 / UP=0] (28.06s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.06s)
Main iteration [5/5]
  * PBWT selection (0.90s)
  * HMM computations [K=627.7+/-213.9 / W=3.49Mb / US=0 / UP=0] (27.20s)
  * IBD2 tracks [#inds=0 / #tracks=0 / #merged = 0]
  * HAP update (0.03s)
  * H2V transpose (0.03s)
Finalization:
  * HAP solving (0.06s)
  * HAP update (0.03s)
  * H2V transpose (0.03s)
  * VCF/BCF writing [N=61 / L=182525] (0.52s)
  * Indexing files
  * Total running time = 493 seconds
[user@cn0799 ~]$ exit
salloc.exe: Relinquishing job allocation 46116226
[user@biowulf ~]$