Biowulf High Performance Computing at the NIH
VCF-kit: assorted utilities for the variant call format

VCF-kit is a collection of utility tools for processing and analyzing the VCF (variant call format) files, including primer generation for variant validation, dendrogram production,genotype imputation from sequence data in linkage studies, and additional tools to be used by statistical and population geneticists.

References:

Documentation
Important Notes

Interactive job
Interactive jobs should be used for debugging, graphics, or applications that cannot be run as batch jobs.

Allocate an interactive session and run the program. Sample session:

[user@biowulf]$ sinteractive --mem=4g 
[user@@cn3316 ~]$module load VCF-kit
[user@@cn3316 ~]$git clone https://github.com/AndersenLab/VCF-kit.git
[user@@cn3316 ~]$vk -h
usage:
  vk  [...]
  vk setup
  vk -h | --help
  vk --version

commands:
  calc
  call
  filter
  geno
  genome
  hmm
  phylo
  primer
  rename
  tajima
  vcf2tsv
[user@@cn3316 ~]$vk calc -h
usage:
  vk calc sample_hom_gt 
  vk calc genotypes [--frequency] 
  vk calc spectrum 

Example

options:
  -h --help                   Show this screen.
  --version                   Show version.
[user@@cn3316 ~]$vk calc genotypes VCF-kit/test_data/test.vcf.gz
n       ref     het     alt     mis
937     14      0       0       0
328     13      0       0       1
242     13      0       1       0
168     12      0       0       2
101     11      0       0       3
94      12      0       1       1
89      12      0       2       0
73      10      0       0       4
62      11      0       3       0
46      10      0       4       0
37      11      0       1       2
37      8       0       0       6
36      9       0       0       5
33      11      0       2       1
31      10      0       1       3
29      10      0       3       1
28      9       0       4       1
25      10      0       2       2
...
1       2       0       11      1
1       5       6       0       3
1       1       0       9       4
1       9       2       1       2
[user@@cn3316 ~]$vk calc genotypes VCF-kit/test_data/QX1211.indels.vcf.gz
n       ref     het     alt     mis
74493   0       0       1       0
8567    0       1       0       0
End the interactive session:
[user@cnR3316 ~]$ exit
salloc.exe: Relinquishing job allocation 46116226
[user@biowulf ~]$