CoverageMaster (CoM) is a copy number variation (CNV) calling algorithm i ased on depth-of-coverage maps designed to detect CNVs of any size i n exome [whole exome sequencing (WES)] and genome [whole genome sequencing (WGS)] data. The core of the algorithm is the compression of sequencing coverage data in a multiscale Wavelet space and the analysis through an iterative Hidden Markov Model.
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive -c 16 --mem 45g --gres=lscratch:20
[user@cn3144 ~]$ module load coverageMaster
[+] Loading singularity 3.8.5-1 on cn3144
[+] Loading coverageMaster 20220706 ...
[user@cn3144 ~]$ coverageMaster.py -h
Usage: coverageMaster.py [options] <cov_file> <stats_file> <gene list(file or comma separated gene names)|region(chr:start-end)> -r <reference.cov> -o <output_px>
Options:
-h, --help show this help message and exit
-c CONTROL, --control=CONTROL
<optional> txt file with a control file per line (.cov
with .report.txt in same folder)
-s SINGLE_CONTROL, --single-control=SINGLE_CONTROL
<optional> single control file (.cov with .report.txt
in same folder)
-r REF, --ref=REF reference file
-o OUTPUT_PX, --out=OUTPUT_PX
output prefix
-g CGD, --cgd=CGD <optional> clinical genomic database
-f, --force <optional> force output
-e EXONS, --exons=EXONS
<optional> n. of extra exons
-x OFFSET, --offset=OFFSET
<optional> offset to ref
-d WID, --width=WID <optional> d*std
-l LEV, --level=LEV <optional> max wavelet level
-m MINLEV, --minlevel=MINLEV
<optional> min wavelet level
-w, --wig <optional> write wig
-b, --bed <optional> BED input
-k DGV, --dgv=DGV <optional> DGV file
[user@cn3144 ~]$ git clone https://github.com/fredsanto/coverageMaster
[user@cn3144 ~]$ cd coverageMaster/DEMO
[user@cn3144 ~]$ gene=PGM1 && co=control.PGM1.cov && python ../coverageMaster.py test.PGM1.cov test.PGM1.report.txt $gene -s $co -r ref.PGM1 -o test.PGM1
Fri Sep 9 04:04:48 2022 - --------------------------------------------------------------------------------
CoverageMaster is warming up
Fri Sep 9 04:04:48 2022 - Query index created
Fri Sep 9 04:04:48 2022 - Reference index created
Fri Sep 9 04:04:48 2022 - Processing Gene: XIST - Region chrX:73040485-73072588
chrX not in my list - skipped
/opt/conda/envs/coveragemaster/lib/python3.6/site-packages/numpy/core/fromnumeric.py:3118: RuntimeWarning: Mean of empty slice.
out=out, **kwargs)
/opt/conda/envs/coveragemaster/lib/python3.6/site-packages/numpy/core/_methods.py:85: RuntimeWarning: invalid value encountered in double_scalars
ret = ret.dtype.type(ret / rcount)
Fri Sep 9 04:04:48 2022 - Control index created
Fri Sep 9 04:04:48 2022 - Processing Gene: XIST - Region chrX:73040485-73072588
chrX not in my list - skipped
Fri Sep 9 04:04:48 2022 - Loop with control control.PGM1.cov
Fri Sep 9 04:04:48 2022 - Processing Gene: PGM1 - Region chr1:64058946-64125916
Fri Sep 9 04:04:48 2022 - Processing Gene: PGM1 - Region chr1:64058946-64125916
Fri Sep 9 04:04:48 2022 - PGM1:Zooming 4
Fri Sep 9 04:04:48 2022 - PGM1:Zooming 3
Fri Sep 9 04:04:48 2022 - PGM1:Zooming 2
Fri Sep 9 04:04:48 2022 - PGM1:Zooming 1
user@cn3144 ~]$ exit
salloc.exe: Relinquishing job allocation 46116226
[user@biowulf ~]$