Description from the GitHub repo
Evaluate genome assemblies with k-mers and more Often, genome assembly projects have illumina whole genome sequencing reads available for the assembled individual. The k-mer spectrum of this read set can be used for independently evaluating assembly quality without the need of a high quality reference. Merqury provides a set of tools for this purpose.
$MERQURY_TEST_DATA
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive --mem=10g --cpus-per-task=6 --gres=lscratch:20 salloc.exe: Pending job allocation 46116226 salloc.exe: job 46116226 queued and waiting for resources salloc.exe: job 46116226 has been allocated resources salloc.exe: Granted job allocation 46116226 salloc.exe: Waiting for resource configuration salloc.exe: Nodes cn3144 are ready for job [user@cn3144]$ cd /lscratch/$SLURM_JOB_ID [user@cn3144]$ module load merqury [user@cn3144]$ cp ${MERQURY_TEST_DATA:-none}/* . [user@cn3144]$ for f in *.tar.gz ; do tar -xzf $f ; done [user@cn3144]$ merqury.sh F1.k18.meryl col0.hapmer.meryl cvi0.hapmer.meryl athal_COL.fasta athal_CVI.fasta test [user@cn3144]$ ls test* test.athal_COL.100_20000.phased_block.bed test.athal_COL.spectra-cn.fl.png test.athal_CVI.cvi0.hapmer.wig test.athal_COL.100_20000.phased_block.blob.png test.athal_COL.spectra-cn.hist test.athal_CVI.only.hist test.athal_COL.100_20000.phased_block.counts test.athal_COL.spectra-cn.ln.png test.athal_CVI.qv test.athal_COL.100_20000.phased_block.sizes test.athal_COL.spectra-cn.st.png test.athal_CVI.sort.bed test.athal_COL.100_20000.phased_block.stats test.athal_CVI.100_20000.phased_block.bed test.athal_CVI.spectra-cn.fl.png test.athal_COL.100_20000.switch.bed test.athal_CVI.100_20000.phased_block.blob.png test.athal_CVI.spectra-cn.hist test.athal_COL.100_20000.switches.txt test.athal_CVI.100_20000.phased_block.counts test.athal_CVI.spectra-cn.ln.png test.athal_COL.block.N.png test.athal_CVI.100_20000.phased_block.sizes test.athal_CVI.spectra-cn.st.png test.athal_COL.col0.hapmer.spectra-cn.fl.png test.athal_CVI.100_20000.phased_block.stats test.completeness.stats test.athal_COL.col0.hapmer.spectra-cn.ln.png test.athal_CVI.100_20000.switch.bed test.dist_only.hist test.athal_COL.col0.hapmer.spectra-cn.st.png test.athal_CVI.100_20000.switches.txt test.hapmers.blob.png test.athal_COL.col0.hapmer.spectra-hap-cn.hist test.athal_CVI.block.N.png test.hapmers.count test.athal_COL.col0.hapmer.wig test.athal_CVI.col0.hapmer.spectra-cn.fl.png test.only.hist test.athal_COL.contig.sizes test.athal_CVI.col0.hapmer.spectra-cn.ln.png test.qv test.athal_COL.continuity.N.png test.athal_CVI.col0.hapmer.spectra-cn.st.png test.spectra-asm.fl.png test.athal_COL.cvi0.hapmer.spectra-cn.fl.png test.athal_CVI.col0.hapmer.spectra-hap-cn.hist test.spectra-asm.hist test.athal_COL.cvi0.hapmer.spectra-cn.ln.png test.athal_CVI.col0.hapmer.wig test.spectra-asm.ln.png test.athal_COL.cvi0.hapmer.spectra-cn.st.png test.athal_CVI.contig.sizes test.spectra-asm.st.png test.athal_COL.cvi0.hapmer.spectra-hap-cn.hist test.athal_CVI.continuity.N.png test.spectra-cn.fl.png test.athal_COL.cvi0.hapmer.wig test.athal_CVI.cvi0.hapmer.spectra-cn.fl.png test.spectra-cn.hist test.athal_COL.only.hist test.athal_CVI.cvi0.hapmer.spectra-cn.ln.png test.spectra-cn.ln.png test.athal_COL.qv test.athal_CVI.cvi0.hapmer.spectra-cn.st.png test.spectra-cn.st.png test.athal_COL.sort.bed test.athal_CVI.cvi0.hapmer.spectra-hap-cn.hist [user@cn3144]$ exit salloc.exe: Relinquishing job allocation 46116226 [user@biowulf]$
Create a batch input file (e.g. merqury.sh), which uses the test data in $MERQURY_TEST_DATA
. For example:
#!/bin/bash module load merqury/1.3 cp ${MERQURY_TEST_DATA:-none}/* . for f in *.tar.gz ; do tar -xzf $f ; done merqury.sh F1.k18.meryl col0.hapmer.meryl cvi0.hapmer.meryl athal_COL.fasta athal_CVI.fasta test
Submit this job using the Slurm sbatch command.
sbatch --cpus-per-task=6 --mem=10g merqury.sh