Tombo is a suite of tools primarily for the identification of modified nucleotides from nanopore sequencing data. Tombo also provides tools for the analysis and visualization of raw nanopore signal. Features:
$TOMBO_TEST_DATA
/fdb/igenomes/
Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive --mem=5g --cpus-per-task=4 --gres=lscratch:10 salloc.exe: Pending job allocation 46116226 salloc.exe: job 46116226 queued and waiting for resources salloc.exe: job 46116226 has been allocated resources salloc.exe: Granted job allocation 46116226 salloc.exe: Waiting for resource configuration salloc.exe: Nodes cn3144 are ready for job [user@cn3144]$ cd /lscratch/$SLURM_JOB_ID [user@cn3144]$ module load tombo [user@cn3144]$ tar xvfz ${TOMBO_TEST_DATA}/${GZFILE} . [user@cn3144]$ export REF="/fdb/igenomes/Saccharomyces_cerevisiae/UCSC/sacCer3/Sequence/WholeGenomeFasta/genome.fa" [user@cn3144]$ tombo resquiggle $FAST5DIR $REF --processes $SLURM_CPUS_PER_TASK --num-most-common-errors 5 --overwrite [18:08:23] Loading minimap2 reference. [18:08:24] Getting file list. [18:08:24] Loading default canonical ***** DNA ***** model. [18:08:24] Re-squiggling reads (raw signal to genomic sequence alignment). 5 most common unsuccessful read types (approx. %): 2.7% ( 93 reads) : Alignment not produced 1.0% ( 35 reads) : Poor raw to expected signal matching (revert with `tombo filter clear_filters`) 0.1% ( 5 reads) : Read event to sequence alignment extends beyond bandwidth ----- ----- 100%|##################################################################| 3621/3621 [05:49<00:00, 10.36it/s] [18:14:14] Final unsuccessful reads summary (3.7% reads unsuccessfully processed; 133 total reads): 2.6% ( 93 reads) : Alignment not produced 1.0% ( 35 reads) : Poor raw to expected signal matching (revert with `tombo filter clear_filters`) 0.1% ( 5 reads) : Read event to sequence alignment extends beyond bandwidth [18:14:14] Saving Tombo reads index to file. [user@cn3144]$ tombo detect_modifications alternative_model --fast5-basedirs $FAST5DIR \ --statistics-file-basename native.e_coli_sample \ --alternate-bases dam dcm --processes $SLURM_CPUS_PER_TASK [18:20:03] Parsing Tombo index file(s). [18:20:03] Performing alternative model testing. [18:20:03] Performing specific alternate base(s) testing. [18:20:03] Calculating read coverage regions. [18:20:03] Calculating read coverage. [18:20:04] Performing modified base detection across genomic regions. 100%|##############################################################| 2392/2392 [01:10<00:00, 33.89it/s] [user@cn3144]$ tombo plot most_significant --fast5-basedirs $FAST5DIR \ --statistics-filename native.e_coli_sample.dcm.tombo.stats \ --plot-standard-model --plot-alternate-model dcm \ --pdf-filename sample.most_significant_dcm_sites.pdf [18:23:05] Loading statistics from file. [18:23:06] Parsing Tombo index file(s). [18:23:06] Loading default canonical ***** DNA ***** model. [18:23:06] Preparing plot data. [18:23:08] Plotting. [user@cn3144]$ tombo text_output browser_files --statistics-filename native.e_coli_sample.dam.tombo.stats \ --file-types dampened_fraction --browser-file-basename native.e_coli_sample.dam [18:23:42] Loading statistics from file. [18:23:42] Parsing and outputting statistics wiggles. [user@cn3144]$ tombo text_output browser_files --fast5-basedirs $FAST5DIR \ --file-types coverage --browser-file-basename native.e_coli_sample [18:24:23] Parsing Tombo index file(s). [18:24:23] Getting and writing coverage bedgraphs. [18:24:23] Calculating read coverage regions. [18:24:23] Calculating read coverage.
Create a batch script file (e.g. tombo.sh). For example:
#!/bin/bash cd /lscratch/$SLURM_JOB_ID module load tombo tar xvfz ${TOMBO_TEST_DATA}/${GZFILE} . export REF="/fdb/igenomes/Saccharomyces_cerevisiae/UCSC/sacCer3/Sequence/WholeGenomeFasta/genome.fa" tombo resquiggle $FAST5DIR $REF --processes $SLURM_CPUS_PER_TASK --num-most-common-errors 5 --overwrite .... ....
Submit this job using the Slurm sbatch command.
sbatch --cpus-per-task=4 --mem=10g --gres=lscratch:100 tombo.sh