DNA shearing is a crucial first step in most NGS protocols for Illumina. Enzymatic fragmentation has shown in recent years to be a cost and time effective alternative to physical shearing (i.e. sonication). We discovered that enzymatic fragmentation leads to unexpected alteration of the original DNA source material. We provide fade as a method of identification and removal of enymatic fragmentation artifacts.
Allocate an interactive session and run the program.
Sample session (user input in bold):
Please note cellbender is compute intensive and needs sufficient memory allocation
[user@biowulf ~]$ sinteractive --mem=20g salloc: Pending job allocation 13632637 salloc: job 13632637 queued and waiting for resources salloc: job 13632637 has been allocated resources salloc: Granted job allocation 13632637 salloc: Waiting for resource configuration salloc: Nodes cn4308 are ready for job [user@cn4308]$ module load fade [+] Loading fade 0.6.0 on cn4282 [+] Loading singularity 4.0.1 on cn4282 [user@cn4308]$ fade -h Fragmentase Artifact Detection and Elimination version: v0.6.0 usage: fade [subcommand] annotate: marks artifact reads in bam tags (must be done first) out: eliminates artifact from reads(may require queryname sorted bam) stats: reports extended information about artifact reads stats-clip: reports extended information about all soft-clipped reads extract: extracts artifacts into a mapped bam -h --help This help information.
Create a batch input file (e.g. fade.sh). For example:
#!/bin/bash module load fade fade annotate -b sam1.bam ref.fa > sam1.anno.bam
Submit this job using the Slurm sbatch command.
sbatch [--cpus-per-task=#] [--mem=#] fade.sh