Scramble: a tool for mobile element insertion detection
Scramble is a mobile element insertion (MEI) detection tool. It identifies clusters of soft-clipped reads in a BAM file, builds consensus sequences, aligns them to representative L1Ta, AluYa5, and SVA-E sequences, and outputs MEI calls.
Mobile element insertion detection in 89,874 clinical exomes
Genetics in Medicine (2020).
Important Notes
- Module Name: Scramble (see the modules page for more information)
- Environment variables set:
  - SCRAMBLE_HOME: Scramble installation directory
  - SCRAMBLE_BIN: Scramble executable directory
  - SCRAMBLE_DATA: Scramble sample data directory
- If you are using your own reference file, make sure you generate the *.nhr, *.nin, and *.nsq files using makeblastdb as follows:
module load ncbi-toolkit
makeblastdb -in file.fasta -input_type fasta -dbtype nucl
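Before pointing Scramble at a custom reference, it can help to confirm that the three index files are actually present. The following is a minimal sketch, not part of Scramble: the function name check_blast_index is hypothetical, and it assumes makeblastdb was run without -out, so that the index files sit next to the FASTA file and share its name (file.fasta.nhr and so on). Very large references may be split into numbered volumes, which this check does not handle.

```shell
# Hypothetical helper (not part of Scramble or BLAST+): verify that the
# nucleotide index files expected after makeblastdb exist next to the FASTA.
# The body runs in a subshell so its variables do not leak into the caller.
check_blast_index() (
    fasta="$1"
    missing=0
    for ext in nhr nin nsq; do
        if [ ! -e "${fasta}.${ext}" ]; then
            echo "missing: ${fasta}.${ext}"
            missing=1
        fi
    done
    # Exit status 0 only when all three files were found
    [ "$missing" -eq 0 ]
)
```

For example, `check_blast_index file.fasta || echo "run makeblastdb first"` prints one line per missing file followed by the reminder.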
Interactive job
Interactive jobs should be used for debugging, graphics, or applications that cannot be run as batch jobs.
Allocate an interactive session and run the program. Sample session:
[user@biowulf ~]$ sinteractive --mem=4g
salloc.exe: Pending job allocation 56730292
salloc.exe: job 56730292 queued and waiting for resources
salloc.exe: job 56730292 has been allocated resources
salloc.exe: Granted job allocation 56730292
salloc.exe: Waiting for resource configuration
salloc.exe: Nodes cn3148 are ready for job
[user@cn3148 ~]$ module load Scramble
[+] Loading singularity 3.5.3 on cn3148
[+] Loading Scramble 0.0.20190211.82c78b9 ...

Copy sample data into your current directory:
[user@cn3148 ~]$ cp $SCRAMBLE_DATA/* .

You can run Scramble in two different ways.
1) When you execute the scramble command without arguments, a new shell will be opened for you within a Singularity container:
[user@cn3148 ~]$ scramble
Singularity>

Your environment will change and you will have access to a different set of commands and executables. For example, you can run the command:
Singularity> Rscript --vanilla /app/cluster_analysis/bin/SCRAMble.R \
    --out-name ${PWD}/sample.mei.txt \
    --cluster-file ${PWD}/sample_cluster.txt \
    --install-dir /app/cluster_analysis/bin \
    --mei-refs /app/cluster_analysis/resources/MEI_consensus_seqs.fa \
    --ref /app/validation/test.fa \
    --eval-dels \
    --eval-meis \
    --no-vcf
Running sample: /gpfs/gsfs8/users/apptest2/SCRAMBLE_TEST/sample_cluster.txt
Running scramble with options:
blastRef : /app/validation/test.fa
clusterFile : /gpfs/gsfs8/users/apptest2/SCRAMBLE_TEST/sample_cluster.txt
deletions : TRUE
indelScore : 80
INSTALL.DIR : /app/cluster_analysis/bin
mei.refs : /app/cluster_analysis/resources/MEI_consensus_seqs.fa
meis : TRUE
meiScore : 50
minDelLen : 50
nCluster : 5
no.vcf : TRUE
outFilePrefix : /gpfs/gsfs8/users/apptest2/SCRAMBLE_TEST/sample.mei.txt
pctAlign : 90
polyAdist : 100
polyAFrac : 0.75
[...output truncated...]
Sample had 0 MEI(s)
Done analyzing MEIs
Sample had 0 deletions
Done analyzing deletions
Warning message:
In predict.BLAST(bl, seq, BLAST_args = "-dust no") :
  BLAST did not return a match!

Please remember to exit this new shell when you are finished with your session.
Singularity> exit
exit
[user@cn3148]$
2) Alternatively, you can run the Rscript or other command(s) directly from the Linux shell, but in this case the command(s) must be preceded by scramble. For example:
[user@cn3148 ~]$ scramble Rscript --vanilla /app/cluster_analysis/bin/SCRAMble.R \
    --out-name ${PWD}/sample.mei.txt \
    --cluster-file ${PWD}/sample_cluster.txt \
    --install-dir /app/cluster_analysis/bin \
    --mei-refs /app/cluster_analysis/resources/MEI_consensus_seqs.fa \
    --ref /app/validation/test.fa \
    --eval-dels \
    --eval-meis \
    --no-vcf

Exit the interactive shell:
[user@cn3148 ~]$ exit
exit
salloc.exe: Relinquishing job allocation 49998864
[user@biowulf ~]$
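The second invocation style, where each command is preceded by scramble, is the easier one to script. The sketch below wraps the example command in a small shell function so it can be applied to more than one cluster file. It is an illustration under stated assumptions rather than a supported workflow: run_scramble and DRY_RUN are hypothetical names, the input files are assumed to follow the sample_cluster.txt naming of the sample data, and the default reference is the test reference used in the session above. All Rscript options are copied from that session.

```shell
# Hypothetical wrapper (names are illustrative, not part of Scramble).
# Usage: run_scramble <cluster_file> [reference_fasta]
# The body runs in a subshell so its variables do not leak into the caller.
run_scramble() (
    cluster_file="$1"
    # Assumption: fall back to the test reference from the sample session
    ref="${2:-/app/validation/test.fa}"
    # sample_cluster.txt -> sample.mei.txt, mirroring the sample session
    out_name="${cluster_file%_cluster.txt}.mei.txt"

    # Build the argument list exactly as in the example command
    set -- scramble Rscript --vanilla /app/cluster_analysis/bin/SCRAMble.R \
        --out-name "$out_name" \
        --cluster-file "$cluster_file" \
        --install-dir /app/cluster_analysis/bin \
        --mei-refs /app/cluster_analysis/resources/MEI_consensus_seqs.fa \
        --ref "$ref" \
        --eval-dels --eval-meis --no-vcf

    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"    # print the command instead of running it
    else
        "$@"
    fi
)
```

With the Scramble module loaded, a loop such as `for f in "$PWD"/*_cluster.txt; do run_scramble "$f"; done` would process every cluster file in the current directory. Setting DRY_RUN=1 first prints each command without executing it, which is a cheap way to check the paths.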