Allocate an interactive session and run the program. Sample session:
[user@biowulf]$ sinteractive --cpus-per-task=16 --mem=10g
salloc.exe: Pending job allocation 46116226
salloc.exe: job 46116226 queued and waiting for resources
salloc.exe: job 46116226 has been allocated resources
salloc.exe: Granted job allocation 46116226
salloc.exe: Waiting for resource configuration
salloc.exe: Nodes cn3144 are ready for job
[user@cn3144 ~]$ module load hmmer
[user@cn3144 ~]$ hmmsearch --cpu $SLURM_CPUS_PER_TASK /usr/local/apps/hmmer/tutorial/globins4.hmm /fdb/fastadb/nr.fas
# hmmsearch :: search profile(s) against a sequence database
# HMMER 3.1b2 (February 2015); http://hmmer.org/
# Copyright (C) 2015 Howard Hughes Medical Institute.
# Freely distributed under the GNU General Public License (GPLv3).
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# query HMM file: /usr/local/apps/hmmer/tutorial/globins4.hmm
# target sequence database: /fdb/fastadb/nr.fas
# number of worker threads: 16
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Query: globins4 [M=149]
[...]
Alignments for each domain:
== domain 1 score: 18.2 bits; conditional E-value: 0.0014
globins4 41 qefFekFkdLstedelkksadvkkHgkkvldAlsdalakld..ekleaklkdLselHakklkvdpkyfkllsevlvdvlaarlpkeftadvqaal 133
q++F++ +L+ ++ + g+ + +A+++ +++d + l ++++ ++++H ++++++ ++++++++ l+++l + + ft dv+ a
WP_087017392.1 30 QRMFDHNPELKDIFNMSH-QRTGRQGVALFEAVAAYAKNIDnlGALTTAVERIAHKH-TSFNIQAEHYQIVGHHLIETLRELASDAFTKDVEEAW 122
677777777873333333.3345679999***********87889999*********.58*******************************9886 PP
globins4 134 e 134
WP_087017392.1 123 T 123
5 PP
Internal pipeline statistics summary:
-------------------------------------
Query model(s): 1 (149 nodes)
Target sequences: 145198384 (53182534605 residues searched)
Passed MSV filter: 4463369 (0.0307398); expected 2903967.7 (0.02)
Passed bias filter: 3672241 (0.0252912); expected 2903967.7 (0.02)
Passed Vit filter: 255552 (0.00176002); expected 145198.4 (0.001)
Passed Fwd filter: 17247 (0.000118782); expected 1452.0 (1e-05)
Initial search space (Z): 145198384 [actual number of targets]
Domain search space (domZ): 10862 [number of targets reported over threshold]
# CPU time: 1306.00u 76.30s 00:23:02.30 Elapsed: 00:11:03.54
# Mc/sec: 11942.31
[user@cn3144 ~]$ exit
salloc.exe: Relinquishing job allocation 46116226
[user@biowulf ~]$
Create a batch input file (e.g. HMMER.sh). The following example uses the query sequence set globins4.hmm in the example directory, run against the NCBI nr protein database.
#!/bin/bash set -e module load hmmer hmmsearch --cpu $SLURM_CPUS_PER_TASK /usr/local/apps/hmmer/tutorial/globins4.hmm /fdb/fastadb/nr.fas
Submit this job using the Slurm sbatch command.
sbatch --cpus-per-task=16 --mem=10g HMMER.sh
Create a swarmfile (e.g. HMMER.swarm). For example:
hmmsearch --cpu $SLURM_CPUS_PER_TASK file1.fas /fdb/fastadb/nr.fas hmmsearch --cpu $SLURM_CPUS_PER_TASK file2.fas /fdb/fastadb/nr.fas hmmsearch --cpu $SLURM_CPUS_PER_TASK file3.fas /fdb/fastadb/nr.fas [...]
Submit this job using the swarm command.
swarm -f HMMER.swarm -g 10 -t 32 --module hmmerwhere
| -g # | Number of Gigabytes of memory required for each process (1 line in the swarm command file) |
| -t # | Number of threads/CPUs required for each process (1 line in the swarm command file). |
| --module hmmer | Loads the HMMER module for each subjob in the swarm |